Bibliometric Analysis of Mixed Text Using Transformer-Based Architecture in Africa

  • Sello Prince Sekwatlakwatla North-West University
  • Vusumuzi Malele North-West University
  • Phetole Simon Ramalepe North-West University
  • Thipe Modipa North-West University

Keywords: Text Generation, Natural Language Processing, Bibliometric Analysis

Abstract

Deep learning techniques based on neural networks have been developed for text creation, a critical sub-task of natural language generation that aims to create human-readable content. Natural language processing (NLP) tasks are utilized to recognize speech in code-mixed comments on social media platforms like Facebook and Twitter, which enable users to interact and exchange ideas, views, status updates, pictures, and videos with people all over the world. Although NLP is widely investigated in the world and Africa is home to approximately 3,000 languages, many of which are derived from significant language families, in this regard, there are challenges that Africa faces in Natural Language Processing (NLP), especially mixed text using transformer-based architecture. The purpose of this study is to investigate the prevalence of mixed text using transformer-based architecture in Africa. Bibliometric analysis was used to assess natural language and mixed text in Africa, utilizing transformer-based architecture. show that sentiment analysis is the holistic tool that is used for mixed text using transformers, where social media, deep learning, codes, computational linguistics, and social networking are critical tools in generating human-like quality text. Therefore, this study proposes artificial intelligence, artificial neutral networks, and neural networks, as well as a prediction to estimate the technique or fluctuation as the method for mixed text using transformer-based architecture in Africa. This research sets the path for future studies that use mixed text using transformer-based architecture in Africa

Downloads

Download data is not yet available.

References

Y.Shen,X.Zhao" Reinforcement Learning in Natural Language Processing: A Survey"MLNLP : Proceedings of the 2023 6th International Conference on Machine Learning and Natural Language Processing.2023, pp 84–90.[Online].Available .https://doi.org/10.1145/3639479.3639496

M. Anand, K.B.Sahay, M.A.Ahmed, D.Sultan, R.R.Chandan, B.Singh."Deep learning and natural language processing in computation for offensive language detection in online social networks by feature selection and ensemble classification techniques" Journal of Theoretical Computer Science Vol 943, Pp 203-218,2023,,[Online].Available.https://doi.org/10.1016/j.tcs.2022.06.020

H. Adel, N. T. Vu, F. Kraus, T. Schlippe, H. Li and T. Schultz, "Recurrent neural network language modeling for code switching conversational speech," 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 2013, pp. 8411-8415, doi: 10.1109/ICASSP.2013.6639306.

S.Garg, T.Parekh, & P. Jyothi,”Code-switched Language Models Using Dual RNNs and Same-Source Pretraining”, 2018. [Online]. Available: http://arxiv.org/abs/1809.01962

D.Gupta,A. Ekbal, & P. Bhattacharyya,” Findings of the Association for Computational Linguistics A Semi-supervised Approach to Generate the Code-Mixed Text using Pre-trained Encoder and Transfer Learning “In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing:’, 2020,pp. 2267–2280.

H.Li,Y. Wang, Y.Liu, D. Tang, Z.Lei, & W.Li, “An Augmented Transformer Architecture for Natural Language Generation Tasks”, ‘In 2019 International Conference on Data Mining Workshops (ICDMW). IEEE’, pp. 1–7.

L.Yu, W. Zhang, J.Wang, & Y. Yu, “SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient”, in’In Proceedings of the AAAI conference on artificial intelligence.’, Vol. 31, pp. 2852–2858,2017

P. Carroll, B. Singh, E. Mangina,"Uncovering gender dimensions in energy policy using Natural Language Processing",journal of Renewable and Sustainable Energy Reviews Vol.193,2024,[Online].Available. https://doi.org/10.1016/j.rser.2024.114281

A.P Costa, R.R. Seabra , M.A. César, A.D.Santos"Manufacturing process encoding through natural language processing for prediction of material properties" journal of Computational Materials Science Vol.237,2024,[Online].Available. https://doi.org/10.1016/j.commatsci.2024.112896

. Y.Song,"Public cloud network intrusion and internet legal supervision based on abnormal feature detection", Journal of Computers and Electrical Engineering. Vol, no.112, 2023, [Online].Available. https://doi.org/10.1016/j.compeleceng.2023.109015

. J.J. Cavallo,I.O.Santo, J.L. Mezrich, H.P. Forman,"Clinical Implementation of a Combined Artificial Intelligence and Natural Language Processing Quality Assurance Program for Pulmonary Nodule Detection in the Emergency Department SettingJournal of the American College of Radiology.Vol/20, Pp 438-445,2023.[Online].Available.https://doi.org/10.1016/j.jacr.2022.12.016

. W.Lu,"Application cost of intelligent intrusion detection in medical logistics management under public cloud environment", Journal of Computers and Electrical Engineering, vol, no.112, 2023, [Online].Available.https://doi.org/10.1016/j.compeleceng.2023.109014

. J. Royer, E.Q. Wu, R. Ayyagari, S. Parravano, U. Pathare, M. Kisielinska,"MSR131 Prospects for Automation of Systemic Literature Reviews (SLRs) With Artificial Intelligence and Natural Language Processing"journal of Value in Health.Vol 26,2023, Pp 418,[Online].Available: https://doi.org/10.1016/j.jval.2023.09.2190

. S.Pham et al,Evaluation of Shared Resource Allocation Using SAND for ABR Streaming,ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 16, Issue 2s,Vol.70,2020, [Online].Available.https://doi.org/10.1145/3388926

. G.Takawane, A.Phaltankar, V.Patwardhan, A.Patil, R.Joshi, M.S.Takalikar,"Language augmentation approach for code-mixed text classification" journal of Natural Language Processing Journal .Vol 5, 2023, [Online].Available: https://doi.org/10.1016/j.nlp.2023.100042

J.Suzuki, H.Zen, H.Kazawa,"Extracting representative subset from extensive text data for training pre-trained language models",journal of Information Processing & ManagementVol 60, 2023,[Online].Available.https://doi.org/10.1016/j.ipm.2022.103249

. J.Cleland-Huang et al,"Extending MAPE-K to support human-machine teaming"SEAMS '22: Proceedings of the 17th Symposium on Software Engineering for Adaptive and Self-Managing Systems,Vol.131,2022, [Online].Available. https://doi.org/10.1145/3524844.3528054

. W.Nam, B.Jang."A survey on multimodal bidirectional machine learning translation of image and natural language processing" Journal of Expert Systems with Applications Vol 235,2024,[Online].Available. https://doi.org/10.1016/j.eswa.2023.121168

L.F. Pellicer, T.M.Ferreira, A.H.R.Costa"Data augmentation techniques in natural language processing", Journal of Applied Soft Computing,Vol 132, 2023,[Online].Available. https://doi.org/10.1016/j.asoc.2022.109803

Published
2024-07-31
How to Cite
Sekwatlakwatla, S. P., Malele, V., Ramalepe, P. S., & Modipa, T. (2024). Bibliometric Analysis of Mixed Text Using Transformer-Based Architecture in Africa. Indonesian Journal of Data and Science, 5(2), 115-120. https://doi.org/10.56705/ijodas.v5i2.131