Bibliometric Analysis of Mixed Text Using Transformer-Based Architecture in Africa
Abstract
Deep learning techniques based on neural networks have been developed for text creation, a critical sub-task of natural language generation that aims to create human-readable content. Natural language processing (NLP) tasks are utilized to recognize speech in code-mixed comments on social media platforms like Facebook and Twitter, which enable users to interact and exchange ideas, views, status updates, pictures, and videos with people all over the world. Although NLP is widely investigated in the world and Africa is home to approximately 3,000 languages, many of which are derived from significant language families, in this regard, there are challenges that Africa faces in Natural Language Processing (NLP), especially mixed text using transformer-based architecture. The purpose of this study is to investigate the prevalence of mixed text using transformer-based architecture in Africa. Bibliometric analysis was used to assess natural language and mixed text in Africa, utilizing transformer-based architecture. show that sentiment analysis is the holistic tool that is used for mixed text using transformers, where social media, deep learning, codes, computational linguistics, and social networking are critical tools in generating human-like quality text. Therefore, this study proposes artificial intelligence, artificial neutral networks, and neural networks, as well as a prediction to estimate the technique or fluctuation as the method for mixed text using transformer-based architecture in Africa. This research sets the path for future studies that use mixed text using transformer-based architecture in Africa
Downloads
References
Y.Shen,X.Zhao" Reinforcement Learning in Natural Language Processing: A Survey"MLNLP : Proceedings of the 2023 6th International Conference on Machine Learning and Natural Language Processing.2023, pp 84–90.[Online].Available .https://doi.org/10.1145/3639479.3639496
M. Anand, K.B.Sahay, M.A.Ahmed, D.Sultan, R.R.Chandan, B.Singh."Deep learning and natural language processing in computation for offensive language detection in online social networks by feature selection and ensemble classification techniques" Journal of Theoretical Computer Science Vol 943, Pp 203-218,2023,,[Online].Available.https://doi.org/10.1016/j.tcs.2022.06.020
H. Adel, N. T. Vu, F. Kraus, T. Schlippe, H. Li and T. Schultz, "Recurrent neural network language modeling for code switching conversational speech," 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 2013, pp. 8411-8415, doi: 10.1109/ICASSP.2013.6639306.
S.Garg, T.Parekh, & P. Jyothi,”Code-switched Language Models Using Dual RNNs and Same-Source Pretraining”, 2018. [Online]. Available: http://arxiv.org/abs/1809.01962
D.Gupta,A. Ekbal, & P. Bhattacharyya,” Findings of the Association for Computational Linguistics A Semi-supervised Approach to Generate the Code-Mixed Text using Pre-trained Encoder and Transfer Learning “In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing:’, 2020,pp. 2267–2280.
H.Li,Y. Wang, Y.Liu, D. Tang, Z.Lei, & W.Li, “An Augmented Transformer Architecture for Natural Language Generation Tasks”, ‘In 2019 International Conference on Data Mining Workshops (ICDMW). IEEE’, pp. 1–7.
L.Yu, W. Zhang, J.Wang, & Y. Yu, “SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient”, in’In Proceedings of the AAAI conference on artificial intelligence.’, Vol. 31, pp. 2852–2858,2017
P. Carroll, B. Singh, E. Mangina,"Uncovering gender dimensions in energy policy using Natural Language Processing",journal of Renewable and Sustainable Energy Reviews Vol.193,2024,[Online].Available. https://doi.org/10.1016/j.rser.2024.114281
A.P Costa, R.R. Seabra , M.A. César, A.D.Santos"Manufacturing process encoding through natural language processing for prediction of material properties" journal of Computational Materials Science Vol.237,2024,[Online].Available. https://doi.org/10.1016/j.commatsci.2024.112896
. Y.Song,"Public cloud network intrusion and internet legal supervision based on abnormal feature detection", Journal of Computers and Electrical Engineering. Vol, no.112, 2023, [Online].Available. https://doi.org/10.1016/j.compeleceng.2023.109015
. J.J. Cavallo,I.O.Santo, J.L. Mezrich, H.P. Forman,"Clinical Implementation of a Combined Artificial Intelligence and Natural Language Processing Quality Assurance Program for Pulmonary Nodule Detection in the Emergency Department SettingJournal of the American College of Radiology.Vol/20, Pp 438-445,2023.[Online].Available.https://doi.org/10.1016/j.jacr.2022.12.016
. W.Lu,"Application cost of intelligent intrusion detection in medical logistics management under public cloud environment", Journal of Computers and Electrical Engineering, vol, no.112, 2023, [Online].Available.https://doi.org/10.1016/j.compeleceng.2023.109014
. J. Royer, E.Q. Wu, R. Ayyagari, S. Parravano, U. Pathare, M. Kisielinska,"MSR131 Prospects for Automation of Systemic Literature Reviews (SLRs) With Artificial Intelligence and Natural Language Processing"journal of Value in Health.Vol 26,2023, Pp 418,[Online].Available: https://doi.org/10.1016/j.jval.2023.09.2190
. S.Pham et al,Evaluation of Shared Resource Allocation Using SAND for ABR Streaming,ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 16, Issue 2s,Vol.70,2020, [Online].Available.https://doi.org/10.1145/3388926
. G.Takawane, A.Phaltankar, V.Patwardhan, A.Patil, R.Joshi, M.S.Takalikar,"Language augmentation approach for code-mixed text classification" journal of Natural Language Processing Journal .Vol 5, 2023, [Online].Available: https://doi.org/10.1016/j.nlp.2023.100042
J.Suzuki, H.Zen, H.Kazawa,"Extracting representative subset from extensive text data for training pre-trained language models",journal of Information Processing & ManagementVol 60, 2023,[Online].Available.https://doi.org/10.1016/j.ipm.2022.103249
. J.Cleland-Huang et al,"Extending MAPE-K to support human-machine teaming"SEAMS '22: Proceedings of the 17th Symposium on Software Engineering for Adaptive and Self-Managing Systems,Vol.131,2022, [Online].Available. https://doi.org/10.1145/3524844.3528054
. W.Nam, B.Jang."A survey on multimodal bidirectional machine learning translation of image and natural language processing" Journal of Expert Systems with Applications Vol 235,2024,[Online].Available. https://doi.org/10.1016/j.eswa.2023.121168
L.F. Pellicer, T.M.Ferreira, A.H.R.Costa"Data augmentation techniques in natural language processing", Journal of Applied Soft Computing,Vol 132, 2023,[Online].Available. https://doi.org/10.1016/j.asoc.2022.109803
Copyright (c) 2024 Indonesian Journal of Data and Science
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
License and Copyright Agreement
In submitting the manuscript to the journal, the authors certify that:
- They are authorized by their co-authors to enter into these arrangements.
- The work described has not been formally published before, except in the form of an abstract or as part of a published lecture, review, thesis, or overlay journal.
- The work is not under consideration for publication elsewhere.
- The work has been approved by all the author(s) and by the responsible authorities – tacitly or explicitly – of the institutes where the work has been carried out.
- They secure the right to reproduce any material that has already been published or copyrighted elsewhere.
- They agree to the following license and copyright agreement.
Copyright
Authors who publish with Indonesian Journal of Data and Science agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. (CC BY-NC 4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.