Integration of Yolov8 And Instance Segmentation in The Chinese Sign Language (CSL) Recognition System

Mikel Ega Wijaya; Anik Nur  Handayani

doi:10.56705/ijodas.v6i2.247

Authors

Mikel Ega Wijaya Universitas Negeri Malang
Anik Nur Handayani Universitas Negeri Malang

DOI:

https://doi.org/10.56705/ijodas.v6i2.247

Keywords:

Chinese Sign Language, YOLOv8, Instance Segmentation, Hand Gesture Recognition, Deep Learning

Abstract

This research aims to develop an advanced recognition system for Chinese Sign Language (CSL) by integrating YOLOv8 and instance segmentation techniques. Communication through sign language is essential for the deaf community, and although CSL has been standardized in China, recognizing complex hand movements remains a significant challenge. YOLOv8 is employed for real-time object detection, while instance segmentation is used to provide more detailed analysis of hand gestures. This integration seeks to improve hand gesture recognition under varying lighting and background conditions, which is crucial for more effective communication between the deaf community and the wider society. The study evaluates the system’s performance using common metrics such as Mean Average Precision (mAP), precision, recall, and F1-score. The findings indicate that the non-segmentation model performs better than the segmentation model in terms of precision, recall, and mAP, especially when trained with a larger dataset ratio. The non-segmentation model provides faster and more accurate detection, while the segmentation model, despite using the same amount of data, shows potential for more detailed recognition of gestures. Although the segmentation model shows improvements in the F1-score with more detailed accuracy, the non-segmentation model remains superior in overall detection speed and accuracy. This research highlights the importance of integrating YOLOv8 and instance segmentation for improving CSL recognition, with better results on the non-segmentation model for more effective communication for the deaf

Downloads

Download data is not yet available.

References

K. Emmorey, “Ten Things You Should Know About Sign Languages,” Curr Dir Psychol Sci, vol. 32, no. 5, pp. 387–394, Oct. 2023, doi: 10.1177/09637214231173071.

A. N. Handayani, M. I. Akbar, H. Ar-Rosyid, M. Ilham, R. A. Asmara, and O. Fukuda, “Design of SIBI Sign Language Recognition Using Artificial Neural Network Backpropagation,” in 2022 2nd International Conference on Intelligent Cybernetics Technology & Applications (ICICyTA), IEEE, Dec. 2022, pp. 192–197. doi: 10.1109/ICICyTA57421.2022.10038205.

T. N. Fitria, “THE USE OF SIGN LANGUAGE AS A MEDIA FOR DELIVERING INFORMATION ON NATIONAL TELEVISION NEWS BROADCASTS,” ELP (Journal of English Language Pedagogy), vol. 9, no. 1, pp. 118–131, Jan. 2024, doi: 10.36665/elp.v9i1.764.

C. Jo, “Overview of Chinese Sign Language,” Chinese Language and Literature, Dec. 2023, doi: 10.57237/j.cll.2023.05.002.

X. Jiang, S. C. Satapathy, L. Yang, S.-H. Wang, and Y.-D. Zhang, “A Survey on Artificial Intelligence in Chinese Sign Language Recognition,” Arab J Sci Eng, vol. 45, no. 12, pp. 9859–9894, Dec. 2020, doi: 10.1007/s13369-020-04758-2.

H. Wang, Y. Chen, Z. Yang, L. Zhu, Y. Zhao, and T. Tian, “Estimation and projection of the burden of hearing loss in China: findings from the Global Burden of Disease Study 2019,” Public Health, vol. 228, pp. 119–127, Mar. 2024, doi: 10.1016/j.puhe.2024.01.004.

X. Chen, L. Su, J. Zhao, K. Qiu, N. Jiang, and G. Zhai, “Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks,” Electronics (Basel), vol. 12, no. 4, p. 786, Feb. 2023, doi: 10.3390/electronics12040786.

D. Wiryany, S. Natasha, R. Kurniawan, J. I. Komunikasi, and M. Bandung, “PERKEMBANGAN TEKNOLOGI INFORMASI DAN KOMUNIKASI TERHADAP PERUBAHAN SISTEM KOMUNIKASI INDONESIA,” 2022.

- Ridwang, A. A. Ilham, I. Nurtanio, and - Syafaruddin, “Dynamic Sign Language Recognition Using Mediapipe Library and Modified LSTM Method,” Int J Adv Sci Eng Inf Technol, vol. 13, no. 6, pp. 2171–2180, Dec. 2023, doi: 10.18517/ijaseit.13.6.19401.

S. C. Mesbahi, M. A. Mahraz, J. Riffi, and H. Tairi, “Hand Gesture Recognition Based on Various Deep Learning YOLO Models,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 4, 2023, doi: 10.14569/IJACSA.2023.0140435.

Mr. J. Rajasekhar, “UNDERSTANDING YOLO: REAL-TIME OBJECT DETECTION EXPLAINED,” INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT, vol. 08, no. 07, pp. 1–9, Jul. 2024, doi: 10.55041/IJSREM36359.

S. Tyagi, P. Upadhyay, H. Fatima, S. Jain, I. Avinash, and K. Sharma, “American Sign Language Detection using YOLOv5 and YOLOv8,” 2023, doi: 10.21203/rs.3.rs-3126918/v1.

S. Daniels, N. Suciati, and C. Fathichah, “Indonesian Sign Language Recognition using YOLO Method,” IOP Conf Ser Mater Sci Eng, vol. 1077, no. 1, p. 012029, Feb. 2021, doi: 10.1088/1757-899X/1077/1/012029.

Z. Q. Zhao, P. Zheng, S. T. Xu, and X. Wu, “Object Detection with Deep Learning: A Review,” Nov. 01, 2019, Institute of Electrical and Electronics Engineers Inc. doi: 10.1109/TNNLS.2018.2876865.

J. C. Caicedo et al., “Data-analysis strategies for image-based cell profiling,” Nat Methods, vol. 14, no. 9, pp. 849–863, Sep. 2017, doi: 10.1038/nmeth.4397.

K. Maharana, S. Mondal, and B. Nemade, “A review: Data pre-processing and data augmentation techniques,” Global Transitions Proceedings, vol. 3, no. 1, pp. 91–99, Jun. 2022, doi: 10.1016/j.gltp.2022.04.020.

M. Guo et al., “Normal Workflow and Key Strategies for Data Cleaning Toward Real-World Data: Viewpoint,” Interact J Med Res, vol. 12, p. e44310, Sep. 2023, doi: 10.2196/44310.

A. N. Handayani, S. Amaliya, M. I. Akbar, M. Z. Wiryawan, Y. W. Liang, and W. C. Kurniawan, “Hand Keypoint-Based CNN for SIBI Sign Language Recognition,” International Journal of Robotics and Control Systems, vol. 5, no. 2, pp. 813–829, 2025, doi: 10.31763/ijrcs.v5i2.1745.

A. N. Handayani, M. I. Akbar, H. A. Rosyid, M. Arifianto, O. Fukuda, and R. A. Asmara, “K-NN for Hand Keypoints Detection Based on SIBI Standard,” in 2024 International Conference on Electrical and Information Technology (IEIT), IEEE, Sep. 2024, pp. 72–77. doi: 10.1109/IEIT64341.2024.10763365.

A. Onan, “SRL-ACO: A text augmentation framework based on semantic role labeling and ant colony optimization,” Journal of King Saud University - Computer and Information Sciences, vol. 35, no. 7, p. 101611, Jul. 2023, doi: 10.1016/j.jksuci.2023.101611.

R. Wulanningrum, A. N. Handayani, and A. P. Wibawa, “Perbandingan Instance Segmentation Image Pada Yolo8,” Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 11, no. 4, pp. 753–760, Aug. 2024, doi: 10.25126/jtiik.1148288.

G. Li, L. Zhou, Q. Tong, Y. Ding, X. Qi, and H. Liu, “A Data Augmentation Approach to Sentiment Analysis of MOOC Reviews,” International Journal of Advanced Computer Science and Applications, vol. 15, no. 8, 2024, doi: 10.14569/IJACSA.2024.01508122.

A. N. Handayani, T. Andriyanto, D. F. Azizah, M. Z. Wiryawan, and H. A. Rosyid, “Comparison of ResNet-50 and EfficientNet-B0 Method for Classification of Indonesian Sign Language System (SIBI) Using Multi Background Dataset,” in 2024 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM), IEEE, Nov. 2024, pp. 1–6. doi: 10.1109/CENIM64038.2024.10882836.

V. R. Joseph, “Optimal ratio for data splitting,” Statistical Analysis and Data Mining: The ASA Data Science Journal, vol. 15, no. 4, pp. 531–538, Aug. 2022, doi: 10.1002/sam.11583.

Q. H. Nguyen et al., “Influence of Data Splitting on Performance of Machine Learning Models in Prediction of Shear Strength of Soil,” Math Probl Eng, vol. 2021, pp. 1–15, Feb. 2021, doi: 10.1155/2021/4832864.

A. Nurhopipah and U. Hasanah, “Dataset Splitting Techniques Comparison For Face Classification on CCTV Images,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 14, no. 4, p. 341, Oct. 2020, doi: 10.22146/ijccs.58092.

C.-J. Zhang et al., “Evaluation of the YOLO models for discrimination of the alfalfa pollinating bee species,” J Asia Pac Entomol, vol. 27, no. 1, p. 102195, Mar. 2024, doi: 10.1016/j.aspen.2023.102195.

J. Chen, C. Ji, J. Zhang, Q. Feng, Y. Li, and B. Ma, “A method for multi-target segmentation of bud-stage apple trees based on improved YOLOv8,” Comput Electron Agric, vol. 220, p. 108876, May 2024, doi: 10.1016/j.compag.2024.108876.

E. Lee, B. Park, M.-H. Jeon, H. Jang, A. Kim, and S. Lee, “Data augmentation using image translation for underwater sonar image segmentation,” PLoS One, vol. 17, no. 8, p. e0272602, Aug. 2022, doi: 10.1371/journal.pone.0272602.