An Efficient Convolutional Neural Network Classification Model for Several Sign Language Alphabets

Faculty Engineering Year: 2023
Type of Publication: ZU Hosted Pages:
Authors:
Journal: International Journal of Advanced Computer Science and Applications (The Science and Information Organization (SAI Volume:
Keywords : , Efficient Convolutional Neural Network Classification Model    
Abstract:
Although deaf people represent over 5% of the world’s population, according to what the World Health Organization stated in May 2022, they suffer from social and economic marginalization. One way to improve the lives of deaf people is to try to make communication between them and others easier. Sign language, the means through which deaf people can communicate with other people, can benefit from modern techniques in machine learning. In this study, several convolutional neural networks (CNN) models are designed to develop an efficient model, in terms of accuracy and computational time, for the classification of different signs. This research presents a methodology for developing an efficient CNN architecture from scratch to classify multiple sign language alphabets, which has numerous advantages over other contemporary CNN models in terms of prediction time and accuracy. This framework analyses the effect of varying CNN hyper-parameters, such as kernel size, number of layers, and number of filters in each layer, and picks the ideal parameters for CNN model construction. In addition, the suggested CNN architecture operates directly on unprocessed data without the need for preprocessing to generalize it across other datasets. In addition, the capacity of the model to generalize to diverse sign languages is rigorously evaluated using three distinct sign language alphabets and five datasets, namely, Arabic (ArSL), two American English (ASL), Korean (KSL), and the combination of Arabic and American datasets. The proposed CNN architecture (SL-CNN) outperforms state-of-the-art CNN models and traditional machine learning models achieving an accuracy of 100%, 98.47%, 100%, and 99.5% for English, Arabic, Korean, and combined Arabic-English alphabets, respectively. The prediction or inference time of the model is about three milliseconds on average, making it suitable for real-time applications. So, in the future, it is easy to turn this model into a mobile application.
   
     
 
       

Author Related Publications

  • Ibrahiem Elsayed Mohamed Zedan, "Improved subspace identication with prior information using constrained least-squares", IET, 2011 More
  • Ibrahiem Elsayed Mohamed Zedan, "Enhancing Face Recognition using Average per region", Published by Foundation of Computer Science, New York, USA, 2013 More
  • Ibrahiem Elsayed Mohamed Zedan, "Face Recognition in Gray Images by Using XOR", Military Technical College, Kobry Elkobbah, Cairo, Egypt, 2013 More
  • Ibrahiem Elsayed Mohamed Zedan, "Simple tuning rules of PID controllers for integrator/dead time processes", ICGST, 2005 More
  • Ibrahiem Elsayed Mohamed Zedan, "Improved Low Energy Adaptive Clustering Hierarchy in Wireless Sensor Network Routing Protocols", International Journal of Engineering and Technology, 2018 More

Department Related Publications

  • Mira Magdy Sobhy Suliman, "COMPARISON BETWEEN HAAR WAVELET TRANSFORM, DCT AND A PROPOSED COLUMN-MEAN-METHOD BASED IRIS ENCODERS", جامعة الزقازيق-المجلة العلمية, 2014 More
  • Mohammed Atef Meselhy AbdulHamid, "Hybrid Named Entity Recognition - Application to Arabic Language", IEEE, 2015 More
  • Mohammed Nour Abdelgawad Ahmed, "Using Industrial Actuators for Rapid Development of Electric Car Applications", WFB Wirtschaftsförderung Bremen, 2014 More
  • Mohammed Nour Abdelgawad Ahmed, "A simulation-based design of extraterrestrial six-legged robot system", IEEE, 2009 More
  • Sanaa Fekry Abdelsadek Hassanien Marzok, "Supervised Classification of Cancers Based on Copy Number Variation", Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2018, 2018 More
Tweet