杨淑莹, 赵敏, 郭杨杨, 田迪. 基于改进的EfficientDet的手语识别算法[J]. 微电子学与计算机, 2022, 39(2): 84-91. DOI: 10.19304/J.ISSN1000-7180.2021.0751
引用本文: 杨淑莹, 赵敏, 郭杨杨, 田迪. 基于改进的EfficientDet的手语识别算法[J]. 微电子学与计算机, 2022, 39(2): 84-91. DOI: 10.19304/J.ISSN1000-7180.2021.0751
YANG Shuying, ZHAO Min, GUO Yangyang, TIAN Di. Sign language recognition algorithm based on improved EfficientDet[J]. Microelectronics & Computer, 2022, 39(2): 84-91. DOI: 10.19304/J.ISSN1000-7180.2021.0751
Citation: YANG Shuying, ZHAO Min, GUO Yangyang, TIAN Di. Sign language recognition algorithm based on improved EfficientDet[J]. Microelectronics & Computer, 2022, 39(2): 84-91. DOI: 10.19304/J.ISSN1000-7180.2021.0751

基于改进的EfficientDet的手语识别算法

Sign language recognition algorithm based on improved EfficientDet

  • 摘要: 手语识别在聋哑人与正常人的交流中起至关重要的作用.为了解决传统手语识别算法由于手部特征多尺度造成的手势特征提取不充分、特征融合丢失细节信息等问题,提出了基于改进的EfficientDet-D0的手语检测识别算法.该算法首先在EfficientDet-D0的主干网络中增加了空间注意力机制,能更加准确的定位图像中的手部特征;其次在特征融合网络中,为了描述下采样丢失了的高频细节信息,利用拉普拉斯金字塔的思想,在自上而下的融合路径中将细节特征图进行融合,并增加跨级连接,使不同分辨率的特征信息得到充分利用,从而使获取的高级特征图信息更加丰富;最后使用迁移学习技术和Adam优化器训练整个网络.实验结果表明,该模型能够在各种背景下快速准确的识别出手语动作,最终准确率达到94.1%,比传统算法具有更高的准确率和更强的鲁棒性,同时基于该算法设计了手语双向翻译仿真网站,实际应用性强.

     

    Abstract: Sign language recognition plays a vital role in the communication between deaf-mute and normal people. In order to solve the problem of insufficient gesture feature extraction, caused by the multi-scale characteristics of hand features, and loss of detail information in feature fusion of traditional sign language recognition algorithm, an improved algorithm based on EfficientDet-D0 is proposed. To be specific, this algorithm adds spatial attention mechanism in the backbone of EfficientDet-D0, making it be capable of locating the hand features effectively. Then, in the feature fusion network, the idea of Laplacian Pyramid and the cross-level connection is used, which enable it to fuse detailed feature maps and make full use of features of resolutions. So that, the information of high-level feature map is richer, and the high-frequency detail information lost by downsampling is fully described. Moreover, the transfer learning technology and Adam optimizer are used to train the whole network. Experiments show that this new algorithm can quickly and accurately identify different sign language actions in various backgrounds. The final accuracy rate reaches 94.1%. In a word, it has higher accuracy rate and stronger robustness than traditional algorithms. And, a two-way translation application is designed based on the algorithm, which is practical.

     

/

返回文章
返回