胡月, 郭英, 张东伟, 侯文林, 李红光. 基于CV音节的高效语种识别方法[J]. 微电子学与计算机, 2015, 32(10): 26-30. DOI: 10.19304/j.cnki.issn1000-7180.2015.10.006
引用本文: 胡月, 郭英, 张东伟, 侯文林, 李红光. 基于CV音节的高效语种识别方法[J]. 微电子学与计算机, 2015, 32(10): 26-30. DOI: 10.19304/j.cnki.issn1000-7180.2015.10.006
HU Yue, GUO Ying, ZHANG Dong-wei, HOU Wen-lin, LI Hong-guang. An Efficient Language Identification Method Based on CV-syllables[J]. Microelectronics & Computer, 2015, 32(10): 26-30. DOI: 10.19304/j.cnki.issn1000-7180.2015.10.006
Citation: HU Yue, GUO Ying, ZHANG Dong-wei, HOU Wen-lin, LI Hong-guang. An Efficient Language Identification Method Based on CV-syllables[J]. Microelectronics & Computer, 2015, 32(10): 26-30. DOI: 10.19304/j.cnki.issn1000-7180.2015.10.006

基于CV音节的高效语种识别方法

An Efficient Language Identification Method Based on CV-syllables

  • 摘要: 为了快速有效地识别语种,提出了基于元音起始点(Vowel Onset Points,VOP)检测的CV音节划分法,并据此研究了一种新的基于CV音节的语种识别方法.首先,给出一种能有效避免语音结束点错判的双边双门限端点检测法提取语音段;然后采用线性预测残差(Linear Prediction Residue Error,LPRE)检测语音段中的VOP,从而划分出CV音节;最后,提取各CV音节的特征矢量并利用支持向量机(Support Vector Machine,SVM)模型实现语种识别.通过对英语、汉语普通话及粤语三种语言的识别实验表明,所提VOP检测法可确保CV音节的精确划分;新方法识别率高,且识别结果对CV音节长度不敏感,模型训练时间短,可实现语种的高效识别.

     

    Abstract: In order to identify language rapidly and effectively, a method for CV-syllable extraction based on the Vowel Onset Points (VOP) detection is proposed, on this basis, a new method for language identification based on CV-syllables is researched. First, a double-threshold endpoints detection from both sides is given to get speech segment, which could avoid great risk of wrong decision. Second, VOP are detected by Linear Prediction Residue Error (LPRE) and the CV-syllables from the speech segment are obtained exactly. At last, feature vectors for each CV-syllable are extracted. The Support Vector Machine (SVM) is adopted to realize language identification. The simulation experiment for English, Mandarin and Cantonese shows that VOP detection makes sure the precision of each CV-syllable extraction. The new method has the high correct response rate. The change of CV-syllable length almost has no effect on identification results. The training time for model is so short that language identification could complete efficiently.

     

/

返回文章
返回