Recognition of Speech Isolated Words Based On Pyramid Phonetic Bag of Words Model Display and Kernel-Based Support Vector Machine Classifier Model
سال انتشار: 1396
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 403
فایل این مقاله در 13 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
COMCONF05_190
تاریخ نمایه سازی: 21 اردیبهشت 1397
چکیده مقاله:
This study aimed to improve the classification of individual (isolated) words, and specifically, the numbers from one to twenty. In this study, a strong model was suggested to gain a unified view of voice. It is based on the idea of phonetic bag for voice that has been developed into a pyramid state. The pyramid idea can model temporal relationships. One of the problems of Support Vector Machine to classify words is its inability to model temporal relationships unlike hidden Markov models. Using the BOW-based pyramid idea in the extraction of the display containing temporal information of voice, the SVM can be given the capability of considering the time relationships of speech frames. One of the main advantages of Support Vector Machine model is its fewer parameters than the hidden Markov model. As the experiments results have shown, it has much higher accuracy than the hidden Markov model in applications such as the recognition of single words, where the data set volume is limited. Using the pyramid BOW idea, the accuracy of SVM-based method can be increased as 20% compared to previous methods.
کلیدواژه ها:
Speech recognition ، Isolated words recognition ، Classification of speechIntroduction ، Display of phonetic bag of words ، Support vector machine
نویسندگان
Sodabeh Salehi Rekavandi
Department of Computer Engineering, Islamic Azad University, Ferdows, Iran
Hamid Reza Ghaffari
Department of Computer Engineering, Islamic Azad University, Ferdows, Iran
Maryam Davodpour
Department of Computer Engineering, Islamic Azad University, Ferdows, Iran