Kurdish Speaker Identification Based On One Dimensional Convolutional Neural Network

سال انتشار: 1398
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 457

فایل این مقاله در 10 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

ICNS04_060

تاریخ نمایه سازی: 8 تیر 1398

چکیده مقاله:

Speech recognition is a vital in human- computer interaction area. In this paper, two different models are implemented for speaker identification which are 1 D convolutional neural network (CNN) and feature based model. In the experimental, the three spectral based features including Mel frequency Cepstral Coefficient (MFCC), Linear Prediction Code (LPC) and Local Binary pattern (LBP) are fed to Support Vector Machine (SVM) and K Nearest Neighbor (kNN). results are shown the MFCC is the best feature among the others. MFCC is then extracted for framed signal and used to both proposed models. The result shows that there is not significant different between the accuracy of the CNN and kNN. While SVM is less sensitive compared with others.

نویسندگان

Zrar KH.Abdul

Department of applied computer, Charmo University, Sulaymaniyah, Iraq and Department of computer, Halabja University, Sulaymaniyah, Iraq