Recognition of Persian Digits from Zero to Nine using Acoustic Images based on Mel Capstrom Coefficients and Neural Network

سال انتشار: 1400
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 117

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_IJMEC-11-42_003

تاریخ نمایه سازی: 28 تیر 1402

چکیده مقاله:

this article, first, the database of zero to nine Persian digits have been recorded and collected using the voices of ۵۰ men and women in the environment . In the proposed method, we first frame the preprocessed signal and then go through the improved window; in the next step, it enters the Fourier transform block .Now the Fourier transform spectrum is given to the Gaussian filter bank, and then the output power spectrum of the Gaussian bank filter is passed through Root Function, and then by applying cosine transform to compress the components, Mel-Capstrom coefficients are obtained .Finally, the acoustic image is formed as a matrix containing the temporal and frequency features of the speech signal using a two-dimensional inverse Fourier transform of the Mel Capstrom coefficient matrix .To classify and test the data, the features obtained are trained using an improved algorithm in the perceptron neural network with two hidden layers, and the recognition rate is reported at the end .The test results for the signal to different noises show the improvement of the noise signal detection rate by the proposed method, so that the recognition rate of the proposed algorithm without noise is ۹۸.۸۵.

نویسندگان

Seyed Mehdi Hoseini

Department of Computer Science, University of Mazandaran, Babolsar, Iran