Persian Phoneme Recognition using Long Short-Term Memory Neural Network

سال انتشار: 1395
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 707

فایل این مقاله در 5 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

ICIKT08_046

تاریخ نمایه سازی: 5 بهمن 1395

چکیده مقاله:

Recently Recurrent Neural Networks (RNNs) have shown impressive performance in sequence classification tasks. In this paper we apply Long Short-Term Memory (LSTM) network on Persian phoneme recognition. For years Hidden Markov Model (HMM) was the dominant technique in speech recognition system but after introducing LSTM, RNNs outperformed HHM-based methods. We apply LSTM and deep LSTM on FARSDAT speech database and find that both LSTM and deep LSTM outperforms HMM in Persian phoneme recognition. Our evaluation show that deep LSTM achieves 17.55% error in FARSDAT phoneme recognition on test set which to our knowledge is the best recorded result.

کلیدواژه ها:

Long Short-Term Memory (LSTM) ، Farsdat ، Persian Speech Recognition ، Connectionist Temporal Classification (CTC)

نویسندگان

Mohammad Daneshvar

Faculty of New Sciences and Technologies University of Tehran Tehran, Iran

Hadi Veisi

Faculty of New Sciences and Technologies University of Tehran Tehran, Iran