Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

سال انتشار: 1394
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 357

فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JCR-7-1_002

تاریخ نمایه سازی: 23 دی 1396

چکیده مقاله:

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unitselection speech synthesis and statistical parametric speech synthesis are two dominant speech synthesizer techniques. The naturalness is the main challenge of all speech synthesis approaches. The Intonation, speech style and emotional state are included in naturalness factor and all of them are considered as suprasegmental features. Equipped synthesized speech with paralinguistic information is more believable from the perceptual aspect. Prosody information plays an important role on the synthesized speech quality of text to speech systems. The first purpose of modern speech synthesizer systems is text to speech conversion and the second purpose is transferring the emotional states of text in the voice form. In this paper two main speech synthesis approaches and their challenges are investigated in detail.

کلیدواژه ها:

نویسندگان

Mohammad Savargiv

Faculty of Computer and Information Technology Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran

Azam Bastanfard

Faculty of Media Engineering, Islamic Republic of Iran Broadcast University, Tehran, Iran