Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Text-to-speech [synthesis]: Technical Presentation

Text-to-speech [synthesis]: Technical Presentation

LIS 9732 @ UWO // Natural Language Processing

Avatar for AhemNason

AhemNason

March 06, 2013
Tweet

More Decks by AhemNason

Other Decks in Education

Transcript

  1. text analysis and text normalization text normalization (“verbalizing”) numbers (1772

    as date/#/quantifier) abbreviations (Mrs. – Misses) acronyms (H.I.V. –“aitch eye ve”) word segmentation (NATO – “nayto”) increasingly, may employ POS tagging as well as rules and dictionaries Wednesday, 6 March, 13
  2. phonetic analysis grapheme to phoneme conversion something like the phonetic

    alphabet controversy = /k o1 n t r ax0 v er2 s iy/ dynamic time warping (dtw)! disambiguation (record/rɛkɝd/rɪkɔrd) often uses dictionary- and rule-based approaches, and probably a lexicon, to determine proper word choice Wednesday, 6 March, 13
  3. prosody prediction pattern, rhythm, and intonation “I am speaking” /

    “I am speaking” accurate prosody modelling essential for natural- sounding systems (instead of flat robot sounds) “sentence-final” prosody NLP processes that can identify emotion or sentiment will help increase the accuracy of prosody generation in TTS applications. Wednesday, 6 March, 13
  4. acoustic modelling once we have a phonetic structure for the

    synthesizer to read, there are two main kinds of speech synthesis: synthesis by rule formant-based (speak & spell) articulation-based (speech “organs”) concatenative synthesis (w/real voices) Wednesday, 6 March, 13
  5. thanks again! Mitkov, R. (2005). The Oxford Handbook of Computational

    Linguistics. Oxford University Press. Taylor, P. A. (2009). Text-to-speech synthesis. Cambridge, U.K. ; New York: Cambridge University Press. D. Sasirekha, & Chandra, E. (2012). Text to Speech: A Simple Tutorial. International Journal of Soft Computing and Engineering, 2(1), 275–278. Wednesday, 6 March, 13