A monophone speech generation system

dc.contributor.authorKlompje G.
dc.contributor.authorNiesler T.R.
dc.date.accessioned2011-05-15T16:00:41Z
dc.date.available2011-05-15T16:00:41Z
dc.date.issued2007
dc.description.abstractCurrent speech synthesis systems generally require large and carefully annotated speech corpora for their development. However, for many languages these resources are not available. This paper describes a speech generation algorithm based on monophone subword units for minimal reliance on such databases. The system is based on the source-filter speech production framework, and includes a linear prediction based vocal tract model as well as an excitation model. An interpolation algorithm is presented to allow coarticulation between monophone units to be modelled. The excitation model includes a method for dealing with voiced and partiallyvoiced sounds based on a Gaussianity measure applied to the excitation spectrum. Promising first results were obtained when evaluating the intelligibility of the developed system's South African English speech output using the modified rhyme test and semantically unpredictable sentences.
dc.description.versionArticle
dc.identifier.citationTransactions of the South African Institute of Electrical Engineers
dc.identifier.citation98
dc.identifier.citation4
dc.identifier.issn382221
dc.identifier.urihttp://hdl.handle.net/10019.1/11822
dc.subjectCo-articulation
dc.subjectExcitation models
dc.subjectExcitation spectrum
dc.subjectGaussianity
dc.subjectInterpolation algorithms
dc.subjectLinear prediction
dc.subjectModified rhyme test
dc.subjectMultilingual speech synthesis
dc.subjectSpeech corpora
dc.subjectSpeech generation
dc.subjectSpeech output
dc.subjectSpeech production
dc.subjectSpeech synthesis system
dc.subjectSubword units
dc.subjectText to speech
dc.subjectVocal tract models
dc.subjectSpeech synthesis
dc.subjectSpeech intelligibility
dc.titleA monophone speech generation system
dc.typeArticle
Files