Automatic discovery of subword units and pronunciations for automatic speech recognition using TIMIT

Goussard, George; Niesler, Thomas

Automatic discovery of subword units and pronunciations for automatic speech recognition using TIMIT

dc.contributor.author	Goussard, George
dc.contributor.author	Niesler, Thomas
dc.date.accessioned	2011-05-27T08:51:43Z
dc.date.available	2011-05-27T08:51:43Z
dc.date.issued	2010-11
dc.description	Both authors from Stellenbosch University.	en_ZA
dc.description	Proceedings of the twenty-first annual symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, November 2010.	en_ZA
dc.description.abstract	We address the automatic generation of acoustic subword units and an associated pronunciation dictionary for speech recognition. The speech audio is first segmented into phoneme-like units by detecting points at which the spectral characteristics of the signal change abruptly. These audio segments are subsequently subjected to agglomerative clustering in order to group similar acoustic segments. Finally, the orthography is iteratively aligned with the resulting transcription in terms of audio clusters in order to determine pronunciations of the training words. The approach is evaluated by applying it to two subsets of the TIMIT corpus, both of which have a closed vocabulary. It is found that, when vocabulary words occur often in the training set, the proposed technique delivers performance that is close to but lower than a system based on the TIMIT phonetic transcriptions. When vocabulary words are not repeated often in the training set, the best system is able to outperform its counterpart based on the TIMIT phonetic transcriptions, although recognition performance in both cases is poor.	en_ZA
dc.format.extent	6 p. : ill.
dc.identifier.citation	Goussard, GW & Niesler, TR 2010. Automatic discovery of subword units and pronunciations for automatic speech recognition using TIMIT. Proceedings of the twenty-first annual symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, November 2010.	en_ZA
dc.identifier.isbn	9780799224702
dc.identifier.uri	http://hdl.handle.net/10019.1/14773
dc.language.iso	en_ZA	en_ZA
dc.publisher	PRASA	en_ZA
dc.rights.holder	The Author	en_ZA
dc.subject	Automatic subword unit discovery	en_ZA
dc.subject	Automatic speech recognition	en_ZA
dc.subject	TIMIT	en_ZA
dc.title	Automatic discovery of subword units and pronunciations for automatic speech recognition using TIMIT	en_ZA
dc.type	Article	en_ZA

Files

Original bundle

Now showing 1 - 1 of 1

Name:: goussard_automatic_2010.pdf
Size:: 94.1 KB
Format:: Adobe Portable Document Format
Description:: Research paper

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Conference Proceedings (High Performance Computing)