Automatic discovery of subword units and pronunciations for automatic speech recognition using TIMIT

dc.contributor.authorGoussard, George
dc.contributor.authorNiesler, Thomas
dc.date.accessioned2011-05-27T08:51:43Z
dc.date.available2011-05-27T08:51:43Z
dc.date.issued2010-11
dc.descriptionBoth authors from Stellenbosch University.en_ZA
dc.descriptionProceedings of the twenty-first annual symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, November 2010.en_ZA
dc.description.abstractWe address the automatic generation of acoustic subword units and an associated pronunciation dictionary for speech recognition. The speech audio is first segmented into phoneme-like units by detecting points at which the spectral characteristics of the signal change abruptly. These audio segments are subsequently subjected to agglomerative clustering in order to group similar acoustic segments. Finally, the orthography is iteratively aligned with the resulting transcription in terms of audio clusters in order to determine pronunciations of the training words. The approach is evaluated by applying it to two subsets of the TIMIT corpus, both of which have a closed vocabulary. It is found that, when vocabulary words occur often in the training set, the proposed technique delivers performance that is close to but lower than a system based on the TIMIT phonetic transcriptions. When vocabulary words are not repeated often in the training set, the best system is able to outperform its counterpart based on the TIMIT phonetic transcriptions, although recognition performance in both cases is poor.en_ZA
dc.format.extent6 p. : ill.
dc.identifier.citationGoussard, GW & Niesler, TR 2010. Automatic discovery of subword units and pronunciations for automatic speech recognition using TIMIT. Proceedings of the twenty-first annual symposium of the Pattern Recognition Association of South Africa (PRASA), Stellenbosch, South Africa, November 2010.en_ZA
dc.identifier.isbn9780799224702
dc.identifier.urihttp://hdl.handle.net/10019.1/14773
dc.language.isoen_ZAen_ZA
dc.publisherPRASAen_ZA
dc.rights.holderThe Authoren_ZA
dc.subjectAutomatic subword unit discoveryen_ZA
dc.subjectAutomatic speech recognitionen_ZA
dc.subjectTIMITen_ZA
dc.titleAutomatic discovery of subword units and pronunciations for automatic speech recognition using TIMITen_ZA
dc.typeArticleen_ZA
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
goussard_automatic_2010.pdf
Size:
94.1 KB
Format:
Adobe Portable Document Format
Description:
Research paper
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: