Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages

Hermann, Enno; Kamper, Herman; Goldwater, Sharon

dc.contributor.author	Hermann, Enno	en_ZA
dc.contributor.author	Kamper, Herman	en_ZA
dc.contributor.author	Goldwater, Sharon	en_ZA
dc.date.accessioned	2023-05-04T09:11:40Z	en_ZA
dc.date.available	2023-05-04T09:11:40Z	en_ZA
dc.date.issued	2021-04	en_ZA
dc.description	CITATION: Hermann, E., Kamper, H. and Goldwater, S. 2021. Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages. Computer Speech & Language 65(2021):17 pages. doi:10.1016/j.csl.2020.101098	en_ZA
dc.description	The original publication is available at: sciencedirect.com	en_ZA
dc.description.abstract	Subword modeling for zero-resource languages aims to learn low-level representations of speech audio without using transcriptions or other resources from the target language (such as text corpora or pronunciation dictionaries). A good representation should capture phonetic content and abstract away from other types of variability, such as speaker differences and channel noise. Previous work in this area has primarily focused on unsupervised learning from target language data only, and has been evaluated only intrinsically. Here we directly compare multiple methods, including some that use only target language speech data and some that use transcribed speech from other (non-target) languages, and we evaluate using two intrinsic measures as well as on a downstream unsupervised word segmentation and clustering task. We find that combining two existing target-language-only methods yields better features than either method alone. Nevertheless, even better results are obtained by extracting target language bottleneck features using a model trained on other languages. Cross-lingual training using just one other language is enough to provide this benefit, but multilingual training helps even more. In addition to these results, which hold across both intrinsic measures and the extrinsic task, we discuss the qualitative differences between the different types of learned features.	en_ZA
dc.description.version	Publisher’s version	en_ZA
dc.format.extent	17 pages	en_ZA
dc.identifier.citation	Hermann, E., Kamper, H. and Goldwater, S. 2021. Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages. Computer Speech & Language 65(2021):17 pages. doi:10.1016/j.csl.2020.101098	en_ZA
dc.identifier.issn	0885-2308 (online)	en_ZA
dc.identifier.other	doi.10.1016/j.csl.2020.101098	en_ZA
dc.identifier.uri	http://hdl.handle.net/10019.1/126865	en_ZA
dc.language.iso	en_ZA	en_ZA
dc.publisher	Elsevier Ltd	en_ZA
dc.rights.holder	Authors retain copyright	en_ZA
dc.subject	Computational linguistics	en_ZA
dc.subject	Artificial intelligence	en_ZA
dc.subject	Text processing (Computer science)	en_ZA
dc.subject	Zero-resource speech technology	en_ZA
dc.subject	Subword modeling	en_ZA
dc.subject	Unsupervised feature extraction	en_ZA
dc.title	Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages	en_ZA
dc.type	Article	en_ZA

Files

Original bundle

Name: enno_multilingual_2021.pdf
Size: 1.18 MB
Format: Adobe Portable Document Format
Description: download article

Collections

Research Articles (Electrical and Electronic Engineering)