TY - GEN
T1 - Modeling pronunciation variation with context-dependent articulatory feature decision trees
AU - Bowman, Sam
AU - Livescu, Karen
PY - 2010
Y1 - 2010
N2 - We consider the problem of predicting the surface pronunciations of a word in conversational speech, using a model of pronunciation variation based on articulatory features. We build context-dependent decision trees for both phone-based and feature-based models, and compare their perplexities on conversational data from the Switchboard Transcription Project. We find that a fully-factored model, with separate decision trees for each articulatory feature, does not perform well, but a feature-based model using a smaller number of "feature bundles" outperforms both the fully-factored model and a phone-based model. The articulatory feature-based decision trees are also much more robust to reductions in training data. We also analyze the usefulness of various context variables.
AB - We consider the problem of predicting the surface pronunciations of a word in conversational speech, using a model of pronunciation variation based on articulatory features. We build context-dependent decision trees for both phone-based and feature-based models, and compare their perplexities on conversational data from the Switchboard Transcription Project. We find that a fully-factored model, with separate decision trees for each articulatory feature, does not perform well, but a feature-based model using a smaller number of "feature bundles" outperforms both the fully-factored model and a phone-based model. The articulatory feature-based decision trees are also much more robust to reductions in training data. We also analyze the usefulness of various context variables.
KW - Articulatory features
KW - Pronunciation modeling
UR - http://www.scopus.com/inward/record.url?scp=79959828525&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79959828525&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:79959828525
T3 - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
SP - 326
EP - 329
BT - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
PB - International Speech Communication Association
ER -