TY - GEN
T1 - Pipelining acoustic model training for speech recognition using storm
AU - Sitaram, Dinkar
AU - Srinivasaraghavan, Haripriya
AU - Agarwal, Kapish
AU - Agrawal, Amritanshu
AU - Joshi, Neha
AU - Ray, Debraj
PY - 2013
Y1 - 2013
N2 - Speech recognition has been increasingly used on mobile devices, which has in turn increased the need for creation of new acoustic models for various languages, dialects, accents, speakers and environmental conditions. This involves training and adapting a huge number of acoustic models, some of them in real-time. Training acoustic models is thus essential for speech recognition because these models determine the accuracy and quality of the recognition process. This paper, discusses the use of Storm, a distributed real time computational system, to pipeline the creation of acoustic models by CMU Sphinx, an open-source software project for speech recognition and training. Software pipelining reduces the time required for training and optimizes system resource utilization, thus enabling huge amounts of data to be trained in considerably less amount of time than taken by the conventional sequential process. Pipelining is achieved by grouping the stages of the training process into a set of five stages, and running each stage on individual nodes in a Storm cluster. Thus acoustic models are created by training multiple streams of speech samples using the same SphinxTrain setup, also resulting in improvement of training time and throughput.
AB - Speech recognition has been increasingly used on mobile devices, which has in turn increased the need for creation of new acoustic models for various languages, dialects, accents, speakers and environmental conditions. This involves training and adapting a huge number of acoustic models, some of them in real-time. Training acoustic models is thus essential for speech recognition because these models determine the accuracy and quality of the recognition process. This paper, discusses the use of Storm, a distributed real time computational system, to pipeline the creation of acoustic models by CMU Sphinx, an open-source software project for speech recognition and training. Software pipelining reduces the time required for training and optimizes system resource utilization, thus enabling huge amounts of data to be trained in considerably less amount of time than taken by the conventional sequential process. Pipelining is achieved by grouping the stages of the training process into a set of five stages, and running each stage on individual nodes in a Storm cluster. Thus acoustic models are created by training multiple streams of speech samples using the same SphinxTrain setup, also resulting in improvement of training time and throughput.
KW - CMU Sphinx
KW - Parallel Computing
KW - Pipelining
KW - Speech
KW - SphinxTrain
KW - Storm
UR - http://www.scopus.com/inward/record.url?scp=84892765278&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84892765278&partnerID=8YFLogxK
U2 - 10.1109/CIMSim.2013.42
DO - 10.1109/CIMSim.2013.42
M3 - Conference contribution
AN - SCOPUS:84892765278
SN - 9780769551555
T3 - Proceedings of International Conference on Computational Intelligence, Modelling and Simulation
SP - 219
EP - 224
BT - Proceedings - 2013 5th International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2013
PB - IEEE Computer Society
T2 - 2013 5th International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2013
Y2 - 24 September 2013 through 26 September 2013
ER -