Pipelining acoustic model training for speech recognition using storm

Dinkar Sitaram, Haripriya Srinivasaraghavan, Kapish Agarwal, Amritanshu Agrawal, Neha Joshi, Debraj Ray

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Speech recognition has been increasingly used on mobile devices, which has in turn increased the need for creation of new acoustic models for various languages, dialects, accents, speakers and environmental conditions. This involves training and adapting a huge number of acoustic models, some of them in real-time. Training acoustic models is thus essential for speech recognition because these models determine the accuracy and quality of the recognition process. This paper, discusses the use of Storm, a distributed real time computational system, to pipeline the creation of acoustic models by CMU Sphinx, an open-source software project for speech recognition and training. Software pipelining reduces the time required for training and optimizes system resource utilization, thus enabling huge amounts of data to be trained in considerably less amount of time than taken by the conventional sequential process. Pipelining is achieved by grouping the stages of the training process into a set of five stages, and running each stage on individual nodes in a Storm cluster. Thus acoustic models are created by training multiple streams of speech samples using the same SphinxTrain setup, also resulting in improvement of training time and throughput.

    Original languageEnglish (US)
    Title of host publicationProceedings - 2013 5th International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2013
    PublisherIEEE Computer Society
    Pages219-224
    Number of pages6
    ISBN (Print)9780769551555
    DOIs
    StatePublished - 2013
    Event2013 5th International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2013 - Seoul, Korea, Republic of
    Duration: Sep 24 2013Sep 26 2013

    Publication series

    NameProceedings of International Conference on Computational Intelligence, Modelling and Simulation
    ISSN (Print)2166-8523
    ISSN (Electronic)2166-8531

    Conference

    Conference2013 5th International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2013
    Country/TerritoryKorea, Republic of
    CitySeoul
    Period9/24/139/26/13

    Keywords

    • CMU Sphinx
    • Parallel Computing
    • Pipelining
    • Speech
    • SphinxTrain
    • Storm

    ASJC Scopus subject areas

    • Computational Theory and Mathematics
    • Applied Mathematics
    • Modeling and Simulation

    Fingerprint

    Dive into the research topics of 'Pipelining acoustic model training for speech recognition using storm'. Together they form a unique fingerprint.

    Cite this