Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop

G. Zweig, P. Nguyen, D. Van Compernolle, K. Demuynck, L. Atlas, P. Clark, G. Sell, M. Wang, F. Sha, H. Hermansky, D. Karakos, A. Jansen, S. Thomas, G. S.V.S. S, S. Bowman, J. Kao

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    This paper summarizes the 2010 CLSP Summer Workshop on speech recognition at Johns Hopkins University. The key theme of the workshop was to improve on state-of-the-art speech recognition systems by using Segmental Conditional Random Fields (SCRFs) to integrate multiple types of information. This approach uses a state-of-the-art baseline as a springboard from which to add a suite of novel features including ones derived from acoustic templates, deep neural net phoneme detections, duration models, modulation features, and whole word point-process models. The SCRF framework is able to appropriately weight these different information sources to produce significant gains on both the Broadcast News and Wall Street Journal tasks.

    Original languageEnglish (US)
    Title of host publication2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
    Pages5044-5047
    Number of pages4
    DOIs
    StatePublished - 2011
    Event36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Prague, Czech Republic
    Duration: May 22 2011May 27 2011

    Publication series

    NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    ISSN (Print)1520-6149

    Other

    Other36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
    CountryCzech Republic
    CityPrague
    Period5/22/115/27/11

    Keywords

    • CRF
    • Segmental Conditional Random Field
    • Speech Recognition

    ASJC Scopus subject areas

    • Software
    • Signal Processing
    • Electrical and Electronic Engineering

    Fingerprint Dive into the research topics of 'Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop'. Together they form a unique fingerprint.

  • Cite this

    Zweig, G., Nguyen, P., Van Compernolle, D., Demuynck, K., Atlas, L., Clark, P., Sell, G., Wang, M., Sha, F., Hermansky, H., Karakos, D., Jansen, A., Thomas, S., S, G. S. V. S., Bowman, S., & Kao, J. (2011). Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop. In 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings (pp. 5044-5047). [5947490] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). https://doi.org/10.1109/ICASSP.2011.5947490