ALTIS: A new algorithm for adaptive long-term SNR estimation in multi-talker babble

Roozbeh Soleymani, Ivan W. Selesnick, David M. Landsberger

Research output: Contribution to journalArticlepeer-review

Abstract

We introduce a real-time capable algorithm which estimates the long-term signal to noise ratio (SNR) of the speech in multi-talker babble noise. In real-time applications, long-term SNR is calculated over a sufficiently long moving frame of the noisy speech ending at the current time. The algorithm performs the real-time long-term SNR estimation by averaging “speech-likeness” values of multiple consecutive short-frames of the noisy speech which collectively form a long-frame with an adaptive length. The algorithm is calibrated to be insensitive to short-term fluctuations and transient changes in speech or noise level. However, it quickly responds to non-transient changes in long-term SNR by adjusting the duration of the long-frame on which the long-term SNR is measured. This ability is obtained by employing an event detector and adaptive frame duration. The event detector identifies non-transient changes of the long-term SNR and optimizes the duration of the long-frame accordingly. The algorithm was trained and tested for randomly generated speech samples corrupted with multi-talker babble. In addition to its ability to provide an adaptive long-term SNR estimation in a dynamic noisy situation, the evaluation results show that the algorithm outperforms the existing overall SNR estimation methods in multi-talker babble over a wide range of number of talkers and SNRs. The relatively low computational cost and the ability to update the estimated long-term SNR several times per second make this algorithm capable of operating in real-time speech processing applications.

Original languageEnglish (US)
Pages (from-to)231-246
Number of pages16
JournalComputer Speech and Language
Volume58
DOIs
StatePublished - Nov 2019

Keywords

  • Adaptive SNR
  • Long-term SNR
  • Multi-talker babble
  • Real-time SNR
  • Signal-to-noise ratio

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'ALTIS: A new algorithm for adaptive long-term SNR estimation in multi-talker babble'. Together they form a unique fingerprint.

Cite this