Predicting the perceived spaciousness of stereophonic music recordings

Andy M. Sarroff, Juan P. Bello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In a stereophonic music production, music producers seek to impart impressions of one or more virtual spaces upon the recording with two channels of audio. Our goal is to map spaciousness in stereophonic music to objective signal attributes. This is accomplished by building predictive functions by exemplar-based learning. First, spaciousness of recorded stereophonic music is parameterized by three discrete dimensions of perception-the width of the source ensemble, the extent of reverberation, and the extent of immersion. A data set of 50 song excerpts is collected and annotated by humans for each dimension of spaciousness. A verbose feature set is generated on the music recordings and correlation-based feature selection is used to reduce the feature spaces. Exemplar-based support vector regression maps the feature sets to perceived spaciousness. We show that the predictive algorithms perform well on all dimensions and that perceived spaciousness can be successfully mapped to objective attributes of the audio signal.

Original languageEnglish (US)
Title of host publicationProceedings of the 6th Sound and Music Computing Conference, SMC 2009
PublisherSound and music Computing network
Pages83-88
Number of pages6
ISBN (Print)9789899557765
StatePublished - 2009
Event6th Sound and Music Computing Conference, SMC 2009 - Porto, Portugal
Duration: Jul 23 2009Jul 25 2009

Publication series

NameProceedings of the 6th Sound and Music Computing Conference, SMC 2009

Other

Other6th Sound and Music Computing Conference, SMC 2009
Country/TerritoryPortugal
CityPorto
Period7/23/097/25/09

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Predicting the perceived spaciousness of stereophonic music recordings'. Together they form a unique fingerprint.

Cite this