TY - GEN
T1 - Predicting the perceived spaciousness of stereophonic music recordings
AU - Sarroff, Andy M.
AU - Bello, Juan P.
PY - 2009
Y1 - 2009
N2 - In a stereophonic music production, music producers seek to impart impressions of one or more virtual spaces upon the recording with two channels of audio. Our goal is to map spaciousness in stereophonic music to objective signal attributes. This is accomplished by building predictive functions by exemplar-based learning. First, spaciousness of recorded stereophonic music is parameterized by three discrete dimensions of perception-the width of the source ensemble, the extent of reverberation, and the extent of immersion. A data set of 50 song excerpts is collected and annotated by humans for each dimension of spaciousness. A verbose feature set is generated on the music recordings and correlation-based feature selection is used to reduce the feature spaces. Exemplar-based support vector regression maps the feature sets to perceived spaciousness. We show that the predictive algorithms perform well on all dimensions and that perceived spaciousness can be successfully mapped to objective attributes of the audio signal.
AB - In a stereophonic music production, music producers seek to impart impressions of one or more virtual spaces upon the recording with two channels of audio. Our goal is to map spaciousness in stereophonic music to objective signal attributes. This is accomplished by building predictive functions by exemplar-based learning. First, spaciousness of recorded stereophonic music is parameterized by three discrete dimensions of perception-the width of the source ensemble, the extent of reverberation, and the extent of immersion. A data set of 50 song excerpts is collected and annotated by humans for each dimension of spaciousness. A verbose feature set is generated on the music recordings and correlation-based feature selection is used to reduce the feature spaces. Exemplar-based support vector regression maps the feature sets to perceived spaciousness. We show that the predictive algorithms perform well on all dimensions and that perceived spaciousness can be successfully mapped to objective attributes of the audio signal.
UR - http://www.scopus.com/inward/record.url?scp=84905190575&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905190575&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84905190575
SN - 9789899557765
T3 - Proceedings of the 6th Sound and Music Computing Conference, SMC 2009
SP - 83
EP - 88
BT - Proceedings of the 6th Sound and Music Computing Conference, SMC 2009
PB - Sound and music Computing network
T2 - 6th Sound and Music Computing Conference, SMC 2009
Y2 - 23 July 2009 through 25 July 2009
ER -