TY - GEN
T1 - Sound texture synthesis via filter statistics
AU - McDermott, Josh H.
AU - Oxenham, Andrew J.
AU - Simoncelli, Eero P.
N1 - Copyright:
Copyright 2010 Elsevier B.V., All rights reserved.
PY - 2009
Y1 - 2009
N2 - Many natural sounds, such as those produced by rainstorms, fires, or insects at night, consist of large numbers of rapidly occurring acoustic events. We hypothesize that humans encode these "sound textures" with statistical measurements that capture their constituent features and the relationship between them. We explored this hypothesis using a synthesis algorithm that measures statistics in a real sound and imposes them on a sample of noise. Simply matching the marginal statistics (variance, kurtosis) of individual frequency subbands was generally necessary, but insufficient, to yield good results. Imposing various pairwise envelope statistics (correlations between bands, and autocorrelations within each band) greatly improved the results, frequently producing synthetic textures that sounded natural and that listeners could reliably recognize. The results suggest that such statistical representations could underlie sound texture perception, and that the auditory system may use fairly simple statistics to recognize many natural sound textures.
AB - Many natural sounds, such as those produced by rainstorms, fires, or insects at night, consist of large numbers of rapidly occurring acoustic events. We hypothesize that humans encode these "sound textures" with statistical measurements that capture their constituent features and the relationship between them. We explored this hypothesis using a synthesis algorithm that measures statistics in a real sound and imposes them on a sample of noise. Simply matching the marginal statistics (variance, kurtosis) of individual frequency subbands was generally necessary, but insufficient, to yield good results. Imposing various pairwise envelope statistics (correlations between bands, and autocorrelations within each band) greatly improved the results, frequently producing synthetic textures that sounded natural and that listeners could reliably recognize. The results suggest that such statistical representations could underlie sound texture perception, and that the auditory system may use fairly simple statistics to recognize many natural sound textures.
KW - Correlations
KW - Envelope
KW - Statistics
KW - Synthesis
KW - Texture
UR - http://www.scopus.com/inward/record.url?scp=77950119016&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77950119016&partnerID=8YFLogxK
U2 - 10.1109/ASPAA.2009.5346467
DO - 10.1109/ASPAA.2009.5346467
M3 - Conference contribution
AN - SCOPUS:77950119016
SN - 9781424436798
T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
SP - 297
EP - 300
BT - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
T2 - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
Y2 - 18 October 2009 through 21 October 2009
ER -