TY - JOUR
T1 - A Redundant Cortical Code for Speech Envelope
AU - Penikis, Kristina B.
AU - Sanes, Dan H.
N1 - Funding Information:
Received Aug. 6, 2021; revised Aug. 19, 2022; accepted Oct. 23, 2022. Author contributions: K.B.P. and D.H.S. designed research; K.B.P. performed research; K.B.P. analyzed data; K.B.P. wrote the first draft of the paper; K.B.P. and D.H.S. edited the paper; K.B.P. and D.H.S. wrote the paper. This work was supported by National Institutes of Health Grants F31DC014903 (to K.B.P.) and R01DC011284 (to D.H.S.). We thank Shihab Shamma and Yulia Oganian for providing code that contributed to the analyses in Figures 7 and 9. K. B. Penikis’ present address: Department of Psychology, West Virginia University, Morgantown, WV 26505. The authors declare no competing financial interests. Correspondence should be addressed to Kristina B. Penikis at kbp254@nyu.edu. https://doi.org/10.1523/JNEUROSCI.1616-21.2022 Copyright © 2023 the authors
Publisher Copyright:
Copyright © 2023 the authors.
PY - 2022/1/4
Y1 - 2022/1/4
N2 - Animal communication sounds exhibit complex temporal structure because of the amplitude fluctuations that comprise the sound envelope. In human speech, envelope modulations drive synchronized activity in auditory cortex (AC), which correlates strongly with comprehension (Giraud and Poeppel, 2012; Peelle and Davis, 2012; Haegens and Zion Golumbic, 2018). Studies of envelope coding in single neurons, performed in nonhuman animals, have focused on periodic amplitude modulation (AM) stimuli and use response metrics that are not easy to juxtapose with data from humans. In this study, we sought to bridge these fields. Specifically, we looked directly at the temporal relationship between stimulus envelope and spiking, and we assessed whether the apparent diversity across neurons’ AM responses contributes to the population representation of speech-like sound envelopes. We gathered responses from single neurons to vocoded speech stimuli and compared them to sinusoidal AM responses in auditory cortex (AC) of alert, freely moving Mongolian gerbils of both sexes. While AC neurons displayed heterogeneous tuning to AM rate, their temporal dynamics were stereotyped. Preferred response phases accumulated near the onsets of sinusoidal AM periods for slower rates (<8 Hz), and an over-representation of amplitude edges was apparent in population responses to both sinusoidal AM and vocoded speech envelopes. Crucially, this encoding bias imparted a decoding benefit: a classifier could discriminate vocoded speech stimuli using summed population activity, while higher frequency modulations required a more sophisticated decoder that tracked spiking responses from individual cells. Together, our results imply that the envelope structure relevant to parsing an acoustic stream could be read-out from a distributed, redundant population code.
AB - Animal communication sounds exhibit complex temporal structure because of the amplitude fluctuations that comprise the sound envelope. In human speech, envelope modulations drive synchronized activity in auditory cortex (AC), which correlates strongly with comprehension (Giraud and Poeppel, 2012; Peelle and Davis, 2012; Haegens and Zion Golumbic, 2018). Studies of envelope coding in single neurons, performed in nonhuman animals, have focused on periodic amplitude modulation (AM) stimuli and use response metrics that are not easy to juxtapose with data from humans. In this study, we sought to bridge these fields. Specifically, we looked directly at the temporal relationship between stimulus envelope and spiking, and we assessed whether the apparent diversity across neurons’ AM responses contributes to the population representation of speech-like sound envelopes. We gathered responses from single neurons to vocoded speech stimuli and compared them to sinusoidal AM responses in auditory cortex (AC) of alert, freely moving Mongolian gerbils of both sexes. While AC neurons displayed heterogeneous tuning to AM rate, their temporal dynamics were stereotyped. Preferred response phases accumulated near the onsets of sinusoidal AM periods for slower rates (<8 Hz), and an over-representation of amplitude edges was apparent in population responses to both sinusoidal AM and vocoded speech envelopes. Crucially, this encoding bias imparted a decoding benefit: a classifier could discriminate vocoded speech stimuli using summed population activity, while higher frequency modulations required a more sophisticated decoder that tracked spiking responses from individual cells. Together, our results imply that the envelope structure relevant to parsing an acoustic stream could be read-out from a distributed, redundant population code.
KW - amplitude modulation
KW - auditory
KW - cortex
KW - envelope
KW - speech
KW - temporal coding
UR - http://www.scopus.com/inward/record.url?scp=85145641437&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85145641437&partnerID=8YFLogxK
U2 - 10.1523/JNEUROSCI.1616-21.2022
DO - 10.1523/JNEUROSCI.1616-21.2022
M3 - Article
C2 - 36379706
AN - SCOPUS:85145641437
SN - 0270-6474
VL - 43
SP - 93
EP - 112
JO - Journal of Neuroscience
JF - Journal of Neuroscience
IS - 1
ER -