TY - JOUR
T1 - Dynamics of self-monitoring and error detection in speech production
T2 - Evidence from mental imagery and MEG
AU - Tian, Xing
AU - Poeppel, David
N1 - Publisher Copyright:
© 2014 Massachusetts Institute of Technology.
PY - 2014/3/1
Y1 - 2014/3/1
N2 - A critical subroutine of self-monitoring during speech production is to detect any deviance between expected and actual auditory feedback. Here we investigated the associated neural dynamics using MEG recording in mental-imagery-of-speech paradigms. Participants covertly articulated the vowel /a/; their own (individually recorded) speech was played back, with parametric manipulation using four levels of pitch shift, crossed with four levels of onset delay. A nonmonotonic function was observed in early auditory responses when the onset delay was shorter than 100 msec: Suppression was observed for normal playback, but enhancement for pitch-shifted playback; however, the magnitude of enhancement decreased at the largest level of pitch shift that was out of pitch range for normal conversion, as suggested in two behavioral experiments. No difference was observed among different types of playback when the onset delay was longer than 100 msec. These results suggest that the prediction suppresses the response to normal feedback, which mediates source monitoring. When auditory feedback does not match the prediction, an “error term” is generated, which underlies deviance detection. We argue that, based on the observed nonmonotonic function, a frequency window (addressing spectral difference) and a time window (constraining temporal difference) jointly regulate the comparison between prediction and feedback in speech.
AB - A critical subroutine of self-monitoring during speech production is to detect any deviance between expected and actual auditory feedback. Here we investigated the associated neural dynamics using MEG recording in mental-imagery-of-speech paradigms. Participants covertly articulated the vowel /a/; their own (individually recorded) speech was played back, with parametric manipulation using four levels of pitch shift, crossed with four levels of onset delay. A nonmonotonic function was observed in early auditory responses when the onset delay was shorter than 100 msec: Suppression was observed for normal playback, but enhancement for pitch-shifted playback; however, the magnitude of enhancement decreased at the largest level of pitch shift that was out of pitch range for normal conversion, as suggested in two behavioral experiments. No difference was observed among different types of playback when the onset delay was longer than 100 msec. These results suggest that the prediction suppresses the response to normal feedback, which mediates source monitoring. When auditory feedback does not match the prediction, an “error term” is generated, which underlies deviance detection. We argue that, based on the observed nonmonotonic function, a frequency window (addressing spectral difference) and a time window (constraining temporal difference) jointly regulate the comparison between prediction and feedback in speech.
UR - http://www.scopus.com/inward/record.url?scp=84920044054&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84920044054&partnerID=8YFLogxK
U2 - 10.1162/jocn_a_00692
DO - 10.1162/jocn_a_00692
M3 - Article
C2 - 25061925
AN - SCOPUS:84920044054
SN - 0898-929X
VL - 27
SP - 352
EP - 364
JO - Journal of Cognitive Neuroscience
JF - Journal of Cognitive Neuroscience
IS - 2
ER -