TY - GEN
T1 - On the Relation between Sensitivity and Accuracy in In-Context Learning
AU - Chen, Yanda
AU - Zhao, Chen
AU - Yu, Zhou
AU - McKeown, Kathleen
AU - He, He
N1 - Publisher Copyright:
© 2023 Association for Computational Linguistics.
PY - 2023
Y1 - 2023
N2 - In-context learning (ICL) suffers from oversensitivity to the prompt, making it unreliable in real-world scenarios. We study the sensitivity of ICL with respect to multiple perturbation types. First, we find that label bias obscures the true sensitivity, and therefore prior work may have significantly underestimated ICL sensitivity. Second, we observe a strong negative correlation between ICL sensitivity and accuracy: predictions sensitive to perturbations are less likely to be correct. Motivated by these findings, we propose SENSEL, a few-shot selective prediction method that abstains from sensitive predictions. Experiments on ten classification datasets show that SENSEL consistently outperforms two commonly used confidence-based and entropy-based baselines on abstention decisions.
AB - In-context learning (ICL) suffers from oversensitivity to the prompt, making it unreliable in real-world scenarios. We study the sensitivity of ICL with respect to multiple perturbation types. First, we find that label bias obscures the true sensitivity, and therefore prior work may have significantly underestimated ICL sensitivity. Second, we observe a strong negative correlation between ICL sensitivity and accuracy: predictions sensitive to perturbations are less likely to be correct. Motivated by these findings, we propose SENSEL, a few-shot selective prediction method that abstains from sensitive predictions. Experiments on ten classification datasets show that SENSEL consistently outperforms two commonly used confidence-based and entropy-based baselines on abstention decisions.
UR - http://www.scopus.com/inward/record.url?scp=85183289916&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85183289916&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85183289916
T3 - Findings of the Association for Computational Linguistics: EMNLP 2023
SP - 155
EP - 167
BT - Findings of the Association for Computational Linguistics
PB - Association for Computational Linguistics (ACL)
T2 - 2023 Findings of the Association for Computational Linguistics: EMNLP 2023
Y2 - 6 December 2023 through 10 December 2023
ER -