TY - GEN
T1 - Simulating Bandit Learning from User Feedback for Extractive Question Answering
AU - Gao, Ge
AU - Choi, Eunsol
AU - Artzi, Yoav
N1 - Publisher Copyright:
© 2022 Association for Computational Linguistics.
PY - 2022
Y1 - 2022
N2 - We study learning from user feedback for extractive question answering by simulating feedback using supervised data. We cast the problem as contextual bandit learning, and analyze the characteristics of several learning scenarios with focus on reducing data annotation. We show that systems initially trained on a small number of examples can dramatically improve given feedback from users on model-predicted answers, and that one can use existing datasets to deploy systems in new domains without any annotation, but instead improving the system on-the-fly via user feedback.
AB - We study learning from user feedback for extractive question answering by simulating feedback using supervised data. We cast the problem as contextual bandit learning, and analyze the characteristics of several learning scenarios with focus on reducing data annotation. We show that systems initially trained on a small number of examples can dramatically improve given feedback from users on model-predicted answers, and that one can use existing datasets to deploy systems in new domains without any annotation, but instead improving the system on-the-fly via user feedback.
UR - http://www.scopus.com/inward/record.url?scp=85144946591&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85144946591&partnerID=8YFLogxK
U2 - 10.18653/v1/2022.acl-long.355
DO - 10.18653/v1/2022.acl-long.355
M3 - Conference contribution
AN - SCOPUS:85144946591
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 5167
EP - 5179
BT - ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers)
A2 - Muresan, Smaranda
A2 - Nakov, Preslav
A2 - Villavicencio, Aline
PB - Association for Computational Linguistics (ACL)
T2 - 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022
Y2 - 22 May 2022 through 27 May 2022
ER -