TY - GEN
T1 - Supervised discovery of unknown unknowns through test sample mining (student abstract)
AU - Wang, Zheng
AU - Abrahao, Bruno
AU - Kamar, Ece
N1 - Funding Information:
∗Supported by a National Natural Science Foundation of China (NSFC) grant #61850410536. Copyright ©c 2020, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
Publisher Copyright:
Copyright © 2020 Association for the Advancement of Artificial Intelligence. All rights reserved.
PY - 2020
Y1 - 2020
N2 - Given a fixed hypothesis space, defined to model class structure in a particular domain of application, unknown unknowns (u.u.s) are data examples that form classes in the feature space whose structure is not represented in a trained model. Accordingly, this leads to incorrect class prediction with high confidence, which represents one of the major sources of blind spots in machine learning. Our method seeks to reduce the structural mismatch between the training model and that of the target space in a supervised way. We illuminate further structure through cross-validation on a modified training model, set up to mine and trap u.u.s in a marginal training class, created from examples of a random sample of the test set. Contrary to previous approaches, our method simplifies the solution, as it does not rely on budgeted queries to an Oracle whose outcomes inform adjustments to training. In addition, our empirically results exhibit consistent performance improvements over baselines, on both synthetic and real-world data sets.
AB - Given a fixed hypothesis space, defined to model class structure in a particular domain of application, unknown unknowns (u.u.s) are data examples that form classes in the feature space whose structure is not represented in a trained model. Accordingly, this leads to incorrect class prediction with high confidence, which represents one of the major sources of blind spots in machine learning. Our method seeks to reduce the structural mismatch between the training model and that of the target space in a supervised way. We illuminate further structure through cross-validation on a modified training model, set up to mine and trap u.u.s in a marginal training class, created from examples of a random sample of the test set. Contrary to previous approaches, our method simplifies the solution, as it does not rely on budgeted queries to an Oracle whose outcomes inform adjustments to training. In addition, our empirically results exhibit consistent performance improvements over baselines, on both synthetic and real-world data sets.
UR - http://www.scopus.com/inward/record.url?scp=85106612989&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85106612989&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85106612989
T3 - AAAI 2020 - 34th AAAI Conference on Artificial Intelligence
SP - 13959
EP - 13960
BT - AAAI 2020 - 34th AAAI Conference on Artificial Intelligence
PB - AAAI press
T2 - 34th AAAI Conference on Artificial Intelligence, AAAI 2020
Y2 - 7 February 2020 through 12 February 2020
ER -