TY - GEN
T1 - Causal BERT
T2 - 18th IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021
AU - Resnick, Cinjon
AU - Litany, Or
AU - Kar, Amlan
AU - Kreis, Karsten
AU - Lucas, James
AU - Cho, Kyunghyun
AU - Fidler, Sanja
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Autonomous vehicles (AV) often rely on perception modules built upon neural networks for object detection. These modules frequently have low expected error overall but high error on unknown groups due to biases inherent in the training process. When these errors cause vehicle failure, manufacturers pay humans to comb through the associated images and label what group they are from. Data from that group is then collected, annotated, and added to the training set before retraining the model to fix the issue. In other words, group errors are found and addressed in hindsight. Our main contribution is a method to find such groups in foresight, leveraging advances in simulation as well as masked language modeling in order to perform causal interventions on simulated driving scenes. We then use the found groups to improve detection, exemplified by Diamondback bikes, whose performance we improve by 30 AP points. Such a solution is of high priority because it would greatly improve the robustness and safety of AV systems. Our second contribution is the tooling to run interventions, which will benefit the causal community tremendously.
AB - Autonomous vehicles (AV) often rely on perception modules built upon neural networks for object detection. These modules frequently have low expected error overall but high error on unknown groups due to biases inherent in the training process. When these errors cause vehicle failure, manufacturers pay humans to comb through the associated images and label what group they are from. Data from that group is then collected, annotated, and added to the training set before retraining the model to fix the issue. In other words, group errors are found and addressed in hindsight. Our main contribution is a method to find such groups in foresight, leveraging advances in simulation as well as masked language modeling in order to perform causal interventions on simulated driving scenes. We then use the found groups to improve detection, exemplified by Diamondback bikes, whose performance we improve by 30 AP points. Such a solution is of high priority because it would greatly improve the robustness and safety of AV systems. Our second contribution is the tooling to run interventions, which will benefit the causal community tremendously.
UR - http://www.scopus.com/inward/record.url?scp=85121409574&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121409574&partnerID=8YFLogxK
U2 - 10.1109/ICCVW54120.2021.00332
DO - 10.1109/ICCVW54120.2021.00332
M3 - Conference contribution
AN - SCOPUS:85121409574
T3 - Proceedings of the IEEE International Conference on Computer Vision
SP - 2972
EP - 2981
BT - Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 11 October 2021 through 17 October 2021
ER -