TY - JOUR
T1 - Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening
AU - Wu, Nan
AU - Phang, Jason
AU - Park, Jungkyu
AU - Shen, Yiqiu
AU - Huang, Zhe
AU - Zorin, Masha
AU - Jastrzebski, Stanislaw
AU - Fevry, Thibault
AU - Katsnelson, Joe
AU - Kim, Eric
AU - Wolfson, Stacey
AU - Parikh, Ujas
AU - Gaddam, Sushma
AU - Lin, Leng Leng Young
AU - Ho, Kara
AU - Weinstein, Joshua D.
AU - Reig, Beatriu
AU - Gao, Yiming
AU - Toth, Hildegard
AU - Pysarenko, Kristine
AU - Lewin, Alana
AU - Lee, Jiyon
AU - Airola, Krystal
AU - Mema, Eralda
AU - Chung, Stephanie
AU - Hwang, Esther
AU - Samreen, Naziya
AU - Kim, S. Gene
AU - Heacock, Laura
AU - Moy, Linda
AU - Cho, Kyunghyun
AU - Geras, Krzysztof J.
N1 - Publisher Copyright:
© 1982-2012 IEEE.
PY - 2020/4
Y1 - 2020/4
N2 - We present a deep convolutional neural network for breast cancer screening exam classification, trained, and evaluated on over 200000 exams (over 1000000 images). Our network achieves an AUC of 0.895 in predicting the presence of cancer in the breast, when tested on the screening population. We attribute the high accuracy to a few technical advances. 1) Our network's novel two-stage architecture and training procedure, which allows us to use a high-capacity patch-level network to learn from pixel-level labels alongside a network learning from macroscopic breast-level labels. 2) A custom ResNet-based network used as a building block of our model, whose balance of depth and width is optimized for high-resolution medical images. 3) Pretraining the network on screening BI-RADS classification, a related task with more noisy labels. 4) Combining multiple input views in an optimal way among a number of possible choices. To validate our model, we conducted a reader study with 14 readers, each reading 720 screening mammogram exams, and show that our model is as accurate as experienced radiologists when presented with the same data. We also show that a hybrid model, averaging the probability of malignancy predicted by a radiologist with a prediction of our neural network, is more accurate than either of the two separately. To further understand our results, we conduct a thorough analysis of our network's performance on different subpopulations of the screening population, the model's design, training procedure, errors, and properties of its internal representations. Our best models are publicly available at https://github.com/nyukat/breast_cancer_classifier.
AB - We present a deep convolutional neural network for breast cancer screening exam classification, trained, and evaluated on over 200000 exams (over 1000000 images). Our network achieves an AUC of 0.895 in predicting the presence of cancer in the breast, when tested on the screening population. We attribute the high accuracy to a few technical advances. 1) Our network's novel two-stage architecture and training procedure, which allows us to use a high-capacity patch-level network to learn from pixel-level labels alongside a network learning from macroscopic breast-level labels. 2) A custom ResNet-based network used as a building block of our model, whose balance of depth and width is optimized for high-resolution medical images. 3) Pretraining the network on screening BI-RADS classification, a related task with more noisy labels. 4) Combining multiple input views in an optimal way among a number of possible choices. To validate our model, we conducted a reader study with 14 readers, each reading 720 screening mammogram exams, and show that our model is as accurate as experienced radiologists when presented with the same data. We also show that a hybrid model, averaging the probability of malignancy predicted by a radiologist with a prediction of our neural network, is more accurate than either of the two separately. To further understand our results, we conduct a thorough analysis of our network's performance on different subpopulations of the screening population, the model's design, training procedure, errors, and properties of its internal representations. Our best models are publicly available at https://github.com/nyukat/breast_cancer_classifier.
KW - Deep learning
KW - breast cancer screening
KW - deep convolutional neural networks
KW - mammography
UR - http://www.scopus.com/inward/record.url?scp=85082990181&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082990181&partnerID=8YFLogxK
U2 - 10.1109/TMI.2019.2945514
DO - 10.1109/TMI.2019.2945514
M3 - Article
C2 - 31603772
AN - SCOPUS:85082990181
SN - 0278-0062
VL - 39
SP - 1184
EP - 1194
JO - IEEE Transactions on Medical Imaging
JF - IEEE Transactions on Medical Imaging
IS - 4
M1 - 8861376
ER -