TY - GEN
T1 - A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction Networks
AU - Zhou, Yefan
AU - Shen, Yiru
AU - Yan, Yujun
AU - Feng, Chen
AU - Yang, Yaoqing
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Neural networks (NN) for single-view 3D reconstruction (SVR) have gained in popularity. Recent work points out that for SVR,most cutting-edge NNs have limited performance on reconstructing unseen objects because they rely primarily on recognition (i.e.,classification-based methods) rather than shape reconstruction. To understand this issue in depth,we provide a systematic study on when and why NNs prefer recognition to reconstruction and vice versa. Our finding shows that a leading factor in determining recognition versus reconstruction is how 'dispersed' the training data is. Thus,we introduce the dispersion score,a new data-driven metric,to quantify this leading factor and study its effect on NNs. We hypothesize that NNs are biased toward recognition when training images are more dispersed and training shapes are less dispersed. Our hypothesis is supported and the dispersion score is proved effective through our experiments on synthetic and benchmark datasets. We show that the proposed metric is a principal way to analyze reconstruction quality and provides novel information in addition to the conventional reconstruction score. We have open-sourced our code.1
AB - Neural networks (NN) for single-view 3D reconstruction (SVR) have gained in popularity. Recent work points out that for SVR,most cutting-edge NNs have limited performance on reconstructing unseen objects because they rely primarily on recognition (i.e.,classification-based methods) rather than shape reconstruction. To understand this issue in depth,we provide a systematic study on when and why NNs prefer recognition to reconstruction and vice versa. Our finding shows that a leading factor in determining recognition versus reconstruction is how 'dispersed' the training data is. Thus,we introduce the dispersion score,a new data-driven metric,to quantify this leading factor and study its effect on NNs. We hypothesize that NNs are biased toward recognition when training images are more dispersed and training shapes are less dispersed. Our hypothesis is supported and the dispersion score is proved effective through our experiments on synthetic and benchmark datasets. We show that the proposed metric is a principal way to analyze reconstruction quality and provides novel information in addition to the conventional reconstruction score. We have open-sourced our code.1
KW - 3D vision
KW - Reconstruction metric
KW - Single view 3D reconstruction
UR - http://www.scopus.com/inward/record.url?scp=85125009094&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85125009094&partnerID=8YFLogxK
U2 - 10.1109/3DV53792.2021.00140
DO - 10.1109/3DV53792.2021.00140
M3 - Conference contribution
AN - SCOPUS:85125009094
T3 - Proceedings - 2021 International Conference on 3D Vision, 3DV 2021
SP - 1331
EP - 1340
BT - Proceedings - 2021 International Conference on 3D Vision, 3DV 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on 3D Vision, 3DV 2021
Y2 - 1 December 2021 through 3 December 2021
ER -