TY - GEN
T1 - Egocentric Prediction of Action Target in 3D
AU - Li, Yiming
AU - Cao, Ziang
AU - Liang, Andrew
AU - Liang, Benjamin
AU - Chen, Luoyao
AU - Zhao, Hang
AU - Feng, Chen
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - We are interested in anticipating as early as possible the target location of a person's object manipulation action in a 3D workspace from egocentric vision. It is important in fields like human-robot collaboration, but has not yet received enough attention from vision and learning communities. To stimulate more research on this challenging egocentric vision task, we propose a large multimodality dataset of more than 1 million frames of RGB-D and IMU streams, and provide evaluation metrics based on our high-quality 2D and 3D labels from semi-automatic annotation. Meanwhile, we design baseline methods using recurrent neural networks and conduct various ablation studies to validate their effectiveness. Our results demonstrate that this new task is worthy of further study by researchers in robotics, vision, and learning communities.
AB - We are interested in anticipating as early as possible the target location of a person's object manipulation action in a 3D workspace from egocentric vision. It is important in fields like human-robot collaboration, but has not yet received enough attention from vision and learning communities. To stimulate more research on this challenging egocentric vision task, we propose a large multimodality dataset of more than 1 million frames of RGB-D and IMU streams, and provide evaluation metrics based on our high-quality 2D and 3D labels from semi-automatic annotation. Meanwhile, we design baseline methods using recurrent neural networks and conduct various ablation studies to validate their effectiveness. Our results demonstrate that this new task is worthy of further study by researchers in robotics, vision, and learning communities.
KW - Behavior analysis
KW - Datasets and evaluation
KW - Robot vision
KW - Vision applications and systems
UR - http://www.scopus.com/inward/record.url?scp=85141787232&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85141787232&partnerID=8YFLogxK
U2 - 10.1109/CVPR52688.2022.02033
DO - 10.1109/CVPR52688.2022.02033
M3 - Conference contribution
AN - SCOPUS:85141787232
T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
SP - 20971
EP - 20980
BT - Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
PB - IEEE Computer Society
T2 - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
Y2 - 19 June 2022 through 24 June 2022
ER -