TY - GEN
T1 - Goal-Driven Transformer for Robot Behavior Learning from Play Data
AU - Wen, Congcong
AU - Liang, Jiazhao
AU - Yuan, Shuaihang
AU - Huang, Hao
AU - Hao, Yu
AU - Lin, Hui
AU - Liu, Yu Shen
AU - Fang, Yi
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - Robot behavior learning has emerged as a crucial field, allowing robots to adapt and improve their actions based on experiential knowledge rather than being solely reliant on predefined instructions. However, the effectiveness of such learning is often hindered by the limitations of offline reinforcement learning, which relies on pre-defined reward labels, and traditional imitation learning, which depends on high-quality expert demonstrations. To address these challenges, in this paper, we propose a novel Goal-Driven Transformer (GDT) for robotic behavior learning from play data. The core module of the GDT is the inclusion of the Goal-Driven Attention Block (GDAB) that utilizes attention mechanisms to concentrate the model’s focus on particular objectives, enabling the GDT to selectively focus on critical parts of the observation data to perform behavioral learning for specific goals. Moreover, we employ the Standard Attention Block (SAB) to ensure that this goal-directed learning occurs with a comprehensive understanding of the environment and the sequence of actions required. Experimental validation of the proposed GDT framework is conducted in two simulated environments: Block-pushing and Franka Kitchen. The results demonstrate that the GDT framework has achieved state-of-the-art performance in the realm of robot behavior learning from play data. Videos are available at: https://gdt-bl.github.io/.
AB - Robot behavior learning has emerged as a crucial field, allowing robots to adapt and improve their actions based on experiential knowledge rather than being solely reliant on predefined instructions. However, the effectiveness of such learning is often hindered by the limitations of offline reinforcement learning, which relies on pre-defined reward labels, and traditional imitation learning, which depends on high-quality expert demonstrations. To address these challenges, in this paper, we propose a novel Goal-Driven Transformer (GDT) for robotic behavior learning from play data. The core module of the GDT is the inclusion of the Goal-Driven Attention Block (GDAB) that utilizes attention mechanisms to concentrate the model’s focus on particular objectives, enabling the GDT to selectively focus on critical parts of the observation data to perform behavioral learning for specific goals. Moreover, we employ the Standard Attention Block (SAB) to ensure that this goal-directed learning occurs with a comprehensive understanding of the environment and the sequence of actions required. Experimental validation of the proposed GDT framework is conducted in two simulated environments: Block-pushing and Franka Kitchen. The results demonstrate that the GDT framework has achieved state-of-the-art performance in the realm of robot behavior learning from play data. Videos are available at: https://gdt-bl.github.io/.
KW - Attention Mechanism.
KW - Robot Behavior Learning
KW - Robot Play Data
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85212275407&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85212275407&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-78113-1_23
DO - 10.1007/978-3-031-78113-1_23
M3 - Conference contribution
AN - SCOPUS:85212275407
SN - 9783031781124
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 346
EP - 359
BT - Pattern Recognition - 27th International Conference, ICPR 2024, Proceedings
A2 - Antonacopoulos, Apostolos
A2 - Chaudhuri, Subhasis
A2 - Chellappa, Rama
A2 - Liu, Cheng-Lin
A2 - Bhattacharya, Saumik
A2 - Pal, Umapada
PB - Springer Science and Business Media Deutschland GmbH
T2 - 27th International Conference on Pattern Recognition, ICPR 2024
Y2 - 1 December 2024 through 5 December 2024
ER -