TY - JOUR
T1 - Emergence of addictive behaviors in reinforcement learning agents
AU - Behzadan, Vahid
AU - Yampolskiy, Roman V.
AU - Munir, Arslan
N1 - Publisher Copyright:
© 2018 CEUR-WS. All rights reserved.
PY - 2019
Y1 - 2019
N2 - This paper presents a novel approach to the technical analysis of wireheading in intelligent agents. Inspired by the natural analogues of wireheading and their prevalent manifestations, we propose the modeling of such phenomenon in Reinforcement Learning (RL) agents as psychological disorders. In a preliminary step towards evaluating this proposal, we study the feasibility and dynamics of emergent addictive policies in Q-learning agents in the tractable environment of the game of Snake. We consider a slightly modified version of this game, in which the environment provides a “drug” seed alongside the original “healthy” seed for the consumption of the snake. We adopt and extend an RL-based model of natural addiction to Q-learning agents in these settings, and derive sufficient parametric conditions for the emergence of addictive behaviors in such agents. Furthermore, we evaluate our theoretical analysis with three sets of simulation-based experiments. The results demonstrate the feasibility of addictive wireheading in RL agents, and provide promising venues of further research on the psychopathological modeling of complex AI safety problems.
AB - This paper presents a novel approach to the technical analysis of wireheading in intelligent agents. Inspired by the natural analogues of wireheading and their prevalent manifestations, we propose the modeling of such phenomenon in Reinforcement Learning (RL) agents as psychological disorders. In a preliminary step towards evaluating this proposal, we study the feasibility and dynamics of emergent addictive policies in Q-learning agents in the tractable environment of the game of Snake. We consider a slightly modified version of this game, in which the environment provides a “drug” seed alongside the original “healthy” seed for the consumption of the snake. We adopt and extend an RL-based model of natural addiction to Q-learning agents in these settings, and derive sufficient parametric conditions for the emergence of addictive behaviors in such agents. Furthermore, we evaluate our theoretical analysis with three sets of simulation-based experiments. The results demonstrate the feasibility of addictive wireheading in RL agents, and provide promising venues of further research on the psychopathological modeling of complex AI safety problems.
UR - http://www.scopus.com/inward/record.url?scp=85060618706&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060618706&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85060618706
SN - 1613-0073
VL - 2301
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 2019 AAAI Workshop on Artificial Intelligence Safety, SafeAI 2019
Y2 - 27 January 2019
ER -