TY - GEN
T1 - Leveling the playing field: Fairness in AI versus human game benchmarks
T2 - 14th International Conference on the Foundations of Digital Games, FDG 2019
AU - Canaan, Rodrigo
AU - Salge, Christoph
AU - Togelius, Julian
AU - Nealen, Andy
N1 - Funding Information:
RC gratefully acknowledges the financial support from Honda Research Institute Europe (HRI-EU). CS is funded by the EU Horizon 2020 programme / Marie Sklodowska-Curie grant 705643.
Publisher Copyright:
© 2019 ACM.
PY - 2019/8/26
Y1 - 2019/8/26
N2 - From the beginning of the history of AI, there has been interest in games as a platform for research. As the field developed, human-level competence in complex games became a target researchers worked to reach. Only relatively recently has this target finally been met for traditional tabletop games such as Backgammon, Chess and Go. This prompted a shift in research focus towards electronic games, which provide unique new challenges. As is often the case with AI research, these results are liable to be exaggerated or misrepresented by either authors or third parties. The extent to which these game benchmarks constitute "fair" competition between humans and AI is also a matter of debate. In this paper, we review statements made by researchers and third parties in the general media and in academic publications about these game benchmark results. We analyze what a fair competition would look like and suggest a taxonomy of dimensions to frame the debate on fairness in game contests between humans and machines. Ultimately, we argue that there is no completely fair way to compare human and AI performance on a game.
AB - From the beginning of the history of AI, there has been interest in games as a platform for research. As the field developed, human-level competence in complex games became a target researchers worked to reach. Only relatively recently has this target finally been met for traditional tabletop games such as Backgammon, Chess and Go. This prompted a shift in research focus towards electronic games, which provide unique new challenges. As is often the case with AI research, these results are liable to be exaggerated or misrepresented by either authors or third parties. The extent to which these game benchmarks constitute "fair" competition between humans and AI is also a matter of debate. In this paper, we review statements made by researchers and third parties in the general media and in academic publications about these game benchmark results. We analyze what a fair competition would look like and suggest a taxonomy of dimensions to frame the debate on fairness in game contests between humans and machines. Ultimately, we argue that there is no completely fair way to compare human and AI performance on a game.
KW - AI benchmarks
KW - Fairness
KW - Game AI
KW - Games
UR - http://www.scopus.com/inward/record.url?scp=85072824393&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85072824393&partnerID=8YFLogxK
U2 - 10.1145/3337722.3337750
DO - 10.1145/3337722.3337750
M3 - Conference contribution
AN - SCOPUS:85072824393
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 14th International Conference on the Foundations of Digital Games, FDG 2019
A2 - Khosmood, Foaad
A2 - Pirker, Johanna
A2 - Apperley, Thomas
A2 - Deterding, Sebastian
PB - Association for Computing Machinery
Y2 - 26 August 2019 through 30 August 2019
ER -