On the robustness of cooperative multi-agent reinforcement learning

Jieyu Lin, Kristina Dzeparoska, Sai Qian Zhang, Alberto Leon-Garcia, Nicolas Papernot

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In cooperative multi-agent reinforcement learning (c-MARL), agents learn to cooperatively take actions as a team to maximize a total team reward. We analyze the robustness of c-MARL to adversaries capable of attacking one of the agents on a team. Through the ability to manipulate this agent's observations, the adversary seeks to decrease the total team reward. Attacking c-MARL is challenging for three reasons: first, it is difficult to estimate team rewards or how they are impacted by an agent mispredicting; second, models are non-differentiable; and third, the feature space is low-dimensional. Thus, we introduce a novel attack. The attacker first trains a policy network with reinforcement learning to find a wrong action it should encourage the victim agent to take. Then, the adversary uses targeted adversarial examples to force the victim to take this action. Our results on the StartCraft II multi-agent benchmark demonstrate that c-MARL teams are highly vulnerable to perturbations applied to one of their agent's observations. By attacking a single agent, our attack method has highly negative impact on the overall team reward, reducing it from 20 to 9.4. This results in the team's winning rate to go down from 98.9% to 0%.

Original languageEnglish (US)
Title of host publicationProceedings - 2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages62-68
Number of pages7
ISBN (Electronic)9781728193465
DOIs
StatePublished - May 2020
Event2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020 - Virtual, San Francisco, United States
Duration: May 21 2020 → …

Publication series

NameProceedings - 2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020

Conference

Conference2020 IEEE Symposium on Security and Privacy Workshops, SPW 2020
Country/TerritoryUnited States
CityVirtual, San Francisco
Period5/21/20 → …

Keywords

  • Adversarial Examples
  • Adversarial Policy
  • Cooperative Multi Agent Reinforcement Learning
  • Deep reinforcement learning
  • Machine learning
  • Q Learning
  • Robustness

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Civil and Structural Engineering
  • Safety, Risk, Reliability and Quality
  • Analysis

Fingerprint

Dive into the research topics of 'On the robustness of cooperative multi-agent reinforcement learning'. Together they form a unique fingerprint.

Cite this