TY - JOUR
T1 - Efficient communication in multi-agent reinforcement learning via variance based control
AU - Zhang, Sai Qian
AU - Zhang, Qi
AU - Lin, Jieyu
N1 - Publisher Copyright:
© 2019 Neural information processing systems foundation. All rights reserved.
PY - 2019
Y1 - 2019
N2 - Multi-agent reinforcement learning (MARL) has recently received considerable attention due to its applicability to a wide range of real-world applications. However, achieving efficient communication among agents has always been an overarching problem in MARL. In this work, we propose Variance Based Control (VBC), a simple yet efficient technique to improve communication efficiency in MARL. By limiting the variance of the exchanged messages between agents during the training phase, the noisy component in the messages can be eliminated effectively, while the useful part can be preserved and utilized by the agents for better performance. Our evaluation using multiple MARL benchmarks indicates that our method achieves 2 - 10× lower in communication overhead than state-of-the-art MARL algorithms, while allowing agents to achieve better overall performance.
AB - Multi-agent reinforcement learning (MARL) has recently received considerable attention due to its applicability to a wide range of real-world applications. However, achieving efficient communication among agents has always been an overarching problem in MARL. In this work, we propose Variance Based Control (VBC), a simple yet efficient technique to improve communication efficiency in MARL. By limiting the variance of the exchanged messages between agents during the training phase, the noisy component in the messages can be eliminated effectively, while the useful part can be preserved and utilized by the agents for better performance. Our evaluation using multiple MARL benchmarks indicates that our method achieves 2 - 10× lower in communication overhead than state-of-the-art MARL algorithms, while allowing agents to achieve better overall performance.
UR - http://www.scopus.com/inward/record.url?scp=85090174215&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85090174215&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85090174215
SN - 1049-5258
VL - 32
JO - Advances in Neural Information Processing Systems
JF - Advances in Neural Information Processing Systems
T2 - 33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019
Y2 - 8 December 2019 through 14 December 2019
ER -