Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration

Zheng, Lulu; Chen, Jiarui; Wang, Jianhao; He, Jiamin; Hu, Yujing; Chen, Yingfeng; Fan, Changjie; Gao, Yang; Zhang, Chongjie

arXiv:2111.11032 (cs)

[Submitted on 22 Nov 2021]

Abstract: Efficient exploration in deep cooperative multi-agent reinforcement learning (MARL) remains challenging in complex coordination problems. In this paper, we introduce a novel method, Episodic Multi-agent reinforcement learning with Curiosity-driven exploration (EMC). We leverage an insight from popular factorized MARL algorithms: the "induced" individual Q-values, i.e., the individual utility functions used for local execution, are embeddings of local action-observation histories and can capture the interaction between agents due to reward backpropagation during centralized training. Therefore, we use prediction errors of individual Q-values as intrinsic rewards for coordinated exploration and use episodic memory to exploit informative explored experience to boost policy training. Because the dynamics of an agent's individual Q-value function capture the novelty of states and the influence of other agents, our intrinsic reward can induce coordinated exploration of new or promising states. We illustrate the advantages of our method with didactic examples and demonstrate that it significantly outperforms state-of-the-art MARL baselines on challenging tasks in the StarCraft II micromanagement benchmark.
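
The two ingredients named in the abstract, a curiosity signal derived from prediction errors of individual Q-values and an episodic memory of explored experience, can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so everything below is an assumption made for illustration: the function `intrinsic_reward`, the class `EpisodicMemory`, the use of a squared error averaged over agents, and the "best discounted return per state key" memory rule are hypothetical choices, not the authors' implementation.

```python
from typing import Dict, Hashable, List, Sequence


def intrinsic_reward(predicted_q: Sequence[Sequence[float]],
                     target_q: Sequence[Sequence[float]]) -> float:
    """Curiosity signal (assumed form): squared prediction error of each
    agent's individual Q-values, averaged over agents."""
    per_agent = [
        sum((p - t) ** 2 for p, t in zip(pred, targ))
        for pred, targ in zip(predicted_q, target_q)
    ]
    return sum(per_agent) / len(per_agent)


class EpisodicMemory:
    """Hypothetical episodic memory: stores the best discounted return
    observed so far from each state key."""

    def __init__(self, gamma: float = 0.99) -> None:
        self.gamma = gamma
        self.best_return: Dict[Hashable, float] = {}

    def update(self, keys: List[Hashable], rewards: List[float]) -> None:
        # Walk the episode backwards to accumulate discounted returns.
        ret = 0.0
        for key, reward in zip(reversed(keys), reversed(rewards)):
            ret = reward + self.gamma * ret
            if ret > self.best_return.get(key, float("-inf")):
                self.best_return[key] = ret

    def lookup(self, key: Hashable, default: float = 0.0) -> float:
        return self.best_return.get(key, default)


# Two agents, two actions each; only the first agent's prediction is off.
bonus = intrinsic_reward([[1.0, 2.0], [0.0, 0.0]],
                         [[1.0, 0.0], [0.0, 0.0]])

memory = EpisodicMemory(gamma=0.5)
memory.update(keys=["s0", "s1"], rewards=[1.0, 2.0])
```

In a training loop, one plausible use is to add a scaled `bonus` to the environment reward and to use `memory.lookup(...)` as an additional regression target for the joint value estimate; the scaling and the way the memory enters the loss are likewise assumptions here.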

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2111.11032 [cs.LG]
	(or arXiv:2111.11032v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.11032

Submission history

From: Lulu Zheng
[v1] Mon, 22 Nov 2021 07:34:47 UTC (9,527 KB)
