Multi-Agent Training Framework

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

Population-based multi-agent reinforcement learning (PB-MARL) refers to a family of methods nested with reinforcement learning (RL) algorithms, which produce a self-generated sequence of tasks arising from the coupled population dynamics. By …