Framework

VMAgent’s framework is mainly based on PyMARL. It consists of five parts: controller, learner, components, modules, and utils.

[Figure: vmdiag]

Controller

The controller outputs the agent’s actions during sampling.
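As a rough illustration of this role, the sketch below shows a controller that maps each observation to an action by querying a value function and delegating the final choice to a pluggable selector. The names `SimpleController`, `q_fn`, `greedy_selector`, and `select_actions` are hypothetical and are not taken from VMAgent or PyMARL.

```python
from typing import Callable, List, Sequence

# Assumed interfaces, for illustration only.
QFunction = Callable[[Sequence[float]], Sequence[float]]
Selector = Callable[[Sequence[float], Sequence[int]], int]


def greedy_selector(q_values: Sequence[float], avail_actions: Sequence[int]) -> int:
    """Pick the available action with the highest value."""
    return max(avail_actions, key=lambda a: q_values[a])


class SimpleController:
    """Hypothetical controller: turns observations into actions for sampling."""

    def __init__(self, q_fn: QFunction, selector: Selector = greedy_selector) -> None:
        self.q_fn = q_fn          # maps one observation to one value per action
        self.selector = selector  # decides which available action to take

    def select_actions(
        self,
        observations: Sequence[Sequence[float]],
        avail_actions: Sequence[Sequence[int]],
    ) -> List[int]:
        """Return one action index per observation."""
        actions = []
        for obs, avail in zip(observations, avail_actions):
            if not avail:
                raise ValueError("each observation needs at least one available action")
            actions.append(self.selector(self.q_fn(obs), avail))
        return actions


# Toy value function with two actions: action 0 is worth obs[0], action 1 is worth -obs[0].
controller = SimpleController(q_fn=lambda obs: [obs[0], -obs[0]])
actions = controller.select_actions([[1.0], [-2.0]], [[0, 1], [0, 1]])
```

Keeping the selector separate from the controller means an exploration strategy can be swapped in without touching the value function.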

Learner

The learner updates the agent’s policy.
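To make the idea of a policy update concrete, here is a minimal sketch that uses tabular Q-learning as a stand-in. This is an assumption chosen for brevity: a learner in this kind of framework would more likely train neural networks. The class `TabularQLearner` and its methods are hypothetical names.

```python
from collections import defaultdict
from typing import DefaultDict, Hashable, List, Sequence, Tuple

# (state, action, reward, next_state, done): an assumed transition layout.
Transition = Tuple[Hashable, int, float, Hashable, bool]


class TabularQLearner:
    """Hypothetical learner: improves a greedy policy from sampled transitions."""

    def __init__(self, n_actions: int, lr: float = 0.5, gamma: float = 0.9) -> None:
        self.n_actions = n_actions
        self.lr = lr        # step size of each update
        self.gamma = gamma  # discount factor
        self.q: DefaultDict[Hashable, List[float]] = defaultdict(
            lambda: [0.0] * n_actions
        )

    def train(self, batch: Sequence[Transition]) -> float:
        """Apply one Q-learning update per transition; return the mean absolute TD error."""
        total = 0.0
        for state, action, reward, next_state, done in batch:
            # No bootstrapping from the next state once the episode has ended.
            bootstrap = 0.0 if done else self.gamma * max(self.q[next_state])
            td_error = reward + bootstrap - self.q[state][action]
            self.q[state][action] += self.lr * td_error
            total += abs(td_error)
        return total / len(batch) if batch else 0.0

    def greedy_action(self, state: Hashable) -> int:
        """The policy implied by the current value estimates."""
        values = self.q[state]
        return max(range(self.n_actions), key=lambda a: values[a])


learner = TabularQLearner(n_actions=2)
mean_td_error = learner.train([("s0", 1, 1.0, "s1", True)])
```

After this single update, the value of action 1 in state `"s0"` has moved halfway toward the observed reward, so the greedy policy in that state now prefers action 1.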

Components

Components provides the key building blocks for learning and sampling. It includes implementations of the replay memory and the action selector (i.e., epsilon-greedy action selection).
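The two building blocks named above can be sketched in a few lines. This is a simplified illustration under assumed interfaces: `ReplayMemory`, `epsilon_greedy`, and the transition layout are hypothetical and do not reproduce VMAgent's actual classes.

```python
import random
from collections import deque
from typing import Deque, List, Sequence, Tuple

# (state, action, reward, next_state, done): an assumed transition layout.
Transition = Tuple[object, int, float, object, bool]


class ReplayMemory:
    """Fixed-size buffer that discards the oldest transitions first."""

    def __init__(self, capacity: int) -> None:
        self.buffer: Deque[Transition] = deque(maxlen=capacity)

    def push(self, transition: Transition) -> None:
        self.buffer.append(transition)

    def can_sample(self, batch_size: int) -> bool:
        return len(self.buffer) >= batch_size

    def sample(self, batch_size: int, rng: random.Random) -> List[Transition]:
        """Draw a uniform random batch without replacement."""
        return rng.sample(list(self.buffer), batch_size)

    def __len__(self) -> int:
        return len(self.buffer)


def epsilon_greedy(q_values: Sequence[float], epsilon: float, rng: random.Random) -> int:
    """With probability epsilon pick a random action, otherwise the best one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


rng = random.Random(0)
memory = ReplayMemory(capacity=3)
for step in range(5):
    # Only the three most recent transitions survive (states 2, 3, and 4).
    memory.push((step, 0, 0.0, step + 1, False))

batch = memory.sample(2, rng)
action = epsilon_greedy([0.1, 0.9, 0.3], epsilon=0.0, rng=rng)
```

With `epsilon=0.0` the selector is purely greedy; in practice epsilon is often started high and decayed over training so that early sampling explores more.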

Modules

Modules provides the agent’s networks, including the critic network and the policy network.

Utils

Utils provides several utility functions for reinforcement learning.