Framework
VMAgent's framework is mainly based on pymarl. It consists of five parts: controller, learner, components, modules, and utils.
Controller
The controller outputs the agent's actions during sampling.
Learner
The learner updates the agent's policy.
Components
It provides key components for learning and sampling. It includes the implementations of replay memory and action selector (i.e., epsilon-greedy action selection).
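As a rough illustration of these two components, the following is a minimal sketch of an epsilon-greedy action selector and a fixed-size replay memory. The class names, method names, and the `avail_actions` mask are assumptions made for this example; they are not taken from VMAgent or pymarl, whose actual implementations may differ.

```python
import random
from collections import deque


class EpsilonGreedySelector:
    """Hypothetical epsilon-greedy selector (not VMAgent's actual class)."""

    def __init__(self, epsilon, seed=None):
        if not 0.0 <= epsilon <= 1.0:
            raise ValueError("epsilon must be in [0, 1]")
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def select(self, q_values, avail_actions):
        # Indices of the actions that are currently allowed.
        candidates = [a for a, ok in enumerate(avail_actions) if ok]
        if not candidates:
            raise ValueError("no available action")
        # Explore: with probability epsilon pick a random allowed action.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(candidates)
        # Exploit: pick the allowed action with the highest Q-value.
        return max(candidates, key=lambda a: q_values[a])


class ReplayMemory:
    """Hypothetical fixed-capacity replay memory (assumed interface)."""

    def __init__(self, capacity, seed=None):
        # A deque with maxlen drops the oldest item once capacity is reached.
        self.buffer = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def push(self, transition):
        self.buffer.append(transition)

    def sample(self, batch_size):
        # Uniform sampling without replacement.
        return self.rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)
```

In a setup like this, the sampling side would call `select` and `push`, while the training side would draw batches with `sample`.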
Modules
It provides the agent's networks, including the critic network and the policy network.
Utils
It provides several utility functions for reinforcement learning.