
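This repository does not declare an open-source license file (LICENSE); before use, check the project description and its upstream code dependencies.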



An implementation of MADDPG

1. Introduction

This is a PyTorch implementation of the multi-agent deep deterministic policy gradient (MADDPG) algorithm.
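
As a rough illustration of the idea behind MADDPG: each agent's actor acts on its own observation only, while each agent's critic is trained on the observations and actions of all agents. The sketch below is plain Python and is not code from this repo; `actor_input` and `critic_input` are hypothetical helper names that only show what each network would receive.

```python
from typing import List, Sequence


def actor_input(observations: Sequence[Sequence[float]], agent: int) -> List[float]:
    """Decentralized actor: agent `agent` sees only its own observation."""
    return list(observations[agent])


def critic_input(
    observations: Sequence[Sequence[float]],
    actions: Sequence[Sequence[float]],
) -> List[float]:
    """Centralized critic: all observations followed by all actions, flattened."""
    joint: List[float] = []
    for obs in observations:
        joint.extend(obs)
    for act in actions:
        joint.extend(act)
    return joint


# Two agents, each with a 3-dimensional observation and a 2-dimensional action
# (the dimensions are made up for the example).
obs = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]
acts = [[1.0, -1.0], [0.5, 0.0]]
```

With these toy inputs, each actor would take a 3-dimensional vector and each critic a vector of length 2 * 3 + 2 * 2 = 10.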

The experimental environment is a modified version of Waterworld based on MADRL.

2. Environment

The main features (different from MADRL) of the modified Waterworld environment are:

Evaders and poisons now bounce off the walls, obeying physical rules.
The evaders, pursuers and poisons are now the same size, so that random actions lead to average rewards around 0.
Exactly n_coop agents are needed to catch food.
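
Two of the rules above can be sketched in a few lines of Python. This is a minimal sketch, not the environment's actual code: it assumes a unit-square arena, a perfectly elastic reflection as the "physical rule", and a literal reading of "exactly n_coop". The names `bounce`, `step` and `food_caught` are hypothetical.

```python
from typing import List, Sequence, Tuple


def bounce(pos: float, vel: float, lo: float = 0.0, hi: float = 1.0) -> Tuple[float, float]:
    """Reflect a 1-D position/velocity pair off walls at `lo` and `hi`.

    Assumes the overshoot is smaller than the arena width, so a single
    reflection is enough.
    """
    if pos < lo:
        return lo + (lo - pos), -vel
    if pos > hi:
        return hi - (pos - hi), -vel
    return pos, vel


def step(pos: Sequence[float], vel: Sequence[float], dt: float = 1.0) -> Tuple[List[float], List[float]]:
    """Move a point by vel * dt, then bounce each axis independently."""
    moved = [bounce(p + v * dt, v) for p, v in zip(pos, vel)]
    return [m[0] for m in moved], [m[1] for m in moved]


def food_caught(n_touching: int, n_coop: int) -> bool:
    """Literal reading of the rule: exactly n_coop agents must touch the food."""
    return n_touching == n_coop
```

For example, `step([0.9, 0.5], [0.2, 0.1])` moves the point past the right wall on the first axis, so that axis is mirrored back inside the arena and its velocity component changes sign, while the second axis is unaffected.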

3. Dependency

pytorch
visdom
python==3.6.1 (Anaconda or Miniconda is recommended)
opencv (required only if you need to render the environments)
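
A quick way to see which of these packages are importable in the current interpreter is sketched below. It uses only the standard library; the import names `torch`, `visdom` and `cv2` are the usual module names for pytorch, visdom and opencv, and `missing_packages` is a hypothetical helper, not part of this repo.

```python
import importlib.util
import sys

# Package name as listed above -> module name used in `import` statements.
REQUIRED = {"pytorch": "torch", "visdom": "visdom"}
OPTIONAL = {"opencv": "cv2"}  # only needed for rendering


def missing_packages(modules):
    """Return the sorted package names whose module cannot be found."""
    return sorted(
        name for name, module in modules.items()
        if importlib.util.find_spec(module) is None
    )


if __name__ == "__main__":
    print("python", sys.version.split()[0])
    print("missing required:", missing_packages(REQUIRED))
    print("missing optional:", missing_packages(OPTIONAL))
```

The check only tests whether a module can be located; it does not verify versions.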

4. Install

Install MADRL.
Replace the madrl_environments/pursuit directory with the one in this repo.
Run python main.py.
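
The directory replacement in the second step can also be scripted. The sketch below is one possible way to do it, assuming that this repo contains a `pursuit` directory at its root and that `madrl_root` points at a local MADRL checkout; both locations and the helper name `replace_pursuit` are assumptions, so adjust the paths to your setup.

```python
import shutil
from pathlib import Path


def replace_pursuit(repo_root, madrl_root):
    """Replace <madrl_root>/madrl_environments/pursuit with <repo_root>/pursuit."""
    src = Path(repo_root) / "pursuit"  # assumed location in this repo
    dst = Path(madrl_root) / "madrl_environments" / "pursuit"
    if not src.is_dir():
        raise FileNotFoundError(f"source directory not found: {src}")
    if dst.exists():
        shutil.rmtree(dst)  # drop MADRL's original copy
    shutil.copytree(src, dst)  # creates missing parent directories as needed
    return dst
```

Because the original MADRL directory is deleted, keep a backup if you want to restore the unmodified environment later.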

If scene rendering is enabled, installing opencv through conda-forge is recommended.

5. Results

Two agents, cooperation = 2

The two agents must cooperate to catch the food, which gives a reward of 10.

PNG/demo.gif

PNG/3.png

the average

PNG/4.png

One agent, cooperation = 1

PNG/newplot.png

6. TODO

Reproduce the experiments in the paper with competitive environments.

