sjtuytc/AAAI21-RoutineAugmentedPolicyLearning: Source code for the AAAI21 publication "Augmenting Policy Learning with Routines Discovered from a Single Demonstration"

Sec0: Introduction

This is the official code release for our AAAI21 paper, "Augmenting Policy Learning with Routines Discovered from a Single Demonstration".

Authors: Zelin Zhao (me), Chuang Gan, Jiajun Wu, Xiaoxiao Guo, Joshua Tenenbaum.

Work was done during Zelin’s internship at MIT.

Paper link: https://arxiv.org/abs/2012.12469

Sec1: Installation

Install Miniconda

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh

Create and activate an environment

conda create -n baselines python=3.7
conda activate baselines

Install libraries

pip install tensorflow-gpu==1.14 ffmpeg-python matplotlib
pip install gym 
pip install gym[atari]

Install baselines

git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .

Install PyTorch

conda install pytorch torchvision -c pytorch
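
Before launching anything, it can help to confirm that the installed packages are visible from the active environment. The following is a minimal sketch and not part of the repository. The module names are assumed from the install commands above (`ffmpeg` is the import name of ffmpeg-python), and the helper `find_missing_modules` is a hypothetical name used only for illustration.

```python
import importlib.util

# Import names assumed from the install commands in this section.
EXPECTED_MODULES = [
    "tensorflow", "gym", "baselines", "torch",
    "torchvision", "matplotlib", "ffmpeg",
]


def find_missing_modules(names):
    """Return the module names that cannot be located in the current environment."""
    missing = []
    for name in names:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is None:
            missing.append(name)
    return missing


missing = find_missing_modules(EXPECTED_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All expected modules were found.")
```

The check only locates the modules and does not import them, so it will not detect a broken GPU setup or a version mismatch.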

Sec3: Training expert policy

python launch.py --mode expert --seed 0

Sec4: Make a demonstration and abstract routines

python launch.py --mode abstraction --seed 0

Sec5: Train and test command

python launch.py --mode routine --seed 0
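
The three commands in Sec3 to Sec5 can be chained from one small script. The sketch below is an illustration rather than part of the repository: it assumes the stages are meant to run in the order the sections present them and with the same seed, and the helper names `build_commands` and `run_pipeline` are hypothetical. Only the `--mode` and `--seed` flags shown above are used.

```python
import subprocess
import sys

# The three modes, in the order the sections above present them.
MODES = ["expert", "abstraction", "routine"]


def build_commands(seed):
    """Build one launch.py command per stage for the given seed."""
    return [
        [sys.executable, "launch.py", "--mode", mode, "--seed", str(seed)]
        for mode in MODES
    ]


def run_pipeline(seed, dry_run=True):
    """Print each stage's command; execute it only when dry_run is False."""
    for command in build_commands(seed):
        print(" ".join(command))
        if not dry_run:
            # check=True stops the pipeline if a stage exits with an error.
            subprocess.run(command, check=True)


run_pipeline(seed=0)  # dry run: prints the commands without executing them
```

Calling `run_pipeline(seed=0, dry_run=False)` from the repository root would execute the stages one after another and stop at the first failure.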

Troubleshooting

ValueError: Cannot feed value of shape (1, 210, 160, 12) for Tensor 'Placeholder:0', which has shape '(?, 84, 84, 4)'

This error is caused by a gym version mismatch. Make sure the installed gym version is 0.10.5, for example with pip install gym==0.10.5.
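
To see which gym version the active environment actually provides, a quick check such as the following can be run before training. It is a minimal sketch, not part of the repository: the helper names are hypothetical, and it assumes the installed gym exposes its version as `gym.__version__`.

```python
REQUIRED_GYM_VERSION = "0.10.5"  # version named in the troubleshooting note


def gym_version_matches(installed, required=REQUIRED_GYM_VERSION):
    """Return True when the installed version string equals the required one."""
    return installed.strip() == required


def installed_gym_version():
    """Return the installed gym version string, or None if it cannot be determined."""
    try:
        import gym
    except ImportError:
        return None
    # Assumption: gym exposes its version as gym.__version__.
    return getattr(gym, "__version__", None)


version = installed_gym_version()
if version is None:
    print("gym version could not be determined; is gym installed?")
elif gym_version_matches(version):
    print("gym version is", version, "as required.")
else:
    print(f"gym {version} found; try: pip install gym=={REQUIRED_GYM_VERSION}")
```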

