Code for our ICML'23 paper: Actor-Critic Alignment for Offline-to-Online Reinforcement Learning
- Pull this repo
git clone git@github.com:ZishunYu/ACA.git; cd ACA
- Create a conda virtual env
conda create --name ACA python=3.7.4; conda activate ACA
- Install MuJoCo200 following the official documentation
- Install d4rl
git clone https://github.com/rail-berkeley/d4rl.git; cd d4rl; pip3 install -e .; cd ..
- Install requirements
pip3 install -r requirements.txt
- Download the offline pretrained models from here (Google Drive)
- Run an experiment with
python3 run_aca.py --dataset hopper-medium-v2 --seed 1
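
To repeat the command above over several seeds, a small driver script can help. The following is a minimal sketch and not part of this repo: the helper names `build_commands` and `run_all` and the seed list are hypothetical, and only the `--dataset` and `--seed` flags shown above are used. It prints the commands by default and launches them only when `dry_run=False`.

```python
import subprocess


def build_commands(dataset, seeds):
    """Return one run_aca.py command (as an argv list) per seed."""
    return [
        ["python3", "run_aca.py", "--dataset", dataset, "--seed", str(seed)]
        for seed in seeds
    ]


def run_all(commands, dry_run=True):
    """Print each command; execute it sequentially unless dry_run is True."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the sweep at the first failing run
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical seed list; adjust as needed.
    run_all(build_commands("hopper-medium-v2", [1, 2, 3]))
```

Runs are launched one after another, which keeps GPU memory use predictable at the cost of wall-clock time.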
- MuJoCo installation troubleshooting (see also the MuJoCo official git page)
If you see
ImportError: libpython3.7m.so.1.0: cannot open shared object file: No such file or directory
try setting the library path before running the experiment:
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/PATH/TO/CONDA/envs/ACA/lib
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/YOUR_USER_NAME/.mujoco/mujoco200/bin
If you see
OSError: /some/path/mujoco/libmujoco200.so: undefined symbol: __glewBindBuffer
try installing libglfw3 and libglew2.0:
conda install -c menpo glfw3
conda install -c conda-forge glew==2.0.0
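
Before launching an experiment, it can help to confirm that the directories from the export lines above are actually on `LD_LIBRARY_PATH`. The following is a minimal sketch, not part of this repo: the helper name `missing_lib_dirs` is hypothetical, and the two paths are the placeholders from the export lines, which must be replaced with your own.

```python
import os


def missing_lib_dirs(ld_library_path, required_dirs):
    """Return the entries of required_dirs absent from an LD_LIBRARY_PATH-style string."""
    # LD_LIBRARY_PATH is colon-separated; normalize to ignore trailing slashes
    present = {os.path.normpath(p) for p in ld_library_path.split(":") if p}
    return [d for d in required_dirs if os.path.normpath(d) not in present]


if __name__ == "__main__":
    # Placeholder paths copied from the export lines; replace with your own.
    required = [
        "/PATH/TO/CONDA/envs/ACA/lib",
        "/home/YOUR_USER_NAME/.mujoco/mujoco200/bin",
    ]
    current = os.environ.get("LD_LIBRARY_PATH", "")
    for d in missing_lib_dirs(current, required):
        print("not on LD_LIBRARY_PATH:", d)
```

The check only compares path strings; it does not verify that the shared libraries themselves exist in those directories.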
@InProceedings{pmlr-v202-yu23k,
title = {Actor-Critic Alignment for Offline-to-Online Reinforcement Learning},
author = {Yu, Zishun and Zhang, Xinhua},
booktitle = {Proceedings of the 40th International Conference on Machine Learning},
pages = {40452--40474},
year = {2023},
editor = {Krause, Andreas and Brunskill, Emma and Cho, Kyunghyun and Engelhardt, Barbara and Sabato, Sivan and Scarlett, Jonathan},
volume = {202},
series = {Proceedings of Machine Learning Research},
month = {23--29 Jul},
publisher = {PMLR},
pdf = {https://proceedings.mlr.press/v202/yu23k/yu23k.pdf},
url = {https://proceedings.mlr.press/v202/yu23k.html},
}