"Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments"
Maciej Aleksandrowicz, Joanna Jaworek-Korjakowska
Machine Vision Group 2023
Figure 2: The methodology used in this work. First, the agents are trained in a fixed environment. Then an evaluation procedure is performed on the transferred agent to a slightly changed environment. Please note, that from the perspective of the agent the space and observation domain are the same — only the underlying dynamics of the environment are different. Finally, the obtained results (in terms of total reward) are used to calculate generalization metrics.
Note
This repository contains code for the research article.
Important
The underlying code was used for research, not for production. Proceed with caution.
Inside the ws
you can find:
dmc_custom_envs
folder with hacked DMC Benchmark Suite environments, which allows to parametrize one constant variable in the environment,check_gpu.ipnyb
- a simple sanity check for GPU detection,evaluation.ipnyb
- notebook for running trained models on parametrized environments,generate_charts.ipnyb
- notebook for generating charts,training.ipnyb
- notebook for training models on DMC suite.
Tip
The results were obtained by running: 1. training, 2. evaluation, 3. chart generation.
The all dependencies of this project are packed into a single Docker Container.
TL;DR for your convenience: to run this project, you only need to install Docker Engine and Docker Compose on your machine. The project has been developed and tested on Ubuntu 22.04
and Arch Linux
.
After obtaining working Docker Engine
& Docker Compose
:
- Create the project workspace directory or just clone this repository with a prepared
ws
folder. - Enter the
docker
directory. - Edit the bind-mount paths in
docker-compose.yaml
file (i.e. change/home/macal/paper_ws
to your project pathws
). You need to specify the workspace directory,zshrc
andzsh_history
files. - Inside the
docker
subdirectory, build and run the container:
docker-compose build
docker-compose up
- The entrypoint of the project will start a local Jupyter Lab instance. Please click on the link inside the terminal to proceed further.
The published version of our article contains only raster images. To address that inconvenience we provide the original vector files in the figures
subfolder.
Please note that the figure 3 is taken from dm_control: Software and tasks for continuous control.
If you find this project useful for your research, please cite our work with the following BibTeX entry:
@article{Aleksandrowicz2023,
author = {Maciej Aleksandrowicz and Joanna Jaworek-Korjakowska},
doi = {doi:10.2478/jaiscr-2024-0003},
url = {https://doi.org/10.2478/jaiscr-2024-0003},
title = {Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments},
journal = {Journal of Artificial Intelligence and Soft Computing Research},
number = {1},
volume = {14},
year = {2023},
pages = {45--61}
}
For your convenience, here is the citation in MLA formatting:
Aleksandrowicz, Maciej and Jaworek-Korjakowska, Joanna. "Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments" Journal of Artificial Intelligence and Soft Computing Research, vol.14, no.1, 2023, pp.45-61. https://doi.org/10.2478/jaiscr-2024-0003
Contact: Machine Vision Group Website