Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments

Paper

"Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments"
Maciej Aleksandrowicz, Joanna Jaworek-Korjakowska
Machine Vision Group 2023

Figure 2: The methodology used in this work. First, the agents are trained in a fixed environment. Then an evaluation procedure is performed on the transferred agent to a slightly changed environment. Please note, that from the perspective of the agent the space and observation domain are the same — only the underlying dynamics of the environment are different. Finally, the obtained results (in terms of total reward) are used to calculate generalization metrics.

Note

This repository contains code for the research article.

Important

The underlying code was used for research, not for production. Proceed with caution.

Project strcture

Inside the ws you can find:

dmc_custom_envs folder with hacked DMC Benchmark Suite environments, which allows to parametrize one constant variable in the environment,
check_gpu.ipnyb - a simple sanity check for GPU detection,
evaluation.ipnyb - notebook for running trained models on parametrized environments,
generate_charts.ipnyb - notebook for generating charts,
training.ipnyb - notebook for training models on DMC suite.

Tip

The results were obtained by running: 1. training, 2. evaluation, 3. chart generation.

Building & running the container

The all dependencies of this project are packed into a single Docker Container.

TL;DR for your convenience: to run this project, you only need to install Docker Engine and Docker Compose on your machine. The project has been developed and tested on Ubuntu 22.04 and Arch Linux.

After obtaining working Docker Engine & Docker Compose:

Create the project workspace directory or just clone this repository with a prepared ws folder.
Enter the docker directory.
Edit the bind-mount paths in docker-compose.yaml file (i.e. change /home/macal/paper_ws to your project path ws). You need to specify the workspace directory, zshrc and zsh_history files.
Inside the docker subdirectory, build and run the container:

docker-compose build
docker-compose up

The entrypoint of the project will start a local Jupyter Lab instance. Please click on the link inside the terminal to proceed further.

Figures

The published version of our article contains only raster images. To address that inconvenience we provide the original vector files in the figures subfolder.

Please note that the figure 3 is taken from dm_control: Software and tasks for continuous control.

Citation

If you find this project useful for your research, please cite our work with the following BibTeX entry:

@article{Aleksandrowicz2023,
    author = {Maciej Aleksandrowicz and Joanna Jaworek-Korjakowska},
    doi = {doi:10.2478/jaiscr-2024-0003},
    url = {https://doi.org/10.2478/jaiscr-2024-0003},
    title = {Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments},
    journal = {Journal of Artificial Intelligence and Soft Computing Research},
    number = {1},
    volume = {14},
    year = {2023},
    pages = {45--61}
}

For your convenience, here is the citation in MLA formatting:

Aleksandrowicz, Maciej and Jaworek-Korjakowska, Joanna. "Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments" Journal of Artificial Intelligence and Soft Computing Research, vol.14, no.1, 2023, pp.45-61. https://doi.org/10.2478/jaiscr-2024-0003

Contact: Machine Vision Group Website

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
docker		docker
figures		figures
ws		ws
.gitignore		.gitignore
CITATION		CITATION
CITATION.bib		CITATION.bib
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments

Paper

Project strcture

Building & running the container

Figures

Citation

About

Languages

License

macmacal/drl_generalization_metrics

Folders and files

Latest commit

History

Repository files navigation

Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments

Paper

Project strcture

Building & running the container

Figures

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages