Upside-Down-Reinforcement-Learning

Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch.
Based on the paper published by Jürgen Schmidhuber: ⅂ꓤ-Paper

This repository contains a discrete action space implementation and a continuous action space implementation for the OpenAI Gym CartPole environment (the continuous implementation uses a continuous-action version of the environment).

The notebooks include the training of a behavior function as well as an evaluation part, where you can test the trained behavior function. Feed it a desired reward that the agent should achieve within a desired time horizon.
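To make the idea of a command (desired reward, desired horizon) more concrete, here is a minimal sketch in plain Python. It is not taken from the notebooks: the names `hindsight_example`, `update_command`, `evaluate` and `toy_step` are hypothetical, the toy environment stands in for CartPole, and the command update rule (subtract the received reward, shorten the horizon by one step) is the convention commonly used in Upside-Down RL implementations, assumed here rather than confirmed for this repository.

```python
from typing import Callable, List, Tuple

Command = Tuple[float, int]  # (desired reward, desired horizon)


def hindsight_example(rewards: List[float], actions: List[int],
                      t1: int, t2: int) -> Tuple[Command, int]:
    """Build one supervised training pair from a finished episode.

    The command is what actually happened between steps t1 and t2,
    and the label is the action that was taken at step t1.
    """
    assert 0 <= t1 < t2 <= len(rewards)
    command: Command = (sum(rewards[t1:t2]), t2 - t1)
    return command, actions[t1]


def update_command(command: Command, reward: float) -> Command:
    """Subtract the reward just received and shorten the horizon by one
    step (never below 1). Assumed convention, see the note above."""
    desired_reward, desired_horizon = command
    return desired_reward - reward, max(desired_horizon - 1, 1)


def toy_step(state: int, action: int) -> Tuple[int, float, bool]:
    """Hypothetical stand-in environment: reward 1.0 per step, ends after 10 steps."""
    next_state = state + 1
    return next_state, 1.0, next_state >= 10


def evaluate(policy: Callable[[int, Command], int], command: Command,
             step_fn: Callable[[int, int], Tuple[int, float, bool]],
             max_steps: int = 500) -> Tuple[float, Command]:
    """Run one episode, feeding the current command to the policy at every step."""
    state, total = 0, 0.0
    for _ in range(max_steps):
        action = policy(state, command)
        state, reward, done = step_fn(state, action)
        total += reward
        command = update_command(command, reward)
        if done:
            break
    return total, command
```

In the notebooks the policy would be the trained behavior function; in this sketch any callable that maps a state and a command to an action can be passed to `evaluate`, for example `evaluate(lambda s, c: 0, (200.0, 200), toy_step)`.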

Plots for the discrete CartPole Environment:

Plots for the continuous CartPole Environment:

Plots for the LunarLander Environment:

TODO:

Test some of the possible improvements mentioned in the paper (Section 6, Future Research Directions).

Author

Sebastian Dittert

Feel free to use this code for your own projects or research. For citation, check the DOI or cite as:

@misc{Upside-Down,
  author = {Dittert, Sebastian},
  title = {PyTorch Implementation of Upside-Down RL},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/BY571/Upside-Down-Reinforcement-Learning}},
}
