TD-Gammon implementation
-
Updated
Sep 25, 2023 - Python
TD-Gammon implementation
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
Multi-Shot Approximation of Discounted Cost MDPs
Code for the Macro III course at the M.Sc/Ph.D programs at EPGE/FGV
RL with OpenAI Gym
Flappy Bird for artificial intelligence/machine learning (Agent available: Q-Learning, SARSA, and combined with Backpropagation)
GAN zoo include GAN, ACGAN, EBGAN, BEGAN, LSGAN, SAGAN, CVAE.
This is code from the Numerical Methods course at EPGE/FGV in 2018
multi-utility optimal individualized treatment regime estimation for survival data
Goal Selection Strategies for Learning Goal-Oriented Value Functions
Add a description, image, and links to the value-function topic page so that developers can more easily learn about it.
To associate your repository with the value-function topic, visit your repo's landing page and select "manage topics."