Releases: tao-pr/q-exp

Generalisation

30 Mar 14:25

Q-learning now has built-in generalisation, using gradient descent over a linear combination of policy variables. This release also adds another sample, falling stones, to demonstrate how generalisation is used.
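
The idea of gradient descent over a linear combination can be sketched roughly as follows. This is a minimal illustration in Python and not the q-exp API. The names `q_value` and `gradient_step`, the feature vectors standing in for the policy variables, and the `alpha` and `gamma` defaults are all assumptions made for the example.

```python
from typing import List, Sequence


def q_value(weights: Sequence[float], features: Sequence[float]) -> float:
    """Approximate Q(s, a) as a weighted sum of features (hypothetical helper)."""
    return sum(w * f for w, f in zip(weights, features))


def gradient_step(
    weights: Sequence[float],
    features: Sequence[float],
    reward: float,
    next_features_per_action: Sequence[Sequence[float]],
    alpha: float = 0.1,  # learning rate (assumed value)
    gamma: float = 0.9,  # discount factor (assumed value)
) -> List[float]:
    """Return the weights after one gradient-descent update (hypothetical helper)."""
    # Best estimated value reachable from the next state; 0.0 if it is terminal.
    best_next = max(
        (q_value(weights, f) for f in next_features_per_action),
        default=0.0,
    )
    # Temporal-difference error between the target and the current estimate.
    td_error = reward + gamma * best_next - q_value(weights, features)
    # For a linear approximator, the gradient w.r.t. each weight is its feature.
    return [w + alpha * td_error * f for w, f in zip(weights, features)]


if __name__ == "__main__":
    # One update from zero weights, with a terminal next state.
    print(gradient_step([0.0, 0.0], [1.0, 2.0], reward=1.0, next_features_per_action=[]))
```

Because the weights are shared across all states, an update made for one state also shifts the estimates for every state with similar features, which is what lets the learner generalise to states it has not visited.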