Releases · tao-pr/q-exp

30 Mar 14:25

tao-pr

0.0.2

c35b351

Generalisation Latest

Latest

Q-learning now has its built-in generalisation using gradient descent over linear combination of policy variables. Also, add another sample falling stones to demonstrate how generalisation is used.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: tao-pr/q-exp

Generalisation