Skip to content

Latest commit

 

History

History
14 lines (13 loc) · 349 Bytes

File metadata and controls

14 lines (13 loc) · 349 Bytes

112-1 強化學習專論

  • 教授:吳毅成

作業說明

  • Lab1: Temporal Difference Learning Demo for Game 2048
    • 117/120
  • Lab2: Deep Q-Network for Atari MsPacman-v5
    • 116/120
  • Lab3: Proximal Policy Optimization for Atari Enduro-v5
    • 120/120
  • Lab4: Twin Delayed DDPG for CarRacing-v2
    • 119/130
  • Project: Racecar_gym
    • 95.6/100