Projects using ChainerRL

J. Halverson, B. Nelson, and F. Ruehle, “Branes with Brains: Exploring String Vacua with Deep Reinforcement Learning,” arXiv
T. Akazaki, S. Liu, Y. Yamagata, Y. Duan, and J. Hao, “Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning,” in the 22nd International Symposium on Formal Methods, 2018. arXiv
J. D. C. Yuxuan, L. Abhishek, G. Benjamin, E. Pieter, and A. Sergey, “Self-Consistent Trajectory Autoencoder : Hierarchical Reinforcement Learning with Trajectory Embeddings,” in ICML, 2018. arXiv
Y. Fujita and S. Maeda, “Clipped Action Policy Gradient,” in ICML, 2018. arXiv code
橋本さゆり, 金子晃, and 小林一郎, “深層強化学習を用いたロボットの自然言語による制御への取組み,” in 言語処理学会第24回年次大会発表論文集, 2018.
大橋耕也, 幸島匡宏, 堤田恭太, 松林達史, and 戸田浩之, “深層強化学習による車両移動経路と信号機の同時最適化,” in 第10回データ工学と情報マネジメントに関するフォーラム, 2018.
竹原一彰, “深層強化学習による対話メディアのモデリング,” オペレーションズ・リサーチ = Commun. Oper. Res. Soc. Japan 経営の科学, vol. 62, no. 11, pp. 725–730, 2017.

Provide feedback