Skip to content

Latest commit

 

History

History
9 lines (9 loc) · 963 Bytes

RewardsInvRL.md

File metadata and controls

9 lines (9 loc) · 963 Bytes

Reward Design, Inverse RL

  • Inverse Reward Design[Paper]
    • Dylan Hadfield-Menell, Smitha Milli, Stuart J Russell, Pieter Abbeel, Anca Dragan, NIPS, 2017
  • Cooperative Inverse Reinforcement Learning [Paper] [Lecture Slides]
    • Dylan Hadfield-Menell, Anca Dragan Pieter Abbeel Stuart Russell, NIPS, 2016
  • Reward Design via Online Gradient Ascent [Paper]
    • Jonathan Sorg, Satinder Singh, Richard L. Lewis, NIPS, 2010
  • Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games [Paper]
    • Xiaoxiao Guo, Satinder Singh, Richard Lewis, Honglak Lee, IJCAI, 2016