Key concepts in reinforcement learning (RL)

  1. policy network
  2. policy gradients
  3. sparse rewards
  4. reward shaping
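
As a rough illustration of how the four terms fit together, here is a minimal sketch in plain Python. Everything in it is an assumption made for illustration: a two-element logit vector stands in for a policy network, the update is a bare REINFORCE-style policy gradient step on a two-armed bandit, and the helper names (softmax, grad_log_prob, sparse_reward, shaped_reward, reinforce_bandit) are hypothetical. The shaping function uses the potential-based form r + gamma * phi(s') - phi(s), which is one common way to shape a sparse reward, not the only one.

```python
import math
import random


def softmax(logits):
    """Turn raw scores into action probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def grad_log_prob(logits, action):
    """Gradient of log softmax(logits)[action] with respect to the logits.

    For a softmax policy this is one_hot(action) - probs.
    """
    probs = softmax(logits)
    return [(1.0 if i == action else 0.0) - p for i, p in enumerate(probs)]


def sparse_reward(next_state, goal):
    """Sparse reward: nonzero only when the goal state is reached."""
    return 1.0 if next_state == goal else 0.0


def shaped_reward(reward, state, next_state, potential, gamma=0.99):
    """Potential-based shaping: add gamma * phi(s') - phi(s) to the reward."""
    return reward + gamma * potential(next_state) - potential(state)


def reinforce_bandit(steps=2000, lr=0.1, seed=0):
    """REINFORCE-style updates on a two-armed bandit (illustrative only).

    The logit vector is a stand-in for a policy network's output layer.
    Action 1 pays reward 1.0 and action 0 pays 0.0.
    """
    rng = random.Random(seed)
    logits = [0.0, 0.0]
    for _ in range(steps):
        probs = softmax(logits)
        action = 0 if rng.random() < probs[0] else 1
        reward = 1.0 if action == 1 else 0.0
        # Policy gradient step: move logits along reward * grad log pi(a).
        grad = grad_log_prob(logits, action)
        logits = [w + lr * reward * g for w, g in zip(logits, grad)]
    return softmax(logits)


if __name__ == "__main__":
    print(reinforce_bandit())
    # Hypothetical 1-D chain with the goal at state 5: the sparse reward is
    # zero away from the goal, while the shaped reward still signals progress.
    potential = lambda s: s - 5  # negative distance to the goal
    r = sparse_reward(3, goal=5)
    print(r, shaped_reward(r, 2, 3, potential, gamma=1.0))
```

In this sketch the sparse reward gives no signal until the goal is reached, while the shaped reward is positive for any step toward the goal. That denser signal is the motivation usually given for reward shaping.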
