RL agents essentially do backward inductionWe can think of value iteration in RL as actually doing backward induction, and thus get a better intuition of what it does.Jun 15, 2021Jun 15, 2021