Episode Description
I have a much better understanding of Sutton’s perspective now. I wanted to reflect on it a bit.
(00:00:00) - The steelman
(00:02:42) - TLDR of my current thoughts
(00:03:22) - Imitation learning is continuous with and complementary to RL
(00:08:26) - Continual learning
(00:10:31) - Concluding thoughts
Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe