Advanced Survey of Sequential Decision Making
Is Q-learning guaranteed to converge?
Under what conditions?
Which is more sample-efficient? Model-based or Model-free Reinforcement Learning?
How much computational power did AlphaGo use?
Do you need less for chess?
How much more do you need to learn StarCraft?
(Assignments)
Since the course relies on student participation, I prefer for students to enroll rather than audit.
If you intend to audit: