berkeley who has a deep reinforcement learning course
by Cale Wiza
Published 3 years ago
Updated 3 years ago
10 min read
Prerequisites
CS189 or equivalent is a prerequisite for the course. This course will assume some familiarity with reinforcement learning, numerical optimization, and machine learning. For introductory material on RL and MDPs, see the CS188 EdX course, starting with Markov Decision Processes I, as well as Chapters 3 and 4 of Sutton & Barto.
Technology
Piazza will be used for announcements, general questions and discussions, clarifications about assignments, student questions to each other, and so on. If you are a UC Berkeley student enrolled in the course, and haven't already been added to Piazza, please email Kevin.
Late Policy
All assignments must be turned in via Gradescope on time. We will allow a total of five late days cumulatively. We will not make any additional allowances for late assignments: the late days are intended to provide for exceptional circumstances, and students should avoid using them unless absolutely necessary.
Lecture Videos
The course lectures are available below. The course is not being offered as an online course, and the videos are provided only for your personal informational and entertainment purposes. They are not part of any course requirement or degree-bearing university program. For all videos, click here. For livestream, click here.
Below you can find an outline of the course. Slides and references will be posted as the course proceeds. 1. Jan 18: Introduction and course overview (Levine, Finn, Schulman) 1.1. Slides: Levine 1.2. Slides: Finn 1.3. Slides: Schulman 2. Jan 23: Supervised learning and decision making (Levine) 2.1. Slides 2.2. End to End Learning for Self-Driving Cars 2.3. A Reduction of Imitation L…
CS189 or equivalent is a prerequisite for the course. This course will assume some familiarity with reinforcement learning, numerical optimization and machine learning. Students who are not familiar with the concepts below are encouraged to brush up using the references provided right below this list. We’ll review this material in class, but it will be rather cursory. 1. Reinforcement le…
John's lecture series at MLSS 1. Lecture 1: intro, derivative free optimization 2. Lecture 2: score function gradient estimation and policy gradients 3. Lecture 3: actor critic methods 4. Lecture 4: trust region and natural gradient methods, open problems
Courses 1. Dave Silver’s course on reinforcement learning / Lecture Videos 2. Nando de Freitas’ course on machine learning 3. Andrej Karpathy’s course on neural networks