Announcements

May 2nd Announcements

  • Last day of class!
  • Next fall: Optimization!
  • Project reports due 5/7 (Start Writing!)

April 30th announcements

  • Online Quiz 3
  • Final Projects
  • FCQs due tonight

April 25th Announcements

  • FCQs open until 4/30
  • Final Project due 5/7
  • Online Quiz due next week

April 16th Announcements

  • HW 5 Leaderboard
  • Class Poll
  • Exam: Thursday-Saturday

March 12 Announcement

  • Quiz due Tomorrow (Question)
  • HW5 Due Next Friday (autograder still needs to be released)

March 5th Announcements

  • Exam 2 -> Weekly Quizzes
  • Exam 1 graded

Feb 27

  • HW 3 Completed
  • Project Ideas, due Feb 29
  • HW 4 Released, due March 7
  • Colorado Primary, March 5th 🇺🇸
    • Vote at the UMC
    • If you're unaffiliated, you can vote in either party's primary

Feb 15

  • Exam today released at 1 today, due 11:59 pm tomorrow
    • 120 minutes
    • Four problems
    • Four pages with room to write on (not required)
    • Office Hours 6-7, private questions on piazza
    • ChatGPT countermeasures: Images - not text, some consistent mistakes

Feb 13, 2024

  • Leaderboard
  • Exam Thursday and Friday
  • Homework 3
  • Review Session

Feb 1st, 2024

  • HW 1 Leaderboard, Solution
  • No Class next Tuesday

January 30th 2024

  • HW2 due NEXT Friday at 5pm

Jan 25, 2024

  • HW 1 due Friday
  • Office Hours 1-2

Jan 18 2024

  • HW 1 Released (will demo at end)
  • HW Challenge Problems
  • OH 2-3 today (AERO 263)

May 2nd 2023

  • Project: Start Writing!
  • FCQs
  • Programming assignment

ASEN 6519-001: Advanced Survey of Sequential Decision Making (Fall 2021)

  • Filet-mignon level understanding!
  • Exploring the latest literature
  • No homework! Final project, presentations, conceptual review quizzes
  • Presentations: how to communicate about research

"I consider this class among the most productive and informative I've taken while at CU." - 2021 student

AKA DMU++

April 27, 2023

  • Exam 3
  • Final Project
  • Seminar: Friday 2:45 AERO 114

Today: Overview of related topics

  • Imitation Learning
  • Inverse Reinforcement Learning (IRL)
  • Transfer Learning
  • Meta Learning

April 4, 2023

  • Exam 2: Thursday-Friday
  • HW 5 leaderboard
  • HW6 Assigned

March 14, 2023

  • Exam 1 scores
  • Tips for getting RL to work
  • Homework 5
  • Python for HW 5

Happy policy day!

March 9, 2023

  • Schedule
  • Writing large programs in Julia

March 7, 2023

  • Survey (required by tonight)
  • HW 4 due Thursday

March 2nd, 2023

  • Survey
  • Leaderboard

Title Text

  • How MCTS is used in a simulation
  • Policy Gradient Figure

Feb 16, 2023

  • Exam 1, Monday-Tuesday
  • HW2, Problem 3

February 14th, 2023

  • HW 2 Leaderboard
  • HW 3 Released
  • HW 1 Grades Posted
  • Exam 1 Next Tuesday

Feb 9th 2023

  • Leaderboard
  • HW2 Extended
  • HW1 solutions posted
  • Extra Office Hours
  • dot operator

Feb 2nd, 2023

  • Leaderboard
    • You can submit as many times as you want
  • Jackson Office Hours

Jan 26th 2023

  • HW 1 due tonight
  • HW 2 released tomorrow

Jan 24th 2020

  • Office hours changed from 4:30-5:30 Tuesdays
  • Loose ends from last lecture
  • ChatGPT
  • Remote study groups

Jan 19 2023

  • Julia 1.8
  • Back to problem formulation
  • Jackson
  • Schedule
  • Slides

Feb 1, 2022

  • HW2 due this evening
  • Quiz 1 next week
  • HW3 released this week

3 Bellman/Dynamic Programming Equations

February 3rd, 2022

  • HW2 Leaderboard
  • Quiz 1, next Wednesday and Thursday
  • HW 3

Breakout Rooms

1. Grid World

Feb 8th, 2022

  • HW3 Released, due Wednesday
  • Quiz 1 Tomorrow at Noon to Thursday

Feb 10th

  • Quiz 1 finished by tonight at midnight (10 submissions so far)
  • HW 3 Due Feb 17th at 5pm

Feb 10

HW Due Thursday

Feb 17

  • HW3 Due Saturday at 5 pm
  • HW4 Released Soon

Discuss

loop

    \(\tau \gets \text{simulate}(\pi_\theta)\)

    \(\theta \gets \theta + \alpha \sum_{k=0}^d \nabla_\theta \log \pi_\theta(a_k \mid s_k) \gamma^k r_{k, \text{to go}} \)

\[\pi_\theta (a=L \mid s) = \frac{\theta_1}{\theta_1 + \theta_2}\]

\[\pi_\theta (a=R \mid s) = \frac{\theta_2}{\theta_1 + \theta_2}\]

  1. 1. Given \(\theta = (0.2, 0.8)\) calculate \(\sum_{k=0}^d \nabla_\theta \log \pi_\theta(a_k \mid s_k) \gamma^k r_{k, \text{to go}} \) for two cases, (a) where \(a_0 = R\) and (b) where \(a_0 = L\)
  2. What happens if \(\theta_1 \to 0\)

Feb 22

  • HW 3 Leaderboard
  • HW 4
  • Quiz almost graded

Feb 24

  • Look at Schedule
  • Project Proposal Assignment
  • HW 4 Due Tuesday
  • HW 5: Get started early!

Perspective

"The biggest lesson that can be read from 70 years of AI research is that general methods that leverage computation are ultimately the most effective, and by a large margin." - Richard Sutton

The Bitter Lesson

March 3rd

  • Quiz 1 solutions released
  • Schedule: Project Proposals, HW 5

March 15

  • HW5 Due Today
  • HW6 Out this week
  • Quiz 2 (RL and POMDPs): March 29th - study materials out this week

Alpha Vector Recap

  • How many states
  • X and Y axis
  • Value Function
  • Each alpha vector corresponds to a conditional plan
  • Can multiple alpha vectors correspond to the same action?
  • How to execute a policy

March 29

  • HW6: Due 4/7
  • Quiz 2: tomorrow noon - Thursday midnight

April 18

  • HW6 Leaderboard Results
  • Quiz 3
  • Project

The American diplomat George Kennan, one of the architects of that order, once wrote that a healthy American foreign policy should create “the impression of a country which knows what it wants.” Yet it is just as important to know what one’s adversaries want. American adventures in East Asia, particularly, are notable for their long history of governments talking past one another.

Game Theory Quotes

"The single best piece of advice I ever got on marriage is that there is no use in thinking of your partner as a single stable entity that exists separate from you. There’s only your partner in a dynamic with you." - Ezra Klein

April 26, 2022

  • FCQs due tonight!
  • Final Project due May 3rd
  • Watch AlphaGo Documentary

Quizzes (grades in general) are for revealing how well you understand something

000-Announcements

By Zachary Sunberg

000-Announcements

  • 278