ECS8060: AI Engineering
Home
Schedule
Labs
Project
Policies & Resources
← Back to Schedule
Lecture 12: Preference Optimisation, RLHF, Verifiable Rewards
Lecture 12 · July 22, 2026
Readings
Direct Preference Optimization