ECS8060: AI Engineering

Home Schedule Labs Project Policies & Resources

← Back to Schedule

Lecture 12: Preference Optimisation, RLHF, Verifiable Rewards

Lecture 12 · July 22, 2026

Readings

Direct Preference Optimization

© 2026 Queen's University Belfast. ECS8060 AI Engineering.