Podcast Episode

The Data Exchange with Ben Lorica

Fine-tuning and Preference Alignment in a Single Streamlined Process

Jun 13, 2024

Ben Lorica

Jiwoo Hong and Noah Lee of KAIST AI are co-authors of the paper "ORPO: Monolithic Preference Optimization without Reference Model."

Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/

Detailed show notes can be found on The Data Exchange website.