Fine-tuning and Preference Alignment in a Single Streamlined Process
The Data Exchange with Ben Lorica
More Info
The Data Exchange with Ben Lorica
Fine-tuning and Preference Alignment in a Single Streamlined Process
Jun 13, 2024
Ben Lorica

Jiwoo Hong and  Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model

Subscribe to the Gradient Flow Newsletterhttps://gradientflow.substack.com/

Subscribe: AppleSpotify OvercastPocket CastsAntennaPodPodcast AddictAmazon •  RSS.

Detailed show notes can be found on The Data Exchange web site.