Infinite ML with Prateek Joshi

Automated Evaluation of LLMs

Prateek Joshi

Anand Kannappan is the cofounder and CEO of Patronus AI, an automated AI evaluation and security company. They have raised funding from Lightspeed Venture Partners, Replit CEO Amjad Masad, Gokul Rajaram, and Fortune 500 executives. He was previously at Meta and Vertis. He was also the cofounder of Kyber Technologies, which was a service to systematically predict market events using AI and remote sensing data. It evolved into a futures quant hedge fund managing $15M for partners.

Anand's favorite book: Harry Potter series (Author: JK Rowling)

(00:00) Introduction and Common Failure Modes of Large Language Models
(03:02) Challenges of Automated Evaluation in AI Models
(06:08) The Importance of Fine-Tuning and Retrieval Augmented Generation
(09:02) Addressing Copyright Detection in Language Models
(11:51) The Liability of Companies Using AI Models
(15:02) Advancements in Multimodal Models and State Space Models
(20:48) The Role of Fine-Tuning in the Evolution of Language Models
(23:51) The Significance of Adversarial Testing in AI
(25:56) The Role of Retrieval Augmented Generation in AI
(28:05) The Need for Continuous Function Optimization in Prompting
(29:02) Rapid Fire Round

--------
Where to find Prateek Joshi:

Newsletter: https://prateekjoshi.substack.com 
Website: https://prateekj.com 
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 
Twitter: https://twitter.com/prateekvjoshi