Infinite Curiosity Pod with Prateek Joshi

The best place to find out how AI builders build. On this podcast, host Prateek Joshi interviews world-class AI founders and VCs. Visit prateekj.com to learn more about the host.

Automated Evaluation of LLMs

May 07, 2024 • Prateek Joshi

Anand Kannappan is the cofounder and CEO of Patronus AI, an automated AI evaluation and security company. Patronus AI has raised funding from Lightspeed Venture Partners, Replit CEO Amjad Masad, Gokul Rajaram, and Fortune 500 executives. He was previously at Meta and Vertis. He also cofounded Kyber Technologies, a service for systematically predicting market events using AI and remote sensing data, which evolved into a futures quant hedge fund managing $15M for partners.

Anand's favorite book: Harry Potter series (Author: JK Rowling)

(00:00) Introduction and Common Failure Modes of Large Language Models
(03:02) Challenges of Automated Evaluation in AI Models
(06:08) The Importance of Fine-Tuning and Retrieval Augmented Generation
(09:02) Addressing Copyright Detection in Language Models
(11:51) The Liability of Companies Using AI Models
(15:02) Advancements in Multimodal Models and State Space Models
(20:48) The Role of Fine-Tuning in the Evolution of Language Models
(23:51) The Significance of Adversarial Testing in AI
(25:56) The Role of Retrieval Augmented Generation in AI
(28:05) The Need for Continuous Function Optimization in Prompting
(29:02) Rapid Fire Round

--------
Where to find Prateek Joshi:

Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19
Twitter: https://twitter.com/prateekvjoshi