Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

2h 14min

This episode explores the mathematics and hardware behind LLM training and inference, showing how batch size, memory bandwidth, and compute capability shape model architecture, API pricing, and the pace of AI progress. It highlights the trade-offs among latency, cost, and model quality in large-scale deployments, drawing on real-world API structures.

Summarized by Podsumo

