Latent Space: The AI Engineer Podcast

NVIDIA's AI Engineers: Agent Inference at Planetary Scale and "Speed of Light" — Nader Khalil (Brev), Kyle Kranen (Dynamo)

1h 24min

This episode of Latent Space features Nader Khalil (Brev) and Kyle Kranen (Dynamo) from NVIDIA, discussing the evolution of AI engineering. They delve into NVIDIA's developer-centric culture, the acquisition of Brev for simplified GPU access, and the "Speed of Light" philosophy. A core focus is Dynamo, NVIDIA's data-center-scale inference engine, which optimizes large language model inference through techniques like disaggregation and specialized scaling. The episode also takes a deep dive into the security and future of AI agents.

Summarized by Podsumo

