Dwarkesh Podcast
Dwarkesh Podcast

Eric Jang – Building AlphaGo from scratch

2h 37min

Eric Jang explains how to build AlphaGo from scratch, breaking down the core concepts of Monte Carlo Tree Search (MCTS), policy and value networks, and self-play reinforcement learning. He also discusses the profound implications of how a small neural network can amortize an intractable search problem, and shares insights into using LLMs for automated research.

Summarized by Podsumo

Key Takeaways

💬 Notable Quotes

Get every episode summarized
Delivered to Telegram. Ask questions about any episode.
Start on Telegram