This episode explores how formal methods, the mathematical practice of proving system correctness, are being combined with large language models to safely scale agentic AI. Byron Cook explains how AWS's Automated Reasoning Group applies symbolic reasoning to security policies, networking, and agent guardrails, and how LLMs now dramatically lower the barrier to using these powerful but previously brittle formal tools by translating natural language into logic and automating proof search.
Summarized by Podsumo
Formal methods can't solve undecidable problems in practice by accepting that the tool may occasionally decline to answer, enabling massive ROI for niche domains like device drivers (85% of OS crashes).
AWS's Automated Reasoning Group built moonshot ladder by breaking big moonshots (policy reasoning, VPC analysis) into quarterly deliverables that built trust and grew them into production services like IAM Access Analyzer and Bedrock Guardrails.
LLMs now enable 1,000x productivity gains for formal reasoning by taking over proof search and auto-formalization, and the key bottleneck has shifted from coding the logic to specifying the safety properties in natural language.
Agentic AI reintroduces the need for sociotechnical mechanisms—formal guardrails that can be checked automatically before agents execute actions, preventing leakage of confidential data or violating compliance.
Lean theorem prover, combined with LLMs, is emerging as the lingua franca for reasoning about programs (via open-source Strata IR).
"In each role I'm trying to identify really big things that other people can't wrap their head around that maybe they don't think are possible. I'm trying to move that seemingly impossible to the possible and then trying to de-risk that. — Byron Cook"
"We've seen with recent advances in the models... it's like a thousand X productivity gains from that small seed of individuals who can drive these tools Byron Cook"
"So there's a theorem that the problem's undecidable... You have to cut a corner somewhere... we'll be okay with maybe the tool not giving an answer from time to time Byron Cook"