January 6, 2025

How Does Claude 4 Think? — Sholto Douglas & Trenton Bricken

Highly recommend listening to this episode where Dwarkesh talks with Sholto Douglas and Trenton Bricken from Anthropic about how Claude 4 thinks, RL scaling, and the future of AI agents.

Key Takeaways

  • The discussion on mechanistic interpretability and how models develop “circuits” for different capabilities
  • Their predictions about autonomous agents - by May 2025, we might see agents capable of complex multi-step tasks like advanced Photoshop work
  • Insights into how reinforcement learning is scaling and what that means for model capabilities

YouTube link if you prefer video.