January 6, 2025
How Does Claude 4 Think? — Sholto Douglas & Trenton Bricken
Highly recommend listening to this episode where Dwarkesh talks with Sholto Douglas and Trenton Bricken from Anthropic about how Claude 4 thinks, RL scaling, and the future of AI agents.
Key Takeaways
- The discussion on mechanistic interpretability and how models develop “circuits” for different capabilities
- Their predictions about autonomous agents - by May 2025, we might see agents capable of complex multi-step tasks like advanced Photoshop work
- Insights into how reinforcement learning is scaling and what that means for model capabilities
YouTube link if you prefer video.