Introduction to What Is Mechanistic Interpretability

Exploring What Is Mechanistic Interpretability reveals several interesting facts. Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...

What Is Mechanistic Interpretability Comprehensive Overview

Lex Fridman Podcast full episode: https://www.youtube.com/watch?v=ugvHCXCOmm4 Thank you for listening ❤ Check out our ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

This is the ambitious goal of

Summary & Highlights for What Is Mechanistic Interpretability

  • This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?
  • A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...
  • This is a talk I gave to my MATS scholars, with a stylised history of the field of
  • Neel Nanda from DeepMind presenting '
  • How can we use the language of causality to understand and edit the internal mechanisms of AI models? Atticus Geiger ...

Stay tuned for more updates related to What Is Mechanistic Interpretability.

What Is Mechanistic Interpretability.pdf

Size: 10.38 MB · Format: PDF · Secure Download

Download PDF Read Online

Related Documents