Quick Context: This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? A surprising fact about modern large language models is that nobody really knows how they work internally.

Interpretability Now What -

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? A surprising fact about modern large language models is that nobody really knows how they work internally. Seminar on Theoretical Machine Learning Topic: Understanding Deep Neural Networks: From Generalization to

Important details found

  • This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?
  • A surprising fact about modern large language models is that nobody really knows how they work internally.
  • Seminar on Theoretical Machine Learning Topic: Understanding Deep Neural Networks: From Generalization to
  • Use code WELCHLABS at the link below and get 60% off an annual plan: ...
  • Art by Clipped from episode 19 of AXRP: Transcript of that episode: ...

Why this topic is useful

This topic is useful when readers need a quick overview first, then want to move into supporting details and related references.

Sponsored

Frequently Asked Questions

Why are related topics included?

Related topics help readers compare nearby references and understand the broader subject.

What is this page about?

This page summarizes Interpretability Now What and connects it with related entries, references, and supporting context.

Is the information always complete?

Not always. Some topics may need verification from official or primary sources.

Supporting Images

Interpretability - now what?
What Matters Right Now In Mechanistic Interpretability?
What is interpretability?
An Introduction to Mechanistic Interpretability โ€“ Neel Nanda | IASEAI 2025
Interpretability: Understanding how AI models think
Interpretable vs Explainable Machine Learning
What is mechanistic interpretability? Neel Nanda explains.
Interpretability for Everyone - Been Kim
Understanding Deep Neural Networks: From Generalization to Interpretability - Gitta Kutyniok
The Dark Matter of AI [Mechanistic Interpretability]
Sponsored
View Full Details
Interpretability - now what?

Interpretability - now what?

Read more details and related context about Interpretability - now what?.

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

What is interpretability?

What is interpretability?

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

An Introduction to Mechanistic Interpretability โ€“ Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability โ€“ Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ...

Interpretability: Understanding how AI models think

Interpretability: Understanding how AI models think

What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ...

Interpretable vs Explainable Machine Learning

Interpretable vs Explainable Machine Learning

Read more details and related context about Interpretable vs Explainable Machine Learning.

What is mechanistic interpretability? Neel Nanda explains.

What is mechanistic interpretability? Neel Nanda explains.

Art by Clipped from episode 19 of AXRP: Transcript of that episode: ...

Interpretability for Everyone - Been Kim

Interpretability for Everyone - Been Kim

Read more details and related context about Interpretability for Everyone - Been Kim.

Understanding Deep Neural Networks: From Generalization to Interpretability - Gitta Kutyniok

Understanding Deep Neural Networks: From Generalization to Interpretability - Gitta Kutyniok

Seminar on Theoretical Machine Learning Topic: Understanding Deep Neural Networks: From Generalization to

The Dark Matter of AI [Mechanistic Interpretability]

The Dark Matter of AI [Mechanistic Interpretability]

Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...