25. Interpretability

MIT 6.S897 Machine Learning for Healthcare, Spring 2019. Instructor: Peter Szolovits. View the complete course: ...

Lecture 25: Interpretability

Machine Learning for Healthcare ...

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '

[XHRI 2025] Interpretability Analysis of Symbolic Representations for SDM Systems

Interpretability

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

What is interpretability?

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

Sensitivity and Interpretability of AI-Models

Speakers: Hanieh Arjmand (ML Researcher, Lydia.ai) and Spark Tseung (Applied Data Scientist, Lydia.ai). Model ...

A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google

With a growing interest in

[CoLoRAI 25] Compositionality Unlocks Deep Interpretable Models

Paper: Compositionality Unlocks Deep

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic

Between the Layers – Interpreting Large Language Models - Michelle Frost - NDC AI 2025

This talk was recorded at NDC AI in Oslo, Norway.

Assessing skeptical views of interpretability research

Stanford AI Lab Faculty Lunch, November 7, 2025. Updated version of https://web.stanford.edu/~cgpotts/blog/interp/

The Dark Matter of AI [Mechanistic Interpretability]

Interpretability: Understanding how AI models think

What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ...

Manipulating and Measuring Model Interpretability

Forough Poursabzi, Researcher, Microsoft Research. Presented at MLconf 2018. Abstract: Machine learning is increasingly used to ...

Systor '25 Keynote: Prof. Nir Shavit - Towards Combinatorial Interpretability of Neural Computation

Abstract: We introduce combinatorial

Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability explained | Chris Olah and Lex Fridman

Lex Fridman Podcast full episode: https://www.youtube.com/watch?v=ugvHCXCOmm4