Quick Overview: MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ... How can we reverse engineer what a neural network is doing? In this IASEAI ' This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

25 Interpretability - Detailed Overview & Context

MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ... How can we reverse engineer what a neural network is doing? In this IASEAI ' This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ... Speaker: Hanieh Arjmand, ML Researcher, Lydia.ai & Spark Tseung, Applied Data Scientist, Lydia.ai Model Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic

This talk was recorded at NDC AI in Oslo, Norway. Attend the next NDC ... Stanford AI Lab Faculty Lunch, November 7, 2025. Updated version of 0:59 ... Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ... What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ... Forough Poursabzi, Researcher, Microsoft Research Presented at MLconf 2018 Abstract: Machine learning is increasingly used to ... Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

Lex Fridman Podcast full episode: Thank you for listening ❤ Check out our ...

Photo Gallery

25. Interpretability
Lecture 25: Interpretability
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
[XHRI 2025] Interpretability Analysis of Symbolic Representations for SDM Systems
What Matters Right Now In Mechanistic Interpretability?
What is interpretability?
Sensitivity and Interpretability of AI-Models
A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google
[CoLoRAI 25] Compositionality Unlocks Deep Interpretable Models
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)
Between the Layers– Interpreting Large Language Models - Michelle Frost - NDC AI 2025
Assessing skeptical views of interpretability research
Sponsored
Sponsored
View Main Result
Sponsored
Sponsored