Quick Overview: Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ... Atticus Geiger from Pr(Ai)²R Group explores “State of Lex Fridman Podcast full episode: Thank you for listening ❤ Check out our ...

Scaling Interpretability - Detailed Overview & Context

Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ... Atticus Geiger from Pr(Ai)²R Group explores “State of Lex Fridman Podcast full episode: Thank you for listening ❤ Check out our ... Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ... Eric is a PhD student in the Department of Physics at MIT working with Max Tegmark on improving our scientific/theoretical ... At an Anthropic Research Salon event in San Francisco, four of our researchers—Alex Tamkin, Jan Leike, Amanda Askell and ...

What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ... A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ... In this talk, Mansi discusses her work with Eric Michaud returns to the stream to talk about his recent work on How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic

This talk was recorded at NDC AI in Oslo, Norway. Attend the next NDC ... This has been my favorite video so far to make! I think SPONSOR MESSAGES: CentML offers competitive pricing for GenAI model deployment, with flexible options to suit a wide range ...

Photo Gallery

Scaling interpretability
Atticus Geiger - State of Interpretability & Ideas for Scaling Up [Alignment Workshop]
Scaling Laws of AI explained | Dario Amodei and Lex Fridman
The Dark Matter of AI [Mechanistic Interpretability]
Eric Michaud—Scaling, Grokking, Quantum Interpretability
How difficult is AI alignment? | Anthropic Research Salon
Interpretability: Understanding how AI models think
Interpretable vs Explainable Machine Learning
What is interpretability?
Mechanistic Interpretability explained | Chris Olah and Lex Fridman
Scaling ML Interpretability Experiments Using Parsl
Interpretability and AI Scaling with Eric Michaud
Sponsored
Sponsored
View Main Result
Sponsored
Sponsored