Today, we continue our NeurIPS series with Dan Friedman, a PhD student in the Princeton NLP group. In our conversation, we explore his research on mechanistic interpretability for transformer models, specifically his paper, Learning Transformer Programs. The LTP paper proposes modifications to the transformer architecture which allow transformer models to be easily converted into human-readable programs, making them inherently interpretable. In our conversation, we compare the approach proposed by this research with prior approaches to understanding the models and their shortcomings. We also dig into the approach’s function and scale limitations and constraints.
The complete show notes for this episode can be found at twimlai.com/go/667.
Are Vector DBs the Future Data Platform for AI? with Ed Anuff - #664
Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663
Responsible AI in the Generative Era with Michael Kearns - #662
Edutainment for AI and AWS PartyRock with Mike Miller - #661
Data, Systems and ML for Visual Understanding with Cody Coleman - #660
Patterns and Middleware for LLM Applications with Kyle Roche - #659
AI Access and Inclusivity as a Technical Challenge with Prem Natarajan - #658
Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657
Visual Generative AI Ecosystem Challenges with Richard Zhang - #656
Deploying Edge and Embedded AI Systems with Heather Gorr - #655
AI Sentience, Agency and Catastrophic Risk with Yoshua Bengio - #654
Delivering AI Systems in Highly Regulated Environments with Miriam Friedel - #653
Mental Models for Advanced ChatGPT Prompting with Riley Goodside - #652
Multilingual LLMs and the Values Divide in AI with Sara Hooker - #651
Scaling Multi-Modal Generative AI with Luke Zettlemoyer - #650
Pushing Back on AI Hype with Alex Hanna - #649
Personalization for Text-to-Image Generative AI with Nataniel Ruiz - #648
Ensuring LLM Safety for Production Applications with Shreya Rajpal - #647
What’s Next in LLM Reasoning? with Roland Memisevic - #646
Is ChatGPT Getting Worse? with James Zou - #645
Create your
podcast in
minutes
It is Free
20/20
The Dropout
Ten Percent Happier with Dan Harris
World News Tonight with David Muir
NEJM This Week