Mechanistic interpretability has moved from describing what models do to engineering how they work. If you can identify the neurons responsible for a specific behavior, you can intervene on those neurons directly rather than trying to control the entire system.