Residual Thoughts
Notes on mechanistic interpretability, geometry, and model behavior.
A Weight-Space Map of Attention Heads in Gemma-2 (with SAEs)
mechanistic-interpretability
gemma-2
sae
attention-heads
No matching items
Notes on mechanistic interpretability, geometry, and model behavior.