Technique · interpretability
Sparse Autoencoders for Interpretability
Training sparse autoencoders on residual-stream activations to extract monosemantic, human-interpretable features from transformer internals.
0
Products deploying
—
Avg research → prod
—
First commercial deploy
Deployment timeline
No verified deployments yet in our tracked product set.