2024-09-09
Interpretability
Relevant:
Case study of Grokking via Mechanistic Interpretability
Backlinks
Every Model Learned by Gradient Descent Is Approximately a Kernel Machine
Found this interesting?
Subscribe
to new posts.
Any comments? Send an
email
.