2024-09-09
Interpretability
Relevant:
Case study of Grokking via Mechanistic Interpretability
Backlinks
Every Model Learned by Gradient Descent Is Approximately a Kernel Machine