2024-09-09

Interpretability

Relevant:

Case study of Grokking via Mechanistic Interpretability

Backlinks

Every Model Learned by Gradient Descent Is Approximately a Kernel Machine

Found this interesting? Subscribe to new posts.
Any comments? Send an email.