[New paper] Analytically solving the training dynamics of word2vec
9 min · Word embedding models learn interpretable topic-level concepts one at a time.
Lazy (NTK) and active (muP) training -- what gives?
6 min · There's only one degree of freedom in choosing how hyperparameters scale with network width.
How to take derivatives of matrix expressions
8 min · Using einsums for pen-and-paper matrix differentiation
Quick n dirty derivation of Larmor radiation and gravitational waves
4 min · How to derive order-of-magnitude expressions for radiation formulas
Statistical physics of signal propagation in deep neural networks
7 min · Using statistical physics to understand what makes a good activation function
Detecting and tracking structures in protostellar outflows
1 min · My senior thesis won an award
Blackbody radiation made simpler
10 min · The "cavity with a hole" is soo confusing.