news
| Apr 30, 2026 | Paper accepted at ICML 2026! Inverse Depth Scaling From Most Layers Being Similar |
|---|---|
| Jan 26, 2026 | Two of my papers were accepted to ICLR 2026: 🪃 Boomerang Distillation Enables Zero-Shot Model Size Interpolation 🪃 and Hidden Breakthroughs in Language Model Training! |
| Dec 06, 2025 | My work Boomerang Distillation Enables Zero-Shot Model Size Interpolation was published at the NeurIPS 2025 UniReps Workshop as part of the blogpost track. Check out our post on the UniReps blog! |