Announcement_3

Released the preprint for my work 🪃 Boomerang Distillation Enables Zero-Shot Model Size Interpolation 🪃. We uncover boomerang distillation, a surprising phenomenon by which we can create a full family of models of fine-grained sizes with no additional training by interpolating between a pretrained and distilled model.