Unlocking On-Policy Distillation for Any Model Family 📝 • Running • 74 • Apply on-policy distillation to any model family
The Smol Training Playbook 📚 • Running on CPU Upgrade • Featured • 2.79k • The secrets to building world-class LLMs
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 25 days ago • 216k • 1.56k
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation • 24B • Updated Apr 20, 2025 • 83 • 58
bartowski/DeepSeek-R1-Distill-Qwen-32B-abliterated-GGUF Text Generation • Updated Jan 25, 2025 • 7.22k • 127