HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation Paper • 2603.23871 • Published Mar 25 • 1
Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why Paper • 2605.10889 • Published May 11 • 6
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published Apr 1 • 56
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published Apr 27 • 25
Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics Paper • 2605.12178 • Published May 12 • 61
EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents Paper • 2605.13841 • Published about 1 month ago • 75
Eureka-Audio: Triggering Audio Intelligence in Compact Language Models Paper • 2602.13954 • Published Feb 15 • 4
EpochX: Building the Infrastructure for an Emergent Agent Civilization Paper • 2603.27304 • Published Mar 28 • 47
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published Feb 9 • 290
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published Mar 10 • 76
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published Mar 3 • 106
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published Mar 26 • 133