Measuring Epistemic Resilience of LLMs Under Misleading Medical Context Paper • 2606.12291 • Published 9 days ago • 42
Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 3 days ago • 70
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding? Paper • 2606.08063 • Published 13 days ago • 78
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 9 days ago • 86
VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 4 days ago • 97
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 14 days ago • 95
Running Featured 1.37k FineWeb: decanting the web for the finest text data at scale 🍷 1.37k Explore and download the FineWeb web‑scale text dataset
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research Paper • 2606.09730 • Published 11 days ago • 50
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts Paper • 2606.05922 • Published 15 days ago • 52
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 15 days ago • 63