LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 14 days ago • 207
VisualClaw: A Real-Time, Personalized Agent for the Physical World Paper • 2606.16295 • Published 15 days ago • 28
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents Paper • 2605.30621 • Published May 28 • 22
Mistral Medium 3.5 Collection Our first flaship models handling instruction-following, reasoning, and coding in a single set of opened-weights. • 2 items • Updated Apr 29 • 19
AutoMedBench: Towards Medical AutoResearch with Agentic AI Models Paper • 2606.01961 • Published 27 days ago • 27