SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper β’ 2604.08377 β’ Published 6 days ago β’ 273
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper β’ 2604.06628 β’ Published 7 days ago β’ 308
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale Paper β’ 2604.04771 β’ Published 9 days ago β’ 116
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 15 days ago β’ 47
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper β’ 2603.17024 β’ Published 28 days ago β’ 109
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 β’ 124
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper β’ 2512.16676 β’ Published Dec 18, 2025 β’ 222
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper β’ 2511.18538 β’ Published Nov 23, 2025 β’ 304
DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation Paper β’ 2511.06307 β’ Published Nov 9, 2025 β’ 53
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper β’ 2509.22638 β’ Published Sep 26, 2025 β’ 70
Reverse-Engineered Reasoning for Open-Ended Generation Paper β’ 2509.06160 β’ Published Sep 7, 2025 β’ 151
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper β’ 2508.02193 β’ Published Aug 4, 2025 β’ 138
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo Paper β’ 2508.02317 β’ Published Aug 4, 2025 β’ 23
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology Paper β’ 2507.07999 β’ Published Jul 10, 2025 β’ 51
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning Paper β’ 2504.13914 β’ Published Apr 10, 2025 β’ 5
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents Paper β’ 2507.04009 β’ Published Jul 5, 2025 β’ 54