P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads Paper • 2602.09443 • Published 2 days ago • 54
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Paper • 2602.02477 • Published 10 days ago • 10
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 3 days ago • 37
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 3 days ago • 37
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 10 days ago • 32
A2Eval: Agentic and Automated Evaluation for Embodied Brain Paper • 2602.01640 • Published 10 days ago • 8
A2Eval: Agentic and Automated Evaluation for Embodied Brain Paper • 2602.01640 • Published 10 days ago • 8
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models Paper • 2511.23319 • Published Nov 28, 2025 • 24
P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published Nov 17, 2025 • 134
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts Paper • 2503.16057 • Published Mar 20, 2025 • 14
Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting Paper • 2310.08129 • Published Oct 12, 2023
QUBE: Enhancing Automatic Heuristic Design via Quality-Uncertainty Balanced Evolution Paper • 2412.20694 • Published Dec 30, 2024
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models Paper • 2401.13919 • Published Jan 25, 2024 • 32
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization Paper • 2410.19609 • Published Oct 25, 2024 • 18