-
Beyond Language Modeling: An Exploration of Multimodal Pretraining
Paper • 2603.03276 • Published • 105 -
Qwen3-Coder-Next Technical Report
Paper • 2603.00729 • Published • 65 -
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
Paper • 2603.03205 • Published • 13 -
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
Paper • 2602.23166 • Published • 45
Collections
Discover the best community collections!
Collections including paper arxiv:2602.23166
-
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Paper • 2506.01939 • Published • 190 -
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Paper • 2511.21689 • Published • 128 -
PretrainZero: Reinforcement Active Pretraining
Paper • 2512.03442 • Published • 50 -
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents
Paper • 2601.16344 • Published • 12
-
Warrieryes/OpenThinkIMG-Chart-Qwen2-2B-VL
2B • Updated • 17 • 3 -
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Paper • 2505.08617 • Published • 42 -
hitsmy/OpenThinkIMG-Chart-SFT-2942
Viewer • Updated • 2.94k • 75 • 1 -
hitsmy/OpenThinkIMG-Chart-RL-14501
Viewer • Updated • 14.5k • 17
-
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 121 -
KlingAvatar 2.0 Technical Report
Paper • 2512.13313 • Published • 44 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 95 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 222
-
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Paper • 2505.13227 • Published • 46 -
facebook/natural_reasoning
Viewer • Updated • 1.15M • 2.49k • 570 -
nvidia/OpenMathReasoning
Viewer • Updated • 5.68M • 13.6k • 464 -
Search Arena: Analyzing Search-Augmented LLMs
Paper • 2506.05334 • Published • 19
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 86 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 156 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
Beyond Language Modeling: An Exploration of Multimodal Pretraining
Paper • 2603.03276 • Published • 105 -
Qwen3-Coder-Next Technical Report
Paper • 2603.00729 • Published • 65 -
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
Paper • 2603.03205 • Published • 13 -
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
Paper • 2602.23166 • Published • 45
-
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 121 -
KlingAvatar 2.0 Technical Report
Paper • 2512.13313 • Published • 44 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 95 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 222
-
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Paper • 2506.01939 • Published • 190 -
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Paper • 2511.21689 • Published • 128 -
PretrainZero: Reinforcement Active Pretraining
Paper • 2512.03442 • Published • 50 -
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents
Paper • 2601.16344 • Published • 12
-
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Paper • 2505.13227 • Published • 46 -
facebook/natural_reasoning
Viewer • Updated • 1.15M • 2.49k • 570 -
nvidia/OpenMathReasoning
Viewer • Updated • 5.68M • 13.6k • 464 -
Search Arena: Analyzing Search-Augmented LLMs
Paper • 2506.05334 • Published • 19
-
Warrieryes/OpenThinkIMG-Chart-Qwen2-2B-VL
2B • Updated • 17 • 3 -
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Paper • 2505.08617 • Published • 42 -
hitsmy/OpenThinkIMG-Chart-SFT-2942
Viewer • Updated • 2.94k • 75 • 1 -
hitsmy/OpenThinkIMG-Chart-RL-14501
Viewer • Updated • 14.5k • 17
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 86 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 156 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25