TCAndon-Router: Adaptive Reasoning Router for Multi-Agent Collaboration Paper • 2601.04544 • Published 5 days ago • 2 • 3
Memory Matters More: Event-Centric Memory as a Logic Map for Agent Searching and Reasoning Paper • 2601.04726 • Published 5 days ago • 3
DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation Paper • 2601.04823 • Published 5 days ago • 3
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Paper • 2601.05905 • Published 3 days ago • 12
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published 5 days ago • 20
EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis Paper • 2601.05808 • Published 3 days ago • 24
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 3 days ago • 38
MMFormalizer: Multimodal Autoformalization in the Wild Paper • 2601.03017 • Published 7 days ago • 94
ReHyAt: Recurrent Hybrid Attention for Video Diffusion Transformers Paper • 2601.04342 • Published 5 days ago • 3
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling Paper • 2601.03111 • Published 6 days ago • 8
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs Paper • 2601.03559 • Published 6 days ago • 10
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper • 2601.04767 • Published 5 days ago • 24
RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 4 days ago • 26
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published 4 days ago • 31
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers Paper • 2601.04890 • Published 5 days ago • 39
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 4 days ago • 154
Unified Thinker: A General Reasoning Modular Core for Image Generation Paper • 2601.03127 • Published 6 days ago • 7
Parallel Latent Reasoning for Sequential Recommendation Paper • 2601.03153 • Published 6 days ago • 2