CaRR & C-GRPO • Collection • Data and models for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards" • 6 items • Updated Mar 25 • 1
CohereLabs/cohere-transcribe-03-2026 • Automatic Speech Recognition • 2B • Updated 35 minutes ago • 317k • 964