REARANK: Reasoning Re-ranking Agent via Reinforcement Learning
Paper
• 2505.20046 • Published
• 18
This is a reasoning reranking agent model built upon Qwen-2.5-7B for the paper REARANK: Reasoning Re-ranking Agent via Reinforcement Learning. The model is trained on reranking dataset built from only 179 queries using GRPO to perform reranking task, the codebase is at https://github.com/lezhang7/Rearank