One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning
Renhao Li
RioLee
Recent Activity
upvoted a collection about 2 months ago: ToolRM
updated a collection about 2 months ago: ToolRM
authored a paper about 2 months ago: CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation