One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning
Renhao Li
RioLee
·
AI & ML interests
None yet
Recent Activity
upvoted
a
collection
29 days ago
ToolRM
updated
a collection
29 days ago
ToolRM
authored
a paper
30 days ago
CoEvol: Constructing Better Responses for Instruction Finetuning through
Multi-Agent Cooperation