Rewards as Labels: Revisiting RLVR from a Classification Perspective Paper • 2602.05630 • Published Feb 5 • 3
view article Article Introducing smolagents: simple agents that write actions in code. +1 Dec 31, 2024 • 1.18k
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 120