SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
Abstract
SkillRL enables LLM agents to improve through hierarchical skill discovery and recursive policy evolution, achieving superior performance on complex tasks while reducing computational overhead.
Large Language Model (LLM) agents have shown stunning results in complex tasks, yet they often operate in isolation, failing to learn from past experiences. Existing memory-based methods primarily store raw trajectories, which are often redundant and noise-heavy. This prevents agents from extracting high-level, reusable behavioral patterns that are essential for generalization. In this paper, we propose SkillRL, a framework that bridges the gap between raw experience and policy improvement through automatic skill discovery and recursive evolution. Our approach introduces an experience-based distillation mechanism to build a hierarchical skill library SkillBank, an adaptive retrieval strategy for general and task-specific heuristics, and a recursive evolution mechanism that allows the skill library to co-evolve with the agent's policy during reinforcement learning. These innovations significantly reduce the token footprint while enhancing reasoning utility. Experimental results on ALFWorld, WebShop and seven search-augmented tasks demonstrate that SkillRL achieves state-of-the-art performance, outperforming strong baselines over 15.3% and maintaining robustness as task complexity increases. Code is available at this https://github.com/aiming-lab/SkillRL.
Community
Skill accumulation is the new paradigm for AI agents.
We’re moving from static models to recursive evolution 🧬. SkillRL proves skills > scale, enabling a 7B model to beat GPT-4o 🚀.
Evolving > Scaling. 💡
Paper: https://arxiv.org/abs/2602.08234
Code: https://github.com/aiming-lab/SkillRL
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents (2026)
- ProcMEM: Learning Reusable Procedural Memory from Experience via Non-Parametric PPO for LLM Agents (2026)
- Reinforcement Learning for Self-Improving Agent with Skill Library (2025)
- ProRAG: Process-Supervised Reinforcement Learning for Retrieval-Augmented Generation (2026)
- Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates (2026)
- CoDA: A Context-Decoupled Hierarchical Agent with Reinforcement Learning (2025)
- MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper