Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Paper • 2602.23008 • Published 28 days ago • 36
SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards Paper • 2602.21158 • Published 29 days ago • 1
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 8 days ago • 128
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning Paper • 2602.08234 • Published Feb 9 • 72
RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback Paper • 2603.08561 • Published 16 days ago • 12
ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning Paper • 2603.16060 • Published 9 days ago • 1
Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning Paper • 2601.22297 • Published Jan 29 • 2
ProcMEM: Learning Reusable Procedural Memory from Experience via Non-Parametric PPO for LLM Agents Paper • 2602.01869 • Published Feb 2
Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates Paper • 2601.18510 • Published Jan 26