EG
A futuristic robot arm reaching toward a glowing digital interface, symbolizing autonomous AI learning and curiosity-driven exploration
ResearchJune 20, 2026Embodied Global

UC Berkeley Introduces RATS: Playful Agentic Robot Learning — Curiosity-Driven Skill Acquisition Without Supervision

UC Berkeley's RATS framework enables robots to autonomously acquire skills through curiosity-driven play, achieving 20.6% improvement on LIBERO-PRO benchmark without explicit task instructions.

#UC Berkeley#RATS#Playful Learning#Robotics#Curiosity-Driven AI#Code-as-Policy#LIBERO-PRO#arXiv
Reading in English

A research team from UC Berkeley has introduced RATS (Robotics Agent Teams), a novel framework for Playful Agentic Robot Learning that enables robots to autonomously acquire skills through self-directed play, without requiring explicit task instructions.

Published on arXiv on June 17, 2026, the paper "Playful Agentic Robot Learning" addresses a fundamental limitation of current robotic systems: they only learn when explicitly told what to do. The RATS framework flips this paradigm by treating robots as curious explorers who proactively discover and internalize reusable skills.

How RATS Works:

The system operates through a multi-agent architecture during a "play" phase. RATS proposes novel yet learnable exploratory tasks, plans and executes robot-code policies, verifies intermediate progress, diagnoses failures, retries with dense step-level feedback, and distills successful executions into a persistent code skill library.

Key components:

  • Curiosity-driven task proposal — the agent autonomously generates tasks that are challenging but achievable
  • Code-as-Policy execution — robot behavior is defined through executable code
  • Autonomous failure diagnosis — the system identifies why tasks failed and iterates
  • Skill library distillation — successful behaviors are stored as reusable code assets

Performance Results:

In experiments on LIBERO-PRO and MolmoSpaces benchmarks, RATS achieved remarkable improvements:

  • 20.6 percentage-point gain over standard Code-as-Policy agents on LIBERO-PRO
  • 17.0 percentage-point gain on MolmoSpaces
  • Learned skills improved RoboSuite performance by 8.9 points and real-world transfer by 8.8 points
  • Skills can be plugged into other inference-time agents without finetuning the underlying model — simply by retrieving them into context

Significance:

This research demonstrates that unsupervised "play" can produce high-quality, cross-environment transferable code skill libraries. The approach opens new possibilities where robots continuously improve through self-directed exploration, much like how humans learn through play. The project page is available at https://playful-rats.github.io/.

Authors include Junyi Zhang, Jiaxin Ge, Hanjun Yoo, Letian Fu, Ken Goldberg, Trevor Darrell, Ion Stoica, Angjoo Kanazawa, and other researchers from UC Berkeley.

Source: arXiv
Language: English- Showing content in English