research

Competitive self-play

Self-play could become a core component for builders aiming to create adaptive, high-skill AI agents without labor-intensive manual training setups.

OpenAI Blog·October 11, 2017·1 min readresearch

researchCompetitive self-play

openai.com

What happened

OpenAI has published findings on competitive self-play, a technique where AI agents improve by playing against copies of themselves. In simulations, this method enabled agents to autonomously discover complex physical skills such as dodging, catching, and tackling—without any hand-designed training curriculum. The environment automatically scales in difficulty as the agent improves, ensuring continuous learning. These results build on OpenAI's earlier success with self-play in Dota 2, suggesting the approach generalizes across domains. For developers building AI workflows, self-play offers a pathway to create agents that can acquire sophisticated behaviors without extensive manual tuning or domain-specific reward shaping. This could accelerate the development of autonomous systems in robotics, gaming, and simulation-based training.

Key takeaways

OpenAI tested self-play where AI agents compete against past versions of themselves to learn skills.
Agents discovered physical maneuvers like tackling and diving without explicit environment design.
The technique maintains an appropriate challenge level as the AI improves.
Previous Dota 2 results also used self-play, supporting its broader applicability.
Self-play may reduce the need for hand-crafted training scenarios in AI development.