When Agents Learn from the World, Not from Us
The pretraining paradigm scaled data. The agent paradigm scales environments. Three observations on environment scaling, continuous evolution, and multi-agent collaboration.
agentic-rl, environment-scaling, multi-agent, continual-learning