Self-Evolution | Binjie Zhang

An agent-native loop where the LLM is the decision-maker that keeps a model improving with minimal human intervention.

Closed cycle: diagnose → hypothesize → pilot → gate → train → reflect.
Cheap pilot runs are gated before committing to full training, so compute is spent only on promising directions.
An append-only lesson memory turns failed runs into reusable knowledge for the next cycle.

Built during my AI research internship at TikTok (ByteDance) as a self-evolving training platform.