projects | Binjie Zhang

agent systems

A declarative, skill-based harness for rapidly composing LLM agents.

Training-free optimization of agent memory for long-horizon tool use.

Context engineering for long-horizon, multi-step agents.

Agent-native loops that let models keep improving after deployment.

Taking LLM agents from prototype to production for e-commerce governance.

Group-relative policy optimization with structured self-reflection for long-horizon tool use.

Path-anchored compatibility for continually fine-tuning VLA policies.

Two-stage egocentric video prediction conditioned on hand trajectories.

Upgrade visual foundation models without retraining downstream tasks.