Memory Optimization | Binjie Zhang

Training-free optimization of an agent’s memory — deciding what to retrieve, write, and evict — without any gradient updates.

Treats memory policy as a search / optimization problem over read–write–evict decisions.
Targets long-horizon, multi-step tool-use agents, where naïve context growth becomes the bottleneck.
Complements context management to keep the working set small and relevant.

Research line from my AI research internship at TikTok (ByteDance); paper in preparation.