Binjie Zhang
Ph.D. Candidate, Show Lab, National University of Singapore.
Email: binjie97 [at] u.nus.edu
Show Lab, NUS
Singapore
I am a Ph.D. candidate in Computer Science at the National University of Singapore (NUS), advised by Assistant Prof. Mike Zheng Shou in Show Lab.
My research interests are LLM agents — harness design, memory, tool use, and self-evolution — together with vision–language and video understanding, world / predictive models, and lifelong and compatible representation learning. Broadly, I want to build systems where multi-modal agents can reuse what they already know, anticipate what comes next, and keep improving after deployment.
From January to June 2026, I was an AI Research Intern at TikTok (ByteDance) in Singapore, where I owned production LLM-agent systems for large-scale e-commerce (intellectual-property) governance. The work spanned five threads: (i) business landing of agents in real e-commerce moderation pipelines; (ii) an agent harness / framework for rapidly composing skill-based agents; (iii) training-free memory optimization; (iv) long-horizon context management; and (v) self-evolution — agent-native loops that keep models improving with minimal human intervention. I drove each system from research prototype to production deployment.
Before NUS I received my M.Eng. in Computer Science and Technology from Tsinghua University, supervised by Prof. Chun Yuan, and my B.Eng. in Information Engineering from East China University of Science and Technology (ECUST), where I ranked 1 / 92.
Before that, I spent several years at Tencent ARC Lab (Shenzhen) as an AI research intern, where I led a line of first-author work on backward-compatible representation learning and hot-refresh model upgrades for large-scale image and video retrieval (ICLR’22, IJCAI’22 long oral, AAAI’23, arXiv’23), as well as on cross-modality video understanding.
news
| Jun 20, 2026 | ReGRPO: Reflection-Augmented Policy Optimization for Tool-Using Agents accepted to ECCV 2026. |
|---|---|
| May 15, 2026 | Submitted three papers to NeurIPS 2026 — PathComp (lifelong vision–language–action adaptation), Ego-PM (egocentric predictive model), and cLOO (conflict-aware multi-hop RAG attribution). |
| Mar 07, 2026 | Submitted ReGRPO (reflection-augmented RL for tool-using agents) to ECCV 2026 as first author. |
| Jan 30, 2026 | Two first-author papers submitted to ICML 2026: RCFC (lifelong imitation learning) and Ego-centric Predictive Model Conditioned on Hand Trajectories (top 15%). |
| Jan 06, 2026 | Started as an AI Research Intern at TikTok (ByteDance) in Singapore, working end-to-end on LLM agents — harness, memory, tool use, and self-evolving training loops. |
selected publications
- ECCV’26*ReGRPO: Reflection-Augmented Policy Optimization for Tool-Using AgentsIn European Conference on Computer Vision (ECCV), 2026Accepted
- NeurIPS’26*
PathComp: Path-Anchored Compatibility for Lifelong Vision–Language–Action AdaptationIn Conference on Neural Information Processing Systems (NeurIPS), 2026Under review - NeurIPS’26*
Ego-centric Predictive Model Conditioned on Hand TrajectoriesIn Conference on Neural Information Processing Systems (NeurIPS), 2026Under review - arXiv’23*
TaCA: Upgrading Your Visual Foundation Model with a Task-Agnostic Compatible AdapterarXiv preprint arXiv:2306.12642, 2023 - AAAI’23*Darwinian Model Upgrades: Model Evolving with Selective CompatibilityIn AAAI Conference on Artificial Intelligence (AAAI), 2023
- IJCAI’22*
Towards Universal Backward-Compatible Representation LearningIn International Joint Conference on Artificial Intelligence (IJCAI), 2022Long oral - ICLR’22*
Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image RetrievalIn International Conference on Learning Representations (ICLR), 2022