Boxuan Zhang

Research Focus

Trustworthy Agentic AI

LLM Post-Training

AI Content Detection

ML for Healthcare

About Me

I am a CS Ph.D. student at Rutgers University, advised by Prof. Ruixiang (Ryan) Tang. Before Rutgers, I completed my master's and bachelor's degrees at the SIGMA Lab of Wuhan University, advised by Prof. Zengmao Wang and Prof. Bo Du.

My research focuses on building trustworthy agentic AI systems, with particular interests in LLM post-training, safety, reliability, and robustness. I also work on reliable detection of AI-generated visual content and interdisciplinary applications of ML in healthcare, including immunology and cell morphology.

🔥 If you share these research interests or have feedback on my previous work, feel free to drop me an email — I am always delighted to discuss potential collaborations!

News

2026-06

🎉 Honored to receive the 2025–2026 Excellent TA Award from CS@Rutgers University.

2026-05

🎉 Honored to be selected as an ICML 2026 Gold Reviewer.

2026-03

🎉 One paper has been accepted by ICME 2026 as Best Paper Award candidate.

2025-11

🎉 Excited to share that our Agent Matrix was featured in several tech blogs, including From Chatbots to Clones and AI Just Learned to Clone Itself.

2025-07

🎉 Our Frontier AI Risk Management Framework has been released. I participated in exploring self-replication risks in LLM agents.

Recent Projects

Selected work — click through to explore each project.

01 / 06

Agentic AI2026

AgentForesight

Online Auditing for Multi-Agent Systems

Explore Project

01 / 06

Selected Publications

View All →

FAGEN @ ICML2026

AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems

Boxuan Zhang^*, Jianing Zhu^*, Zeru Shi, Dongfang Liu, Ruixiang Tang

We reframe agentic failure analysis from post-hoc attribution on completed trajectories to online auditing on unfolding prefixes, where an auditor commits a continue-or-alarm verdict at every step.

Paper Code Project

arXiv2026

Micro-Defects Expose Macro-Fakes: Detecting AI-Generated Images via Local Distributional Shifts

Boxuan Zhang, Jianing Zhu, Qifan Wang, Jiang Liu, Ruixiang Tang

We propose Micro-Defects expose Macro-Fakes (MDMF), a local distribution-aware detection framework that amplifies micro-scale statistical irregularities into macro-level distributional discrepancies for AI-generated image detection.

Paper Code Project

NeurIPS2024

What If the Input is Expanded in OOD Detection?

Boxuan Zhang^*, Jianing Zhu^*, Zengmao Wang, Tongliang Liu, Bo Du, Bo Han

Propose a novel perspective to employ different common corruptions on the input space to expand the representation dimension for out-of-distribution detection.

Paper Code Project