π§ About Me
I am a 1st year CS PhD student at Rutgers University, advised by Prof. Ruixiang (Ryan) Tang. I have completed my masterβs degree at Sensing IntelliGence and MAchine learning(SIGMA) Lab in Wuhan University, under the supervision of Prof. Zengmao Wang and Prof. Bo Du.
Previously, I was a research intern at RZ-Lab, Purdue University, advised by Prof. Ruqi Zhang. I was also fortunate to work with Prof. Lu Cheng, Prof. Bo Han and Dr. Jianing Zhu.
π§© Research Interests:
My current research centers on building trustworthy agentic and generative AI systems, with an emphasis on Reinforcement Learning methods for Agentic LLMs and Diffusion Models, focusing on safety, reliability, and robustness. I am also working on reliable detection for AI-generated visual content (e.g., images, videos), and interdisciplinary applications of machine learning in healthcare (e.g., Immunology, Cell Morphology).
If you share the same research interests with me and are interested in these areas or my previous works, feel free to drop me an Email or add my Wechat . I am always delighted for potential collaborations!
π₯ News
- 2026.03: ππ One paper has been accepted by ICME 2026.
- 2025.11: ππ Excited to share our Agent Matrix featured in several tech blogs, including From Chatbots to Clones and AI Just Learned to Clone Itself. Great to see growing attention to this emerging topic!
- 2025.07: ππ Our Frontier AI Risk Management Framework has been released. I participated in exploring self-replication risks in LLM Agents.
- 2025.05: ππ CoT-UQ has been accepted by ACL 2025.
- 2025.02: ππ Check our two preprint works regarding LLMs! One investigates Uncertainty Quantification in LLMs, and the other explores Connections between Creativity and Hallucination in LLMs.
- 2025.02: ππ I will join CS@Rutgers University as a PhD student in 2025 Fall, supervised by Prof. Ryan Tang!
- 2024.09: ππ Our paper titled "What If the Input is Expanded in OOD Detection?" has been accepted by NeurIPS 2024.
- 2024.06: ππ Start my remote research internship in CS@Purdue University, collaborating with Dr. Ruqi Zhang.
- 2024.05: ππ Successfully defended my Master thesis!
- 2024.04: ππ I will join CS@Purdue University as a research intern in June 2024.
- 2024.01: ππ One paper has been accepted by GRSL 2024.
π» Recent Projects
(Coming soonβ¦)
(Click the images for more details)
π Publications
(* indicates equal contribution)
Selected Publication

CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought
[ACL 2025 Findings]
Boxuan Zhang and Ruqi Zhang
TL;DR: Propose to quantify response-wise uncertainty by integrating LLMsβ inherent reasoning capabilities through Chain-of-Thought (CoT) into the UQ process.
@article{zhang2025cot,
title={CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought},
author={Zhang, Boxuan and Zhang, Ruqi},
journal={arXiv preprint arXiv:2502.17214},
year={2025}
} 
What If the Input is Expanded in OOD Detection?
[Neural Information Processing Systems (NeurIPS), 2024]
Boxuan Zhang*, Jianing Zhu*, Zengmao Wang, Tongliang Liu, Bo Du, and Bo Han
TL;DR: Propose a novel perspective to employ different common corruptions on the input space to expand the representation dimension for OOD detection.
[PDF] Β [Project Page] Β [Code] Β [BibTeX]
@inproceedings{zhang2024what,
title={What If the Input is Expanded in OOD Detection?},
author={Zhang, Boxuan and Zhu, Jianing and Wang, Zengmao and Liu, Tongliang and Du, Bo and Han, Bo},
booktitle={The Thirty-Eighth Annual Conference on Neural Information Processing Systems},
year={2024},
} Working Preprint

Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents
[Preprint. Under Review]
Boxuan Zhang*, Yi Yu*, Jiaxuan Guo, and Jing Shao
TL;DR: We present a comprehensive evaluation framework for quantifying self-replication risks. Our framework establishes authentic production environments and realistic tasks to enable scenario-driven assessment of agent behaviors.
@article{zhang2025dive,
title={Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents},
author={Zhang, Boxuan and Yu, Yi and Guo, Jiaxuan and Shao, Jing},
journal={arXiv preprint arXiv:2509.25302},
year={2025}
} 
What Shapes a Creative Machine Mind? Comprehensively Benchmarking Creativity in Foundation Models
[Preprint. Under Review]
Zicong He*, Boxuan Zhang*, Weihao Liu*, Ruixiang Tang, and Lu Cheng
TL;DR: We introduce C2-Eval, a holistic benchmark for the unified assessment of creativity in foundation models. C2-Eval distinguishes between two complementary forms of Creativity (C2): convergent creativity, where tasks admit constrained solutions, and divergent creativity, where tasks are open-ended.
@article{he2025what,
title={What Shapes a Creative Machine Mind? Comprehensively Benchmarking Creativity in Foundation Models},
author={He, Zicong and Zhang, Boxuan and Liu, Weihao and Tang, Ruixiang and Cheng, Lu},
journal={arXiv preprint arXiv:2510.04009},
year={2025}
} π Education
- 2025.09 - present, PhD, Department of Computer Science, Rutgers University, United States.
- 2022.09 - 2024.06, Master, School of Computer Science, Wuhan University, China.
- 2018.09 - 2022.06, Undergraduate, School of Computer Science, Wuhan University, China.
π¨π»βπ» Research Experience
- 2024.11 - 2025.08, Research Assistant
Department of Computer Science, University of Illinois Chicago (UIC)
Supervisor: Prof. Lu Cheng - 2024.06 - 2025.02, Research Intern
Department of Computer Science, Purdue University
Supervisor: Prof. Ruqi Zhang - 2023.11 - 2024.06, Research Intern
TMLR Group, Hong Kong Baptist University (HKBU)
Supervisor: Prof. Bo Han
Collaborate with: Dr. Jianing Zhu
- 2022.09 - 2024.06, Research Assistant
SIGMA Lab, Wuhan University (WHU)
Supervisor: Prof. Zengmao Wang and Prof. Bo Du
πΌ Work Experience
- 2025.03 - 2025.07, Research Assistant
Center of Safe & Trustworthy AI, Shanghai AI Laboratory
Mentor: Dr. Yi Yu and Dr. Dongrui Liu
ππ»ββοΈ Academic Service
- Journal Reviewer: ISPRS Journal of Photogrammetry and Remote Sensing
- Conference Reviewer: NeurIPS (2025, 2026) ICLR (2026) ICML (2026)






