Research
My research focuses on AI Software Engineering, especially the trustworthiness of large language model (LLM)-based systems. I am particularly engaged in:
- Improving the reliability and security of LLM-based systems by studying their failure modes and robustness.
- Exploring LLM-assisted development workflows, including vibe coding and AI-driven code review, to support human–AI collaboration in software engineering.
|
|
|
Prompting Instability: An Empirical Study of LLM Robustness in Code Vulnerability Detection
Shuo Han,
Tao Tan,
Yuantian Miao,
Xiao Chen,
Nan Sun
AJCAI, 2025
Paper
A large-scale empirical study revealing instability of LLMs under paraphrased prompts in vulnerability detection, highlighting the need for robust prompting and model refinement.
|
|
|
Soft-Label Integration for Robust Toxicity Classification
Zelei Cheng,
Xian Wu,
Jiahao Yu,
Shuo Han,
Xin-Qiang Cai,
Xinyu Xing
NeurIPS, 2024
Paper
A bi-level optimization framework that integrates crowdsourced annotations with soft-labeling and optimizes them using GroupDRO to enhance robustness against out-of-distribution risks.
|
|