Publications | Mingzhe Du

Preprints

Preprint

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Xinyi He, Qian Liu, Mingzhe Du, and 5 more authors

arXiv preprint arXiv:2507.12415, 2025

PDF
Preprint

Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis

Dong Huang, Mingzhe Du, Jie M Zhang, and 4 more authors

arXiv preprint, 2025
Preprint

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Tianyi Wu, Mingzhe Du, Yue Liu, and 4 more authors

arXiv preprint, 2025
Preprint

Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Xiao Zhu, Xinyu Zhou, Boyu Zhu, and 5 more authors

arXiv preprint, 2025
Preprint

Paper Espresso: From Paper Overload to Research Insight

Mingzhe Du, Anh Tuan Luu, Dong Huang, and 1 more author

arXiv preprint, 2025

Published Papers

ICLR

Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts

Zhaomin Wu, Mingzhe Du, See-Kiong Ng, and 1 more author

International Conference on Learning Representations, 2026

PDF
TOSEM

Benchmarking LLMs for Unit Test Generation from Real-World Functions

Dong Huang, Jie M Zhang, Mark Harman, and 3 more authors

ACM Transactions on Software Engineering and Methodology, 2026

PDF
ACL

Semantics-Aligned, Curriculum-Driven, and Reasoning-Enhanced Vulnerability Repair Framework

Chengran Yang, Ting Zhang, Jinfeng Jiang, and 9 more authors

Proceedings of the Association for Computational Linguistics, 2026
EACL

Pro-QuEST: Prompt-chaining Quiz Engine for testing Specialized Technical Product Knowledge

Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, and 2 more authors

Proceedings of the European Chapter of the Association for Computational Linguistics, 2026
AMIYA

Improving Arabic Dialectness in LLMs with Reinforcement Learning

Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, and 2 more authors

AMIYA Workshop, 2026

ACL

CodeArena: A Collective Evaluation Platform for LLM Code Generation

Mingzhe Du, Anh Tuan Luu, Bin Ji, and 5 more authors

Proceedings of the Association for Computational Linguistics, 2025

Code
ACL

AntiLeakBench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

Xiaobao Wu, Liangming Pan, Yuxi Xie, and 7 more authors

Proceedings of the Association for Computational Linguistics, 2025

PDF
AAAI

Towards Verifiable Text Generation with Generative Agent

Bin Ji, Huijun Liu, Mingzhe Du, and 5 more authors

Proceedings of the AAAI Conference on Artificial Intelligence, 2025

PDF
R2-FM@ICML

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Yue Liu, Shengfang Zhai, Mingzhe Du, and 9 more authors

R2-FM Workshop at ICML, 2025

PDF
PRAL@ICML

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Mingzhe Du, Anh Tuan Luu, Yue Liu, and 6 more authors

PRAL Workshop at ICML, 2025

PDF
JMIR

Unraveling Online Mental Health Through the Lens of Early Maladaptive Schemas: AI-Enabled Content Analysis of Online Mental Health Communities

Beng Heng Ang, Sujatha Das Gollapalli, Mingzhe Du, and 1 more author

Journal of Medical Internet Research, 2025

PDF
NeurIPS

EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

Yuhao Qing, Boyu Zhu, Mingzhe Du, and 9 more authors

Conference on Neural Information Processing Systems, 2025

PDF
ICSE

Measuring the Influence of Incorrect Code on Test Generation

Dong Huang, Jie M Zhang, Mark Harman, and 2 more authors

International Conference on Software Engineering, 2025

PDF
NeurIPS

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Mingzhe Du, Anh Tuan Luu, Yue Liu, and 6 more authors

Conference on Neural Information Processing Systems, 2025

PDF
NeurIPS

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Yue Liu, Shengfang Zhai, Mingzhe Du, and 9 more authors

Conference on Neural Information Processing Systems, 2025

PDF
EMNLP

On Assigning Product and Software Codes to Service Requests with Large Language Models

Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, and 2 more authors

Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2025
ICML

Position: Current Model Licensing Practices are Dragging Us into a Quagmire of Legal Noncompliance

Moming Duan, Mingzhe Du, Rui Zhao, and 4 more authors

International Conference on Machine Learning, 2025
SAC

Curriculum Demonstration Selection for In-Context Learning

Duc Anh Vu, Cong-Duy Nguyen, Xiaobao Wu, and 4 more authors

ACM/SIGAPP Symposium On Applied Computing, 2025

AAAI

From Static to Dynamic: Knowledge Metabolism for Large Language Models

Mingzhe Du, Anh Tuan Luu, Bin Ji, and 1 more author

Proceedings of the AAAI Conference on Artificial Intelligence, 2024

PDF Code
NeurIPS

Mercury: A Code Efficiency Benchmark for Code Large Language Models

Mingzhe Du, Anh Tuan Luu, Bin Ji, and 2 more authors

Conference on Neural Information Processing Systems, 2024

PDF Code Website
ECAI

Counseling Responses for Mental Health Forum Questions with Early Maladaptive Schema Prediction

Sujatha Das Gollapalli, Beng Heng Ang, Mingzhe Du, and 1 more author

European Conference on Artificial Intelligence, 2024

PDF Website
AAAI

Chain-of-Thought Improves Text Generation with Citations in Large Language Models

Bin Ji, Huijun Liu, Mingzhe Du, and 1 more author

Proceedings of the AAAI Conference on Artificial Intelligence, 2024

PDF

TLDK

Constituency-Informed and Constituency-Constrained Extractive Question Answering with Heterogeneous Graph Transformer

Mingzhe Du, Mouad Hakam, See-Kiong Ng, and 1 more author

Transactions on Large-Scale Data- and Knowledge-Centered Systems LIII, 2023
WWW

Identifying Checkworthy Cure Claims on Twitter

Sujatha Das Gollapalli, Mingzhe Du, and See-Kiong Ng

Proceedings of the ACM Web Conference, 2023

PDF
AAAI

Generating Reflective Questions for Engaging Gallery Visitors in ArtMuse

Sujatha Das Gollapalli, Mingzhe Du, and See-Kiong Ng

Proceedings of the AAAI Conference on Artificial Intelligence, 2023

PDF Code

CLEF

NUS-IDS at CheckThat! 2022: identifying check-worthiness of tweets using CheckthaT5

Mingzhe Du, Sujatha Das Gollapalli, and See-Kiong Ng

Working Notes of CLEF, 2022

PDF