Publications

For the recent publications, please go to my Google Scholar directly.

Preprints

    1. Preprint
      swe_perf.png
      SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?
      Xinyi He, Qian Liu, Mingzhe Du, and 5 more authors
      arXiv preprint arXiv:2507.12415, 2025
    2. Preprint
      nexus_cover.jpg
      Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis
      Dong Huang, Mingzhe Du, Jie M Zhang, and 4 more authors
      arXiv preprint, 2025
    3. Preprint
      secure_code_cover.jpg
      Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model
      Tianyi Wu, Mingzhe Du, Yue Liu, and 4 more authors
      arXiv preprint, 2025
    4. Preprint
      scaling_code_cover.jpg
      Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models
      Xiao Zhu, Xinyu Zhou, Boyu Zhu, and 5 more authors
      arXiv preprint, 2025
    5. Preprint
      paper_espresso_cover.jpg
      Paper Espresso: From Paper Overload to Research Insight
      Mingzhe Du, Anh Tuan Luu, Dong Huang, and 1 more author
      arXiv preprint, 2025

          Published Papers

          1. ICLR
            deceptive_llm.jpg
            Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts
            Zhaomin Wu, Mingzhe Du, See-Kiong Ng, and 1 more author
            International Conference on Learning Representations, 2026
          2. TOSEM
            real_world_test.jpg
            Benchmarking LLMs for Unit Test Generation from Real-World Functions
            Dong Huang, Jie M Zhang, Mark Harman, and 3 more authors
            ACM Transactions on Software Engineering and Methodology, 2026
          3. ACL
            vulnerability_repair_cover.jpg
            Semantics-Aligned, Curriculum-Driven, and Reasoning-Enhanced Vulnerability Repair Framework
            Chengran Yang, Ting Zhang, Jinfeng Jiang, and 9 more authors
            Proceedings of the Association for Computational Linguistics, 2026
          4. EACL
            pro_quest_cover.jpg
            Pro-QuEST: Prompt-chaining Quiz Engine for testing Specialized Technical Product Knowledge
            Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, and 2 more authors
            Proceedings of the European Chapter of the Association for Computational Linguistics, 2026
          5. AMIYA
            arabic_dialect_cover.jpg
            Improving Arabic Dialectness in LLMs with Reinforcement Learning
            Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, and 2 more authors
            AMIYA Workshop, 2026
          1. ACL
            arena_cover.jpg
            CodeArena: A collective evaluation platform for LLM code generation
            Mingzhe Du, Anh Tuan Luu, Bin Ji, and 5 more authors
            Proceedings of the Association for Computational Linguistics, 2025
          2. ACL
            knowledge_cover.png
            AntiLeakBench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
            Xiaobao Wu, Liangming Pan, Yuxi Xie, and 7 more authors
            Proceedings of the Association for Computational Linguistics, 2025
          3. AAAI
            verifiable_cover.png
            Towards Verifiable Text Generation with Generative Agent
            Bin Ji, Huijun Liu, Mingzhe Du, and 5 more authors
            Proceedings of the AAAI Conference on Artificial Intelligence, 2025
          4. R2-FM@ICML
            guardreasoner_vl.png
            Guardreasoner-VL: Safeguarding VLMs via Reinforced Reasoning
            Yue Liu, Shengfang Zhai, Mingzhe Du, and 9 more authors
            R2-FM Workshop at ICML 2025, 2025
          5. PRAL@ICML
            afterburner.png
            Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
            Mingzhe Du, Anh Tuan Luu, Yue Liu, and 6 more authors
            PRAL Workshop at ICML 2025, 2025
          6. JMIR
            eraly_maladaptive_schemas.png
            Unraveling Online Mental Health Through the Lens of Early Maladaptive Schemas: AI-Enabled Content Analysis of Online Mental Health Communities
            Beng Heng Ang, Sujatha Das Gollapalli, Mingzhe Du, and 1 more author
            Journal of Medical Internet Research, 2025
          7. NeurIPS
            effibench_x.png
            EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code
            Yuhao Qing, Boyu Zhu, Mingzhe Du, and 9 more authors
            Conference on Neural Information Processing Systems, 2025
          8. ICSE
            influence_incorrect_code.png
            Measuring the Influence of Incorrect Code on Test Generation
            Dong Huang, Jie M Zhang, Mark Harman, and 2 more authors
            the International Conference on Software Engineering, 2025
          9. NeurIPS
            afterburner.png
            Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
            Mingzhe Du, Anh Tuan Luu, Yue Liu, and 6 more authors
            Conference on Neural Information Processing Systems, 2025
          10. NeurIPS
            guardreasoner_vl.png
            Guardreasoner-VL: Safeguarding VLMs via Reinforced Reasoning
            Yue Liu, Shengfang Zhai, Mingzhe Du, and 9 more authors
            Conference on Neural Information Processing Systems, 2025
          11. EMNLP
            service_request_cover.jpg
            On Assigning Product and Software Codes to Service Requests with Large Language Models
            Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, and 2 more authors
            Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2025
          12. ICML
            model_licensing_cover.jpg
            Position: Current Model Licensing Practices are Dragging Us into a Quagmire of Legal Noncompliance
            Moming Duan, Mingzhe Du, Rui Zhao, and 4 more authors
            International Conference on Machine Learning, 2025
          13. SAC
            curriculum_demo_cover.jpg
            Curriculum Demonstration Selection for In-Context Learning
            Duc Anh Vu, Cong-Duy Nguyen, Xiaobao Wu, and 4 more authors
            ACM/SIGAPP Symposium On Applied Computing, 2025
          1. AAAI
            dynamind_cover.png
            From Static to Dynamic: Knowledge Metabolism for Large Language Models
            Mingzhe Du, Anh Tuan Luu, Bin Ji, and 1 more author
            Proceedings of the AAAI Conference on Artificial Intelligence, 2024
          2. NeurIPS
            mercury_cover.jpeg
            Mercury: A Code Efficiency Benchmark for Code Large Language Models
            Mingzhe Du, Anh Tuan Luu, Bin Ji, and 2 more authors
            Conference on Neural Information Processing Systems, 2024
          3. ECAI
            health_cover.jpg
            Counseling Responses for Mental Health Forum Questions with Early Maladaptive Schema Prediction
            Das Gollapalli Sujatha, Ang Beng Heng, Du Mingzhe, and 1 more author
            European Conference on Artificial Intelligence, 2024
          4. AAAI
            chain_cover.jpg
            Chain-of-Thought Improves Text Generation with Citations in Large Language Models
            Bin Ji, Huijun Liu, Mingzhe Du, and 1 more author
            In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
          1. TLDK
            constituency_cover.jpeg
            Constituency-Informed and Constituency-Constrained Extractive Question Answering with Heterogeneous Graph Transformer
            Mingzhe Du, Mouad Hakam, See-Kiong Ng, and 1 more author
            In Transactions on Large-Scale Data-and Knowledge-Centered Systems LIII, 2023
          2. WWW
            twitter_cover.png
            Identifying Checkworthy Cure Claims on Twitter
            Sujatha Das Gollapalli, Mingzhe Du, and See-Kiong Ng
            In Proceedings of the ACM Web Conference 2023, 2023
          3. AAAI
            aaai23_cover.jpg
            Generating Reflective Questions for Engaging Gallery Visitors in ArtMuse
            Sujatha Das Gollapalli, Mingzhe Du, and See-Kiong Ng
            In Proceedings of the AAAI Conference on Artificial Intelligence, 2023
          1. CLEF
            checkthat5_cover.png
            NUS-IDS at CheckThat! 2022: identifying check-worthiness of tweets using CheckthaT5
            Du Mingzhe, Gollapalli Sujatha Das, and Ng See-Kiong
            Working Notes of CLEF, 2022