Huawen Feng

🧑‍🎓 About Me

🎓 I’m a third-year Ph.D. student in School of Computer Science and Engineering at South China University of Technology (SCUT), advised by Prof. Qianli Ma. Previously, I obtained my bachelor’s degree from this institute with the qualification for postgraduate recommendation and later applied for a direct Ph.D. program in my first year of graduate studies.

🔬 My research focuses on LLM Alignment, Post Training and Preference Optimization.

📄 Google Scholar

ORCID

📚 Research Internship

Tencent Hunyuan X

April 2025 - Now | Qingyun Intern
The research focuses on the reinforcement learning for reasoning ablilities of math LLMs.

Microsoft AI

July 2024 - April 2025 | Research Intern
The research focuses on the methods for data flywheels for code LLMs. Current methods typically rely on off-the-shelf datasets and data augmentation from proprietary LLMs. We propose WarriorCoder, a novel paradigm where the target model learns from expert battles to address these limitations.

Alibaba Tongyi Lab

July 2023 - July 2024 | Research Intern
The research focuses on hallucinations in LLMs and methods for resolving the problem. We propose Contrastive Preference Optimization (CPO) — a method to improve the model’s faithfulness to the context during the generation process without the need for pairwise annotations. Furthermore, we explore the model’s “selective” faithfulness to the context and propose Backtracking Correction (BC) — a reinforcement learning framework that does not require additional data annotations.

📝 Publications

📌 Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought
✍️ Tencent Hunyuan Team
🏛️ Technical Report

📌 WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models
✍️ Huawen Feng, Pu Zhao, Qingfeng Sun, Can Xu, Fangkai Yang, Lu Wang, Qianli Ma, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang
🏛️ 2025 ACL

📌 Training Large Language Models for Retrieval-Augmented Question Answering through Backtracking Correction
✍️ Huawen Feng, Zekun Yao, Junhao Zheng, Qianli Ma
🏛️ 2025 ICLR

📌 Improving Factual Consistency of News Summarization by Contrastive Preference Optimization
✍️ Huawen Feng, Yan Fan, Xiong Liu, Ting-En Lin, Zekun Yao, Yuchuan Wu, Fei Huang, Yongbin Li, Qianli Ma
🏛️ 2024 EMNLP (Findings)

📌 Well Begun Is Half Done: An Implicitly Augmented Generative Framework with Distribution Modification for Hierarchical Text Classification
✍️ Huawen Feng, Jingsong Yan, Junlong Liu, Junhao Zheng, Qianli Ma
🏛️ 2024 COLING

📌 Perturbation-Based Self-Supervised Attention for Attention Bias in Text Classification
✍️ Huawen Feng, Zhenxi Lin, Qianli Ma
🏛️ IEEE/ACM Transactions on Audio, Speech, and Language Processing

📌 Joint Constrained Learning with Boundary-adjusting for Emotion-Cause Pair Extraction
✍️ Huawen Feng, Junlong Liu, Junhao Zheng, Haibin Chen, Xichen Shang, Qianli Ma
🏛️ 2023 ACL

📌 It’s Better to Teach Fishing than Giving a Fish: An Auto-Augmented Structure-aware Generative Model for Metaphor Detection
✍️ Huawen Feng, Qianli Ma
🏛️ 2022 EMNLP (Findings)

📌 More Papers

📬 Contact Information

✉️ Email: [541119578@qq.com]