Chujie Zheng 郑楚杰
Welcome! I am a fourth-year Ph.D. student in the CoAI group at Tsinghua University (THU), advised by Prof. Minlie Huang. Before my Ph.D. studies, I received my B.Sc. degree from the Department of Physics, Tsinghua University.
I have a broad research interest in building trustworthy language models, with a current focus on improving their robustness and alignment. Previously, I conducted research on language models for social good, especially empathetic dialogue systems. I have also built a series of popular NLP datasets, including ChID, KdConv, ESConv, and CDConv.
Education
- Aug 2020 - present. Ph.D. student in the CoAI group, Department of Computer Science and Technology, Tsinghua University. Advisor: Prof. Minlie Huang
- Aug 2016 - Jul 2020. B.Sc. in Physics, Tsinghua University. Major GPA: 3.98/4.00 (ranking 2/59)
Main Papers
* indicates equal contribution.
- Chujie Zheng, Hao Zhou, Fandong Meng, Jie Zhou, Minlie Huang. On Large Language Models’ Selection Bias in Multi-Choice Questions. arXiv:2309.03882. [paper]
- Chujie Zheng, Sahand Sabour, Jiaxin Wen, Zheng Zhang, Minlie Huang. AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation. Findings of ACL 2023. [paper] [repo]
- Chujie Zheng, Pei Ke, Zheng Zhang, Minlie Huang. Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning. Findings of ACL 2023. [paper] [repo]
- Chujie Zheng*, Jinfeng Zhou*, Yinhe Zheng, Libiao Peng, Zhen Guo, Wenquan Wu, Zhengyu Niu, Hua Wu, Minlie Huang. CDConv: A Benchmark for Contradiction Detection in Chinese Conversations. EMNLP 2022. [paper] [repo]
Other Papers
- Jinfeng Zhou*, Chujie Zheng*, Bo Wang, Zheng Zhang, Minlie Huang. CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response Generation. ACL 2023. [paper] [repo]
- Yuxian Gu*, Jiaxin Wen*, Hao Sun*, Yi Song, Pei Ke, Chujie Zheng, Zheng Zhang, Jianzhu Yao, Lei Liu, Xiaoyan Zhu, Minlie Huang. EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training. Machine Intelligence Research 2023. [paper] [repo]
- Jiawen Deng*, Jingyan Zhou*, Hao Sun, Chujie Zheng, Fei Mi, Helen Meng, Minlie Huang. COLD: A Benchmark for Chinese Offensive Language Detection. EMNLP 2022. [paper] [repo]
- Hao Sun*, Guangxuan Xu*, Jiawen Deng, Jiale Cheng, Chujie Zheng, Hao Zhou, Nanyun Peng, Xiaoyan Zhu, Minlie Huang. On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark. Findings of ACL 2022. [paper] [repo]
- Sahand Sabour, Chujie Zheng, Minlie Huang. CEM: Commonsense-aware Empathetic Response Generation. AAAI 2022. [paper] [repo]
- Chujie Zheng, Minlie Huang. Exploring Prompt-based Few-shot Learning for Grounded Dialog Generation. arXiv:2109.06513. [paper]
- Hao Zhou*, Pei Ke*, Zheng Zhang*, Yuxian Gu, Yinhe Zheng, Chujie Zheng, Yida Wang, Chen Henry Wu, Hao Sun, Xiaocong Yang, Bosi Wen, Xiaoyan Zhu, Minlie Huang, Jie Tang. EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training. arXiv:2108.01547. [paper] [repo]
- Siyang Liu*, Chujie Zheng*, Orianna Demasi, Sahand Sabour, Yu Li, Zhou Yu, Yong Jiang, Minlie Huang. Towards Emotional Support Dialog Systems. ACL 2021. [paper] [repo]
- Chujie Zheng, Yong Liu, Wei Chen, Yongcai Leng, Minlie Huang. CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation. Findings of ACL 2021. [paper] [repo]
- Hao Sun*, Zhenru Lin*, Chujie Zheng, Siyang Liu, Minlie Huang. PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health Support. Findings of ACL 2021. [paper] [repo]
- Chujie Zheng, Yunbo Cao, Daxin Jiang, Minlie Huang. Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation. Findings of EMNLP 2020. [paper] [repo]
- Hao Zhou*, Chujie Zheng*, Kaili Huang, Minlie Huang, Xiaoyan Zhu. KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation. ACL 2020. [paper] [repo]
- Chujie Zheng, Minlie Huang, Aixin Sun. ChID: A Large-scale Chinese IDiom Dataset for Cloze Test. ACL 2019. [paper] [repo]
Selected Awards and Honors
- Comprehensive Scholarship (2nd Prize), Tsinghua University, 2022
- Comprehensive Scholarship (2nd Prize), Tsinghua University, 2021
- Excellent Thesis (Top 5/100), Tsinghua University, 2020
- Outstanding Graduate, Tsinghua University, 2020
- Chi-Sun YEH (叶企孙) Scholarship (Top 5/100), Dept. of Physics, Tsinghua University, 2020
- National Scholarship (Top 2/100), 2019
- Overall Excellence Scholarship, Tsinghua University, 2018
Talks
- Nov 2022, Shanghai AI Lab. Towards Well-behaved Dialogue Systems.
- Jul 2021, AI Time. Approaches of Empathy Expression and Emotional Support in Dialogue Systems. [video]
- Nov 2020, Biendata & PaperWeekly. Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation. [video]
- Jul 2020, AI Time. KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation. [video]
Services
- Reviewer: ACL (22/23), EMNLP (21/22/23), AAAI (22/23), EACL (23), KNOSYS, TIST
- Review Assistant: EMNLP (20), AAAI (21), COLING (20)
- Organizer:
  - May 2020 - Aug 2020. SMP2020-ECDT Task 2
  - Jun 2019 - Nov 2019. Chinese Idiom MRC Competition [data & codes]
Experiences
- Feb 2022 - Jun 2022. Research Intern. General Dialogue Group, Baidu, Beijing, China.
- Jun 2020 - Sep 2020. Research Intern. AI Interactive Technology Team, Sogou, Hangzhou, China.