Research
I work on interdisciplinary study of AI and Social Science, including AI's social impact, socially responsible AI (AI safety, security, and privacy), and computational social science. My long-term goal is to understand and advance the relationship between AI and society, exploring both its technological and social dimensions.
Topics I currently focus on:
Others:
|
Education / Experience
- 2020-Now: Ph.D. Student in Computer Science, HKUST
- 2024/4-Now: Visiting Student Researcher in Sociology at Princeton University, Princeton
- 2023/3-2024/2: Visiting Student Researcher at Duke University, Remote
- 2022/3-2022/8: Research Intern at Microsoft Research Asia, Beijing
- 2016-2020: B.S. in Intelligence Science and Technology, Peking University
- 2019/6-2019/9: Visiting Student at HKUST, Hong Kong
|
Selected Work
(* joint first authors)
Full Publication List on Google Scholar.
|
Socially Responsible Generative AI
Defending ChatGPT against Jailbreak Attack via Self-Reminder
Yueqi Xie*, Jingwei Yi*, Jiawei Shao, Justin Curl, Lingjuan Lyu, Qifeng Chen, Xing Xie, Fangzhao Wu
Nature Machine Intelligence, 2023
/
/
/
Media Coverage: TechXplore,
Tech Times
|
GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis
Yueqi Xie, Minghong Fang, Renjie Pi, Neil Gong
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
/
/
|
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Renjie Pi*, Tianyang Han*, Jianshu Zhang*, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang
The Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
/
/
|
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
Jingwei Yi*, Yueqi Xie*, Bin Zhu, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025
/
/
|
AI's Societal Impact / Computational Social Science
Measuring Human Contribution in AI-Assisted Content Generation
Yueqi Xie*, Tao Qi*, Jingwei Yi*, Ryan Whalen, Junming Huang, Qian Ding, Yu Xie, Xing Xie, Fangzhao Wu
Preprint, 2024
/
/
|
Secure and Robust Federated Learning
Defending against Model Poisoning Attacks for Federated Learning using Model Update Reconstruction Error
Yueqi Xie, Minghong Fang, Neil Gong
International Conference on Machine Learning (ICML), 2024
/
/
|
DYNAFED: Tackling Client Data Heterogeneity with Global Dynamics
Renjie Pi*, Weizhong Zhang*, Yueqi Xie*, Jiahui Gao, Xiaoyu Wang, Sunghun Kim, Qifeng Chen
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
/
/
|
PoisonedFL: Model Poisoning Attacks to Federated Learning via Multi-Round Consistency
Yueqi Xie, Minghong Fang, Neil Gong
Preprint, 2024
/
|
|