Yueqi Xie 谢悦琪

I am a postdoctoral research associate at Princeton University, working with Prof. Yu Xie on AI and society. I received my Ph.D. in computer science from HKUST, advised by Prof. Qifeng Chen and Prof. Sunghun KIM. I received a bachelor's degree in Intelligence Science and Technology from Peking University.

Email  /  Google Scholar

profile photo
Research

I work on interdisciplinary study of AI and Social Science, including AI's social impact, socially responsible AI (AI safety, security, and privacy), and computational social science. My long-term goal is to understand and advance the relationship between AI and society, exploring both its technological and social dimensions.

Topics I currently focus on:

Education / Experience

Selected Work

(* joint first authors, # corresponding authors)

Full Publication List on Google Scholar.

Socially Responsible Generative AI
Defending ChatGPT against Jailbreak Attack via Self-Reminder
Yueqi Xie*, Jingwei Yi*, Jiawei Shao, Justin Curl, Lingjuan Lyu, Qifeng Chen, Xing Xie, Fangzhao Wu
Nature Machine Intelligence, 2023
/ / / Media Coverage: TechXplore, Tech Times
GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis
Yueqi Xie, Minghong Fang, Renjie Pi, Neil Gong
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
/ /
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
Jingwei Yi*, Yueqi Xie*, Bin Zhu, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025
/ /
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Renjie Pi*, Tianyang Han*, Jianshu Zhang*, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang
The Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
/ /
Evidencing Unauthorized Training Data from AI Generated Content using Information Isotopes
Tao Qi, Jinhua Yin, Dongqi Cai, Yueqi Xie, Huili Wang, Zhiyang Hu, Peiru Yang, Guoshun Nan, Zhili Zhou, Shangguang Wang, Lingjuan Lyu, Yongfeng Huang, Nicholas D. Lane
Preprint, 2025
AI's Societal Impact / Computational Social Science
Measuring Human Contribution in AI-Assisted Content Generation
Yueqi Xie*, Tao Qi*, Jingwei Yi*, Ryan Whalen, Junming Huang, Qian Ding, Yu Xie, Xing Xie, Fangzhao Wu
Preprint, 2024
/ /
Uncovering inequalities in new knowledge learning by large language models across different languages
Chenglong Wang, Haoyu Tang, Xiyuan Yang, Yueqi Xie#, Jina Suh, Sunayana Sitaram, Junming Huang, Yu Xie, Zhaoya Gong#, Xing Xie, Fangzhao Wu#
Preprint, 2025
Secure and Robust Federated Learning
PoisonedFL: Model Poisoning Attacks to Federated Learning via Multi-Round Consistency
Yueqi Xie, Minghong Fang, Neil Gong
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
/
Defending against Model Poisoning Attacks for Federated Learning using Model Update Reconstruction Error
Yueqi Xie, Minghong Fang, Neil Gong
International Conference on Machine Learning (ICML), 2024
/ /
DYNAFED: Tackling Client Data Heterogeneity with Global Dynamics
Renjie Pi*, Weizhong Zhang*, Yueqi Xie*, Jiahui Gao, Xiaoyu Wang, Sunghun Kim, Qifeng Chen
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
/ /
Awards

Source code from Jon Barron, Aljaž Božič, XINTAO and Ekun.