Publications

  1. JMLR
    Large sample spectral analysis of graph-based multi-manifold clustering
    Nicolas Garcia Trillos*, Pengfei He*, and Chenghui Li*
    Journal of Machine Learning Research (JMLR), 2023
  2. TMLR
    Stealthy Backdoor Attack via Confidence-driven Sampling
    Pengfei He, Yue Xing, Han Xu, and 6 more authors
    Transactions on Machine Learning Research, 2024
  3. SIGKDD Explor.
    FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
    Yingqian Cui, Jie Ren, Yuping Lin, and 7 more authors
    SIGKDD Explor. Newsl., Jan 2025
  4. Stat
    Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study
    Pengfei He, Yingqian Cui, Han Xu, and 4 more authors
    Stat, Jan 2024
  1. CIKM 2022
    PROPN: Personalized Probabilistic Strategic Parameter Optimization in Recommendations
    Pengfei He, Haochen Liu, Xiangyu Zhao, and 2 more authors
    In Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM), 2022
  2. ICML 2023
    Probabilistic Categorical Adversarial Attack and Adversarial Training
    Han Xu, Pengfei He, Jie Ren, and 4 more authors
    In International Conference on Machine Learning (ICML), 2023
  3. ICLR ’24 Spotlight
    Sharpness-Aware Data Poisoning Attack
    Pengfei He, Han Xu, Jie Ren, and 4 more authors
    In International Conference on Learning Representations (ICLR), 2024
    Spotlight paper (top 5%)
  4. SIGKDD Explor.
    DiffusionShield: A Watermark for Data Copyright Protection against Generative Diffusion Models
    Yingqian Cui, Jie Ren, Han Xu, and 5 more authors
    SIGKDD Explor. Newsl., Jan 2025
  5. ACL 2024
    The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)
    Shenglai Zeng, Jiankun Zhang, Pengfei He, and 8 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024, Aug 2024
  6. ACL 2024
    Exploring Memorization in Fine-tuned Language Models
    Shenglai Zeng, Yaxin Li, Jie Ren, and 7 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  7. EMNLP 2024
    On the Generalization of Training-based ChatGPT Detection Methods
    Han Xu, Jie Ren, Pengfei He, and 5 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Dec 2024
  8. EMNLP 2024
    Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
    Yuping Lin*, Pengfei He*, Han Xu, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Dec 2024
  9. NAACL 2025
    Data Poisoning for In-context Learning
    Pengfei He, Han Xu, Yue Xing, and 3 more authors
    In Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics, Apr 2025
  10. AISTATS 2025
    Superiority of Multi-Head Attention in In-Context Linear Regression
    Yingqian Cui, Jie Ren, Pengfei He, and 2 more authors
    In AISTATS, Apr 2025
  11. AISTATS 2025
    A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
    Yingqian Cui, Pengfei He, Xianfeng Tang, and 4 more authors
    In AISTATS, Apr 2025
  12. EMNLP 2025
    Advancing Reasoning with Off-the-Shelf LLMs: A Semantic Structure Perspective
    Pengfei He, Zitao Li, Yue Xing, and 3 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2025, Apr 2025
  13. EMNLP 2025
    Mitigating the Privacy Issues in Retrieval-Augmented Generation (RAG) via Pure Synthetic Data
    Shenglai Zeng, Jiankun Zhang, Pengfei He, and 7 more authors
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Apr 2025
  14. ACL 2025
    Red-Teaming LLM Multi-Agent Systems via Communication Attacks
    Pengfei He, Yuping Lin, Shen Dong, and 3 more authors
    In ACL, Apr 2025
  15. ACL 2025
    Unveiling Privacy Risks in LLM Agent Memory
    Bo Wang, Weiyi He, Shenglai Zeng, and 4 more authors
    In ACL, Apr 2025
  16. ACL 2025
    Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models
    Yingqian Cui, Pengfei He, Jingying Zeng, and 8 more authors
    In ACL, Apr 2025
  17. ACL 2025
    Towards Context-Robust LLMs: A Gated Representation Fine-tuning Approach
    Shenglai Zeng, Pengfei He, Kai Guo, and 4 more authors
    In ACL, Apr 2025
  18. NeurIPS 2025
    Memory Injection Attacks on LLM Agents via Query-Only Interaction
    Shen Dong, Shaochen Xu, Pengfei He, and 5 more authors
    In NeurIPS, Apr 2025
  19. ACL 2026
    Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems
    Pengfei He, Zhenwei Dai, Xianfeng Tang, and 9 more authors
    In ACL, Apr 2026
  20. ICLR 2026
    TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
    Pengfei He, Zhenwei Dai, Bing He, and 8 more authors
    In International Conference on Learning Representations (ICLR), Apr 2026
  21. EACL 2026
    PEAR: Planner-Executor Agent Robustness Benchmark
    Shen Dong, Mingxuan Zhang, Pengfei He, and 4 more authors
    In EACL, Apr 2026
  22. ICML 2026
    Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents
    Pengfei He, Ash Fox, Lesly Miculicich, and 7 more authors
    In International Conference on Machine Learning (ICML), Apr 2026
  23. TALE 2025
    Human-AI Collaboration for Knowledge-in-use Assessment Design: Leveraging LLMs with RAG
    Juanhui Li, Tingting Li, Hang Li, and 4 more authors
    In 2025 IEEE International Conference on Teaching, Assessment, and Learning for Engineering (TALE), Apr 2025
  24. ACL 2026
    Retrieval Heads are Dynamic
    Yuping Lin, Zitao Li, Yue Xing, and 6 more authors
    In ACL, Apr 2026
  25. ACL 2026
    How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior
    Zidi Xiong, Yuping Lin, Wenya Xie, and 5 more authors
    In ACL, Apr 2026

Preprints

  1. preprint
    Copyright Protection in Generative AI: A Technical Perspective
    Jie Ren, Han Xu, Pengfei He, and 8 more authors
    2024
  2. preprint
    Multi-Faceted Studies on Data Poisoning can Advance LLM Development
    Pengfei He, Yue Xing, Han Xu, and 2 more authors
    2025
  3. preprint
    Comprehensive Vulnerability Analysis is Necessary for Trustworthy LLM-MAS
    Pengfei He, Yue Xing, Shen Dong, and 8 more authors
    2025