Publications

Publications

  1. JMLR
    Large sample spectral analysis of graph-based multi-manifold clustering
    Nicolas Garcia Trillos, Pengfei He, and Chenghui Li
    Journal of Machine Learning Research (JMLR), 2023
  2. Analyzing illegal psychostimulant trafficking networks using noisy and sparse data
    Margret V Bjarnadottir, Siddharth Chandra, Pengfei He, and 1 more author
    IISE Transactions, 2024
  3. TMLR
    Stealthy Backdoor Attack via Confidence-driven Sampling
    Pengfei He, Han Xu, Yue Xing, and 6 more authors
    Transactions on Machine Learning Research(TMLR), 2024
  4. NeruIPS workshop
    Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study
    Pengfei He, Yingqian Cui, Han Xu, and 4 more authors
    M3L & SFLLM workshop NeruIPS 2024, 2024
  1. CIKM
    PROPN: Personalized Probabilistic Strategic Parameter Optimization in Recommendations
    Pengfei He, Haochen Liu, Xiangyu Zhao, and 2 more authors
    In Proceedings of the 31st ACM International Conference on Information & Knowledge Management (CIKM), 2022
  2. ICML
    Probabilistic Categorical Adversarial Attack and Adversarial Training
    Han Xu, Pengfei He, Jie Ren, and 4 more authors
    In International Conference on Machine Learning (ICML), 2023
  3. ICLR Spotlight
    Sharpness-Aware Data Poisoning Attack
    Pengfei He, Han Xu, Jie Ren, and 4 more authors
    In International Conference on Learning Representations (ICLR), 2024
    Spotlight Paper, 5%
  4. ACL
    The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)
    Shenglai Zeng, Jiankun Zhang, Pengfei He, and 8 more authors
    In Findings of the Association for Computational Linguistics ACL 2024, Aug 2024
  5. ACL
    Exploring Memorization in Fine-tuned Language Models
    Shenglai Zeng, Yaxin Li, Jie Ren, and 7 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  6. EMNLP
    On the Generalization of Training-based ChatGPT Detection Methods
    Han Xu, Jie Ren, Pengfei He, and 5 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Dec 2024
  7. EMNLP
    Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
    Pengfei He, Yuping Lin, Han Xu, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Dec 2024

Preprints

  1. DiffusionShield: A Watermark for Copyright Protection against Generative Diffusion Models
    Yingqian Cui, Jie Ren, Han Xu, and 4 more authors
    2023
  2. FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
    Yingqian Cui, Jie Ren, Yuping Lin, and 6 more authors
    2023
  3. Data Poisoning for In-context Learning
    Pengfei He, Han Xu, Yue Xing, and 3 more authors
    2024
  4. Copyright Protection in Generative AI: A Technical Perspective
    Jie Ren, Han Xu, Pengfei He, and 8 more authors
    2024
  5. Superiority of Multi-Head Attention in In-Context Linear Regression
    Yingqian Cui, Jie Ren, Pengfei He, and 2 more authors
    2024
  6. A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
    Yingqian Cui, Pengfei He, Xianfeng Tang, and 4 more authors
    2024
  7. Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning
    Pengfei He, Zitao Li, Yue Xing, and 3 more authors
    2024