Announcement_24

September 20, 2024

2024

Two papers( Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis; On the generalization of training-based chatgpt detection methods) accepted to EMNLP 2024!