%0 Conference Proceedings
%T Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks
%A Rao, Abhinav Sukumar
%A Naik, Atharva Roshan
%A Vashistha, Sachin
%A Aditya, Somak
%A Choudhury, Monojit
%Y Calzolari, Nicoletta
%Y Kan, Min-Yen
%Y Hoste, Veronique
%Y Lenci, Alessandro
%Y Sakti, Sakriani
%Y Xue, Nianwen
%S Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
%D 2024
%8 May
%I ELRA and ICCL
%C Torino, Italia
%F rao-etal-2024-tricking
%U https://aclanthology.org/2024.lrec-main.1462/
%P 16802-16830