Annual Meeting of the Association for Computational Linguistics (ACL)
Transactions on Machine Learning Research
Proceedings of the 12th ACM Cyber-Physical System Security Workshop
IEEE Conference on Secure and Trustworthy Machine Learning (SaTML)
Efficient Semi-Supervised Adversarial Training via Latent Clustering-Based Data Reduction
Conference on Neural Information Processing Systems (NeurIPS)
GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs