IEEE International Conference on Computer Vision (ICCV)
Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions
USENIX Security Symposium (USENIX-Security)
Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications
Annual Meeting of the Association for Computational Linguistics (ACL)
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs
USENIX Security Symposium (USENIX-Security)
SecurityNet: Assessing Machine Learning Vulnerabilities on Public Models
International Conference on Learning Representations (ICLR)
Data Poisoning Attacks Against Multimodal Encoders