IEEE International Conference on Computer Vision (ICCV)
Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions
ACM Conference on Computer and Communications Security (CCS)
UnsafeBench: Benchmarking Image Safety Classifiers onReal-World and AI-Generated Images
Usenix Security Symposium (USENIX-Security)
Bridging the Gap in Vision Language Models in IdentifyingUnsafe Concepts Across Modalities
Usenix Security Symposium (USENIX-Security)
HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
Usenix Security Symposium (USENIX-Security)
Usenix Security Symposium (USENIX-Security)
Prompt Stealing Attacks Against Text-to-Image Generation Models
ACM ASIA Conference on Computer and Communications Security (AsiaCCS)
FAKEPCD: Fake Point Cloud Detection via Source Attribution
ACM Conference on Computer and Communications Security (CCS)
Unsafe Diffusion: On the Generation of Unsafe Images and
Hateful Memes From Text-To-Image Models
IEEE Symposium on Security and Privacy (S&P)