ACM Conference on Computer and Communications Security (CCS)
UnsafeBench: Benchmarking Image Safety Classifiers onReal-World and AI-Generated Images
Annual Meeting of the Association for Computational Linguistics (ACL)
When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs
Annual Meeting of the Association for Computational Linguistics (ACL)
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs
IEEE Symposium on Security and Privacy (S&P)
GPTracker: A Large-Scale Measurement of Misused GPTs
IEEE Symposium on Security and Privacy (S&P)
On the Effectiveness of Prompt Stealing Attacks on In-The-Wild Prompts
Usenix Security Symposium (USENIX-Security)
HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
Usenix Security Symposium (USENIX-Security)
Annual Meeting of the Association for Computational Linguistics (ACL)
Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media
ACM Conference on Computer and Communications Security (CCS)
MGTBench: Benchmarking Machine-Generated Text Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP)
The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective