ACM Conference on Computer and Communications Security (CCS)
MGTBench: Benchmarking Machine-Generated Text Detection
ACM Conference on Computer and Communications Security (CCS)
"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models