PhD Student

Yixin Wu

Email

Address

Stuhlsatzenhaus 5
66123 Saarbrücken (Germany)

Publications by Yixin Wu

Year 2026

Conference / Medium

Annual Meeting of the Association for Computational Linguistics (ACL)
Peering Behind the Shield: Guardrail Identification in Large Language Models

Conference / Medium

Annual Meeting of the Association for Computational Linguistics (ACL)
InferPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents

Conference / Medium

Annual Meeting of the Association for Computational Linguistics (ACL)
Rethinking Assessments of Prompt Injection Attacks

Year 2025

Conference / Medium

ACM Conference on Computer and Communications Security (CCS)
UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images

Conference / Medium

USENIX Security Symposium (USENIX-Security)
Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications

Conference / Medium

USENIX Security Symposium (USENIX-Security)
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts

Conference / Medium

USENIX Security Symposium (USENIX-Security)
HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns

Article

Foundations and Trends® in Privacy and Security
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety

Year 2024

Conference / Medium

Conference on Empirical Methods in Natural Language Processing (EMNLP)
The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective