PhD Student

Wai Man Si

Address

Stuhlsatzenhaus 5
66123 Saarbrücken (Germany)

Publications by Wai Man Si

Year 2025

Conference / Medium

Conference on Neural Information Processing Systems (NeurIPS)
Finding and Reactivating Post-Trained LLMs’ Hidden Safety Mechanisms

Conference / Medium

International Conference on Learning Representations (ICLR)
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

Year 2023

Conference / Medium

Usenix Security Symposium (USENIX-Security)
Two-in-One: A Model Hijacking Attack Against Text Generation Models

Year 2022

Conference / Medium

ACM Conference on Computer and Communications Security (CCS)