PhD Student

Wai Man Si

Email

Address

Stuhlsatzenhaus 5
66123 Saarbrücken (Germany)

Publications by Wai Man Si

Year 2025

Conference / Venue

Conference on Neural Information Processing Systems (NeurIPS)
Finding and Reactivating Post-Trained LLMs’ Hidden Safety Mechanisms

Conference / Venue

International Conference on Learning Representations (ICLR)
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

Year 2023

Conference / Venue

USENIX Security Symposium (USENIX-Security)
Two-in-One: A Model Hijacking Attack Against Text Generation Models

Year 2022

Conference / Venue

ACM Conference on Computer and Communications Security (CCS)