Can Personal Health Information Be Secured in LLM? Privacy Attack and Defense in the Medical Domain
Citations

SCOPUS

1

초록

Recent advancements have shown that Large Language Models (LLMs) possess significant versatility, making them suitable for applications in many areas. Several studies have shown how general-purpose LLMs can be adapted to domain-specific tasks. However, these domain-adapted LLMs can be exposed to greater privacy risks, which are especially exacerbated in the medical field. In this paper, we present the study investigating the susceptibility of LLMs to leaking sensitive health information. We conduct prompt-based attacks on LLMs trained with medical datasets, showing that medical LLMs can inadvertently disclose confidential patient data. To contribute towards mitigating privacy risks in the medical domain, we implement red teaming defense strategies to make LLMs robust against malicious attacks. For this medical red teaming approach, we develop and publicly release MediRed, a dataset of 1,000 red team attacks. By leveraging this dataset to enhance our defense mechanisms, we achieve up to 56% improvement in privacy protection compared to base models. Our code and dataset are available at https://github.com/yujinKang32/Private_Med_LLM.git.

키워드

DefenseMedical LLMPersonal Health Information (PHI)Privacy attack
제목
Can Personal Health Information Be Secured in LLM? Privacy Attack and Defense in the Medical Domain
저자
Kang, YujinKim, EunsunCho, Yoon-Sik
DOI
10.1145/3719027.3765105
발행일
2025-10
유형
Conference Paper
저널명
CCS 2025 - Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security
페이지
4199 ~ 4213