상세 보기
- Kang, Yujin;
- Kim, Eunsun;
- Cho, Yoon-Sik
SCOPUS
1초록
Recent advancements have shown that Large Language Models (LLMs) possess significant versatility, making them suitable for applications in many areas. Several studies have shown how general-purpose LLMs can be adapted to domain-specific tasks. However, these domain-adapted LLMs can be exposed to greater privacy risks, which are especially exacerbated in the medical field. In this paper, we present the study investigating the susceptibility of LLMs to leaking sensitive health information. We conduct prompt-based attacks on LLMs trained with medical datasets, showing that medical LLMs can inadvertently disclose confidential patient data. To contribute towards mitigating privacy risks in the medical domain, we implement red teaming defense strategies to make LLMs robust against malicious attacks. For this medical red teaming approach, we develop and publicly release MediRed, a dataset of 1,000 red team attacks. By leveraging this dataset to enhance our defense mechanisms, we achieve up to 56% improvement in privacy protection compared to base models. Our code and dataset are available at https://github.com/yujinKang32/Private_Med_LLM.git.
키워드
- 제목
- Can Personal Health Information Be Secured in LLM? Privacy Attack and Defense in the Medical Domain
- 저자
- Kang, Yujin; Kim, Eunsun; Cho, Yoon-Sik
- 발행일
- 2025-10
- 유형
- Conference Paper
- 저널명
- CCS 2025 - Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security
- 페이지
- 4199 ~ 4213