신경망 보코더 기반 위조 음성 데이터가 위조 탐지 모형 성능에 미치는 영향 분석

박서영; 곽일엽

doi:10.5351/KJAS.2025.38.5.679

상세 보기

신경망 보코더 기반 위조 음성 데이터가 위조 탐지 모형 성능에 미치는 영향 분석

Impact of neural-vocoder–generated spoofed speech on spoofing-detection performance

박서영;
곽일엽

Citations

WEB OF SCIENCE

0

초록

Recently, significant research efforts have been directed toward enhancing the performance of voice spoofing detection systems by employing neural vocoders as a data augmentation technique. In this study, we investigated the impact of training spoofing detection models with spoofed speech data generated through neural vocoders. To this end, we constructed training datasets by applying four different pre-trained vocoders to the data from the Automatic Speaker Verification Spoofing and Countermeasures (ASVspoof) Logical Access (LA) track. We employed a Self-Supervised Learning (SSL)-based Countermeasure (CM) model and evaluated its detection performance across four distinct datasets. For the ASVspoof 2019 LA dataset—representing the training domain—performance gains were observed in certain vocoder configurations; however, these improvements were inconsistent across various vocoder types or their combinations. Conversely, substantial performance improvements of approximately 27\% and 47\% were achieved on external evaluation datasets, specifically WaveFake and In-the-Wild, which were not utilized during training. These findings emphasize the importance of appropriate vocoder selection in enhancing the generalization capability of spoofing detection models and demonstrate that vocoder-generated spoofed speech can significantly boost model performance.

키워드

딥러닝; 음성 합성; 위조 음성 탐지; 신경망 기반 보코더; deep learning; speech synthesis; spoof detection; neural vocoder

제목: 신경망 보코더 기반 위조 음성 데이터가 위조 탐지 모형 성능에 미치는 영향 분석

제목 (타언어): Impact of neural-vocoder–generated spoofed speech on spoofing-detection performance

저자: 박서영; 곽일엽

DOI: 10.5351/KJAS.2025.38.5.679

발행일: 2025-10

유형: Article

저널명: 응용통계연구

권: 38

호: 5

페이지: 679 ~ 692