신경망 보코더 기반 위조 음성 데이터가 위조 탐지 모형 성능에 미치는 영향 분석
Impact of neural-vocoder–generated spoofed speech on spoofing-detection performance
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Recently, significant research efforts have been directed toward enhancing the performance of voice spoofing detection systems by employing neural vocoders as a data augmentation technique. In this study, we investigated the impact of training spoofing detection models with spoofed speech data generated through neural vocoders. To this end, we constructed training datasets by applying four different pre-trained vocoders to the data from the Automatic Speaker Verification Spoofing and Countermeasures (ASVspoof) Logical Access (LA) track. We employed a Self-Supervised Learning (SSL)-based Countermeasure (CM) model and evaluated its detection performance across four distinct datasets. For the ASVspoof 2019 LA dataset—representing the training domain—performance gains were observed in certain vocoder configurations; however, these improvements were inconsistent across various vocoder types or their combinations. Conversely, substantial performance improvements of approximately 27\% and 47\% were achieved on external evaluation datasets, specifically WaveFake and In-the-Wild, which were not utilized during training. These findings emphasize the importance of appropriate vocoder selection in enhancing the generalization capability of spoofing detection models and demonstrate that vocoder-generated spoofed speech can significantly boost model performance.

키워드

딥러닝음성 합성위조 음성 탐지신경망 기반 보코더deep learningspeech synthesisspoof detectionneural vocoder
제목
신경망 보코더 기반 위조 음성 데이터가 위조 탐지 모형 성능에 미치는 영향 분석
제목 (타언어)
Impact of neural-vocoder–generated spoofed speech on spoofing-detection performance
저자
박서영곽일엽
DOI
10.5351/KJAS.2025.38.5.679
발행일
2025-10
유형
Article
저널명
응용통계연구
38
5
페이지
679 ~ 692