Dataset condensation with coarse-to-fine regularization

Abstract

State-of-the-art artificial intelligence models rely heavily on datasets with large numbers of samples, which require substantial memory for data storage and incur high computational costs for model training. To alleviate these storage and computational overheads, dataset condensation has recently gained attention. This approach condenses a large sample set into a more compact one while preserving the accuracy of a network trained on the entire sample set. Existing methods focus on aligning the output logits or network parameters of networks trained on synthetic images with those of networks trained on real images. However, these approaches fail to capture diverse information because they do not account for relationships between synthetic images, which leads to information redundancy across synthetic images. To address these issues, we exploit the relationships among synthetic samples. This allows us to create diverse representations of synthetic images across distinct classes and to encourage diversity within the same class. We further promote diverse representations between synthetic image sub-regions. Experimental results on various datasets demonstrate that our method outperforms competitors by up to 12.2%. Moreover, networks that were not encountered during the condensation process, when trained on our synthesized dataset, outperform those trained with other methods. © 2025 Elsevier B.V.
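
The redundancy problem described in the abstract can be illustrated with a small numerical sketch. The code below is not the authors' loss: it is a generic pairwise cosine-similarity penalty, and the function names (`pairwise_cosine`, `redundancy_penalty`) and the toy feature vectors are assumptions introduced only for illustration. It shows how a set of near-identical synthetic features scores as more redundant than a spread-out set.

```python
import numpy as np


def pairwise_cosine(features):
    """Cosine similarity between every pair of rows (hypothetical helper)."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.clip(norms, 1e-12, None)  # guard against zero vectors
    return unit @ unit.T


def redundancy_penalty(features):
    """Mean off-diagonal cosine similarity; lower values mean more diverse rows.

    Illustrative stand-in for a diversity regularizer, not the paper's method.
    Expects at least two rows.
    """
    n = features.shape[0]
    sim = pairwise_cosine(features)
    off_diag = sim[~np.eye(n, dtype=bool)]  # drop self-similarities
    return float(off_diag.mean())


# Toy feature vectors standing in for embeddings of synthetic images.
redundant = np.array([[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]])
diverse = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]])

print(redundancy_penalty(redundant))
print(redundancy_penalty(diverse))
```

In a condensation loop, a term of this kind would be added to the matching objective so that synthetic images are pushed apart in feature space. The abstract indicates that such relationships are considered across classes, within a class, and between image sub-regions, but it does not give the exact formulation.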

Keywords

Dataset condensation; Representation learning
Title
Dataset condensation with coarse-to-fine regularization
Authors
Jin, Hyundong; Kim, Eunwoo
DOI
10.1016/j.patrec.2024.12.018
Publication date
2025-02
Type
Article
Journal
Pattern Recognition Letters, vol. 188
Pages
178-184