의생물학 Tabular 데이터에서 딥러닝과 전통적 머신러닝의 성능 비교
Benchmarking Deep Learning vs. Traditional Machine Learning on Biomedical Tabular Data
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

While deep learning has achieved revolutionary success in image and natural language processing, traditional gradient boosting-based machine learning (ML) models still dominate in the biomedical domain for tabular data. This study systematically evaluates the performance and efficiency of three ML models (XGBoost, LightGBM, CatBoost) and four deep learning (DL) models on five public biomedical datasets, applying identical preprocessing and hyperparameter tuning. Experimental results show that for small to medium-sized datasets (under 10,000 samples), ML models consistently demonstrated superior performance and speed. On large-scale datasets (over 200,000 samples), DL models showed comparable performance but with significantly decreased efficiency as the number of features increased. In conclusion, gradient boosting-based ML models remain a robust choice for most biomedical tabular problems, while Transformer-based DL models may offer limited benefits only when applied to very large datasets with sufficient computational resources.

키워드

Biomedical DataTabular DataDeep LearningBenchmarking.의생물학 데이터Tabular 데이터머신러닝딥러닝벤치마킹
제목
의생물학 Tabular 데이터에서 딥러닝과 전통적 머신러닝의 성능 비교
제목 (타언어)
Benchmarking Deep Learning vs. Traditional Machine Learning on Biomedical Tabular Data
저자
송채린이나현곽일엽
DOI
10.37727/jkdas.2025.27.5.1501
발행일
2025-10
유형
Y
저널명
Journal of The Korean Data Analysis Society
27
5
페이지
1501 ~ 1515