Comparative Analysis of Model Accuracy Using Mixed-Precision Techniques at Various Levels of Sparsity

Park, Junhyeong; Kim, Jinsung

doi:10.1109/ICTC62082.2024.10826856

상세 보기

Comparative Analysis of Model Accuracy Using Mixed-Precision Techniques at Various Levels of Sparsity

Park, Junhyeong;
Kim, Jinsung

Citations

SCOPUS

0

초록

In recent years, the rapid advancement of deep learning technologies in the field of Natural Language Processing (NLP) has led to the emergence of large language models (LLM). While these large language models have demonstrated exceptional performance across various tasks, they require huge computational resources due to their massive parameter sizes. To address the high operational costs associated with training and deploying these models, techniques such as pruning and mixed-precision computation have been proposed. Pruning reduces the number and size of model parameters while mixed-precision computation optimizes computational speed and memory usage. This study evaluates the impact of mixed-precision techniques on the performance of LLM when applied at various levels of sparsity induced by pruning.

키워드

LLM; Mixed-precision; Pruning; Sparsity

제목: Comparative Analysis of Model Accuracy Using Mixed-Precision Techniques at Various Levels of Sparsity

저자: Park, Junhyeong; Kim, Jinsung

DOI: 10.1109/ICTC62082.2024.10826856

발행일: 2024-10

유형: Conference paper

저널명: International Conference on ICT Convergence

권: 2024

페이지: 1024 ~ 1025