Comparative Analysis of Model Accuracy Using Mixed-Precision Techniques at Various Levels of Sparsity
Citations

SCOPUS

0

초록

In recent years, the rapid advancement of deep learning technologies in the field of Natural Language Processing (NLP) has led to the emergence of large language models (LLM). While these large language models have demonstrated exceptional performance across various tasks, they require huge computational resources due to their massive parameter sizes. To address the high operational costs associated with training and deploying these models, techniques such as pruning and mixed-precision computation have been proposed. Pruning reduces the number and size of model parameters while mixed-precision computation optimizes computational speed and memory usage. This study evaluates the impact of mixed-precision techniques on the performance of LLM when applied at various levels of sparsity induced by pruning.

키워드

LLMMixed-precisionPruningSparsity
제목
Comparative Analysis of Model Accuracy Using Mixed-Precision Techniques at Various Levels of Sparsity
저자
Park, JunhyeongKim, Jinsung
DOI
10.1109/ICTC62082.2024.10826856
발행일
2024-10
유형
Conference paper
저널명
International Conference on ICT Convergence
2024
페이지
1024 ~ 1025