상세 보기
- Park, Junhyeong;
- Kim, Jinsung
SCOPUS
0초록
In recent years, the rapid advancement of deep learning technologies in the field of Natural Language Processing (NLP) has led to the emergence of large language models (LLM). While these large language models have demonstrated exceptional performance across various tasks, they require huge computational resources due to their massive parameter sizes. To address the high operational costs associated with training and deploying these models, techniques such as pruning and mixed-precision computation have been proposed. Pruning reduces the number and size of model parameters while mixed-precision computation optimizes computational speed and memory usage. This study evaluates the impact of mixed-precision techniques on the performance of LLM when applied at various levels of sparsity induced by pruning.
키워드
- 제목
- Comparative Analysis of Model Accuracy Using Mixed-Precision Techniques at Various Levels of Sparsity
- 저자
- Park, Junhyeong; Kim, Jinsung
- 발행일
- 2024-10
- 유형
- Conference paper
- 저널명
- International Conference on ICT Convergence
- 권
- 2024
- 페이지
- 1024 ~ 1025