이기종 클러스터 환경의 딥러닝 응용을 위한 GPU 공유 기법 분석
An Analysis of GPU Sharing Methods for Deep Learning Applications in a Heterogeneous Cluster Environment
  • 하지원
  • 김서영
  • 테오도라 아두푸
  • 엄현상
  • 김윤희
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

GPU sharing technologies are being implemented to address the low resource utilization often seen in deep learning applications and to enhance performance in heterogeneous GPU clusters. However, there has been considerably less research on the effective use of time-slicing, Multi-Process Service (MPS), and Multi-Instance GPU (MIG) sharing technologies in relation to the specific characteristics of deep learning applications across different GPU architectures. This paper proposes optimal task placement criteria for these GPU sharing technologies by analyzing the execution scale, resource sensitivity, and resource utilization of tasks based on various deep learning models and parameters. In a heterogeneous GPU cluster environment that supports multiple sharing technologies, this study tested these optimal task placement criteria, resulting in reduced execution time and improved throughput.

키워드

heterogeneous GPUsapplication profilingKubernetesMPSMIGtime-slicing이기종 GPU응용 프로파일링쿠버네티스MPSMIGtime-slicing
제목
이기종 클러스터 환경의 딥러닝 응용을 위한 GPU 공유 기법 분석
제목 (타언어)
An Analysis of GPU Sharing Methods for Deep Learning Applications in a Heterogeneous Cluster Environment
저자
하지원김서영테오도라 아두푸엄현상김윤희
DOI
10.5626/KTCP.2025.31.1.25
발행일
2025-01
저널명
정보과학회 컴퓨팅의 실제 논문지
31
1
페이지
25 ~ 33