GPU의 효율적인 자원 활용을 위한 동시 멀티태스킹 성능 분석
Performance Analysis of Concurrent Multitasking for Efficient Resource Utilization of GPUs
  • 김세진
  • 진계신
  • 염헌영
  • 김윤희
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

As Graphics Processing Units (GPUs) are widely utilized to accelerate compute-intensive applications, their application has expanded especially in data centers and clouds. However, the existing resource sharing methods within GPU are limited and cannot efficiently handle several requests of concurrent cloud users’ executions on GPU while effectively utilizing the available system resources. In addition, it is challenging to effectively partition resources within GPU without under-standing and assimilating application execution patterns. This paper proposes an execution pattern- based application classification method and analyzes run-time characteristics: why the performance of an application is saturated at a point regardless of the allocated resources. In addition, we analyze the multitasking performance of the co-allocated applications using smCompactor, a thread block-based scheduling framework. We identify near-best co-allocated application sets, which effectively utilize the available system resources. Based on our results, there was a performance improvement of approxi-mately 28% compared to NVIDIA MPS.

키워드

GPU멀티태스킹응용 분류스케줄링smCompactorGPUmultitaskingapplication classificationschedulingsmCompactor
제목
GPU의 효율적인 자원 활용을 위한 동시 멀티태스킹 성능 분석
제목 (타언어)
Performance Analysis of Concurrent Multitasking for Efficient Resource Utilization of GPUs
저자
김세진진계신염헌영김윤희
DOI
10.5626/JOK.2021.48.6.604
발행일
2021-06
저널명
정보과학회논문지
48
6
페이지
604 ~ 611