상세 보기
- 김세진;
- 진계신;
- 염헌영;
- 김윤희
WEB OF SCIENCE
0SCOPUS
0초록
As Graphics Processing Units (GPUs) are widely utilized to accelerate compute-intensive applications, their application has expanded especially in data centers and clouds. However, the existing resource sharing methods within GPU are limited and cannot efficiently handle several requests of concurrent cloud users’ executions on GPU while effectively utilizing the available system resources. In addition, it is challenging to effectively partition resources within GPU without under-standing and assimilating application execution patterns. This paper proposes an execution pattern- based application classification method and analyzes run-time characteristics: why the performance of an application is saturated at a point regardless of the allocated resources. In addition, we analyze the multitasking performance of the co-allocated applications using smCompactor, a thread block-based scheduling framework. We identify near-best co-allocated application sets, which effectively utilize the available system resources. Based on our results, there was a performance improvement of approxi-mately 28% compared to NVIDIA MPS.
키워드
- 제목
- GPU의 효율적인 자원 활용을 위한 동시 멀티태스킹 성능 분석
- 제목 (타언어)
- Performance Analysis of Concurrent Multitasking for Efficient Resource Utilization of GPUs
- 저자
- 김세진; 진계신; 염헌영; 김윤희
- 발행일
- 2021-06
- 저널명
- 정보과학회논문지
- 권
- 48
- 호
- 6
- 페이지
- 604 ~ 611