Optimization of Matrix-Matrix Multiplication Algorithm for Matrix-Panel Multiplication on Intel KNL
  • Rizwan, Muhammad
  • Jung, Enoch
  • Park, Yoosang
  • Choi, Jaeyoung
  • Kim, Yoonhee
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

2

초록

The most scientific and numerical problems can be solved using the system of equations in linear algebra. Matrix-matrix multiplication is the foundation of linear algebra equations, and its optimization has an impact on the overall performance of a system. ScaLAPACK has established itself as the industry standard for dense linear algebraic computations, developed 30 years ago. Owing to advancements in microprocessor architectures, it is difficult to fully utilize the hardware capabilities of legacy software systems on modern architectures and achieve the maximum performance. In this study, we analyzed the effects of matrix size, register blocking parameters, and thread distribution on the performance, and improved our previously implemented matrix-matrix multiplication routine for matrix-panel multiplication, which performed well for large-sized square matrices. We also presented the ScaLAPACK QR factorization performance by replacing the double-precision general matrix-matrix multiplication routine (DGEMM) of ScaLAPACK with our matrix-matrix multiplication routine for a single node Intel Xeon Phi Knights Landing processor. © 2022 IEEE.

키워드

AVX-512Intel Knights Landingmatrix-matrix multiplicationQR factorizationScaLAPACK
제목
Optimization of Matrix-Matrix Multiplication Algorithm for Matrix-Panel Multiplication on Intel KNL
저자
Rizwan, MuhammadJung, EnochPark, YoosangChoi, JaeyoungKim, Yoonhee
DOI
10.1109/AICCSA56895.2022.10017947
발행일
2022-12
유형
Conference Paper
저널명
Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA
2022-December