상세 보기
- Rizwan, Muhammad;
- Jung, Enoch;
- Park, Yoosang;
- Choi, Jaeyoung;
- Kim, Yoonhee
WEB OF SCIENCE
0SCOPUS
2초록
The most scientific and numerical problems can be solved using the system of equations in linear algebra. Matrix-matrix multiplication is the foundation of linear algebra equations, and its optimization has an impact on the overall performance of a system. ScaLAPACK has established itself as the industry standard for dense linear algebraic computations, developed 30 years ago. Owing to advancements in microprocessor architectures, it is difficult to fully utilize the hardware capabilities of legacy software systems on modern architectures and achieve the maximum performance. In this study, we analyzed the effects of matrix size, register blocking parameters, and thread distribution on the performance, and improved our previously implemented matrix-matrix multiplication routine for matrix-panel multiplication, which performed well for large-sized square matrices. We also presented the ScaLAPACK QR factorization performance by replacing the double-precision general matrix-matrix multiplication routine (DGEMM) of ScaLAPACK with our matrix-matrix multiplication routine for a single node Intel Xeon Phi Knights Landing processor. © 2022 IEEE.
키워드
- 제목
- Optimization of Matrix-Matrix Multiplication Algorithm for Matrix-Panel Multiplication on Intel KNL
- 저자
- Rizwan, Muhammad; Jung, Enoch; Park, Yoosang; Choi, Jaeyoung; Kim, Yoonhee
- 발행일
- 2022-12
- 유형
- Conference Paper
- 저널명
- Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA
- 권
- 2022-December