TY - GEN
T1 - Communication efficient matrix multiplication on hypercubes
AU - Gupta, Himanshu
AU - Sadayappan, P.
N1 - Publisher Copyright: © 1994 ACM.
PY - 1994/8/1
Y1 - 1994/8/1
AB - In this paper we present an efficient dense matrix multiplication algorithm for distributed memory computers with a hypercube topology. The proposed algorithm performs better than all previously proposed algorithms for a wide range of matrix sizes and number of processors, especially for large matrices. We analyze the performance of the algorithms for two types of hypercube architectures, one in which each node can use (to send and receive) at most one communication link at a time and the other in which each node can use all communication links simultaneously.
KW - 3-D grids
KW - Distributed algorithms
KW - Hypercubes
KW - Interprocessor communication
KW - Matrix multiplication
UR - https://www.scopus.com/pages/publications/0039066274
U2 - 10.1145/181014.181434
DO - 10.1145/181014.181434
M3 - Conference contribution
T3 - Proceedings of the 6th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1994
SP - 320
EP - 329
BT - Proceedings of the 6th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1994
PB - Association for Computing Machinery, Inc
T2 - 6th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1994
Y2 - 27 June 1994 through 29 June 1994
ER -