Communication efficient matrix multiplication on hypercubes

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

8 Scopus citations

Abstract

In this paper we present an efficient dense matrix multiplication algorithm for distributed memory computers with a hypercube topology. The proposed algorithm performs better than all previously proposed algorithms for a wide range of matrix sizes and numbers of processors, especially for large matrices. We analyze the performance of the algorithms for two types of hypercube architectures: one in which each node can use (to send and receive) at most one communication link at a time, and one in which each node can use all communication links simultaneously.
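
As background for the two architecture types named in the abstract, the following minimal sketch shows how nodes in a hypercube are connected and what a one-link-at-a-time broadcast schedule can look like. It is not the paper's matrix multiplication algorithm. The function names `hypercube_neighbors` and `one_port_broadcast_schedule` are hypothetical, and the broadcast shown is the standard binomial-tree scheme, chosen here only for illustration.

```python
def hypercube_neighbors(node, dim):
    """Return the neighbors of `node` in a `dim`-dimensional hypercube.

    Nodes are labeled 0 .. 2**dim - 1; two nodes are linked exactly when
    their binary labels differ in one bit.
    """
    if not 0 <= node < (1 << dim):
        raise ValueError("node label out of range")
    return [node ^ (1 << k) for k in range(dim)]


def one_port_broadcast_schedule(root, dim):
    """Binomial-tree broadcast for the one-link-at-a-time model (illustrative).

    In step k, every node that already holds the data sends it across
    dimension k, so each node uses at most one link per step.
    Returns a list of steps; each step is a list of (source, destination).
    """
    have = [root]
    schedule = []
    for k in range(dim):
        sends = [(src, src ^ (1 << k)) for src in have]
        schedule.append(sends)
        have.extend(dst for _, dst in sends)
    return schedule


if __name__ == "__main__":
    print(hypercube_neighbors(5, 3))
    for step, sends in enumerate(one_port_broadcast_schedule(0, 3)):
        print(step, sends)
```

Under the one-link model this broadcast needs `dim` steps, one per dimension. A node that can drive all of its links simultaneously could instead send to every entry of `hypercube_neighbors(node, dim)` in the same step, which is the kind of difference the abstract's two-model analysis accounts for.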

Original language: English
Title of host publication: Proceedings of the 6th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1994
Publisher: Association for Computing Machinery, Inc
Pages: 320-329
Number of pages: 10
ISBN (Electronic): 0897916719, 9780897916714
State: Published - Aug 1 1994
Event: 6th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1994 - Cape May, United States
Duration: Jun 27 1994 → Jun 29 1994

Publication series

Name: Proceedings of the 6th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1994

Conference

Conference: 6th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1994
Country/Territory: United States
City: Cape May
Period: 06/27/94 → 06/29/94

Keywords

  • 3-D grids
  • Distributed algorithms
  • Hypercubes
  • Interprocessor communication
  • Matrix multiplication
