Skip to main navigation Skip to search Skip to main content

Managing High-Bandwidth Memory is a Parallel Scheduling Problem (full paper only)

  • Washington University St. Louis
  • University of Pittsburgh
  • Carnegie Mellon University
  • Columbia University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

High-Bandwidth Memory (HBM) is a decade-old memory technology that is increasingly commonly being used in highly-parallel machines such as GPUs and multicores. Comparatively, HBM has higher bandwidth, smaller capacity, and similar latency to other DRAM technologies. Many systems use both HBM and other DRAM technologies, where HBM is naturally closer to the processor in the conceptual memory hierarchy. Thus, a natural resulting question is how one should best manage a collection of processes running on a HBM/DRAM memory hierarchy. Prior work introduced a theoretical model for addressing this question, and gave a competitive policy for the objective of minimizing makespan. Our main technical contribution is to give a competitive policy for the more commonly appropriate total/average response/completion time objective. However, we believe the broader, and more important contribution, is to make explicit the case (hinted at in the prior literature) that managing an HBM/DRAM hierarchy should be thought of as a parallel scheduling problem. To that end, we introduce a new online scheduling model that we call the semi-normal model. We then show how to use a competitive algorithm for scheduling in the semi-normal model as a black box to obtain a competitive algorithm for managing a HBM/DRAM memory hierarchy. Thus, as a result of this black-box conversion, competitiveness results in the semi-normal model translate (essentially) automatically into competitiveness results in the HBM/DRAM management model. Our main technical result is then an application of such a translation. That is, we show that a natural variant of the Round Robin (processor sharing) algorithm, naturally adapted for the seminormal model, is competitive for the objective of average/total completion time. Thus, we obtain an algorithm for managing a HBM/DRAM hierarchy that is competitive for the objective of average/total completion time, using this black-box reduction.

Original languageEnglish
Title of host publicationSPAA 2025 - Proceedings of the 2025 37th ACM Symposium on Parallelism in Algorithms and Architectures
PublisherAssociation for Computing Machinery
Pages171-180
Number of pages10
ISBN (Electronic)9798400712586
DOIs
StatePublished - Jul 16 2025
Event37th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2025 - Portland, United States
Duration: Jul 28 2025Aug 1 2025

Publication series

NameAnnual ACM Symposium on Parallelism in Algorithms and Architectures

Conference

Conference37th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2025
Country/TerritoryUnited States
CityPortland
Period07/28/2508/1/25

Keywords

  • Approximation algorithms
  • High-bandwidth memory
  • Multicore paging
  • Online algorithms
  • Paging
  • Scheduling

Fingerprint

Dive into the research topics of 'Managing High-Bandwidth Memory is a Parallel Scheduling Problem (full paper only)'. Together they form a unique fingerprint.

Cite this