Skip to main navigation Skip to search Skip to main content

Autoscale: Dynamic, robust capacity management for multi-tier data centers

  • Anshul Gandhi
  • , Mor Harchol-Balter
  • , Ram Raghunathan
  • , Michael A. Kozuch
  • Carnegie Mellon University
  • Intel

Research output: Contribution to journalArticlepeer-review

251 Scopus citations

Abstract

Energy costs for data centers continue to rise, already exceeding $15 billion yearly. Sadly much of this power is wasted. Servers are only busy 10-30% of the time on average, but they are often left on, while idle, utilizing 60% or more of peak power when in the idle state. We introduce a dynamic capacity management policy, AutoScale, that greatly reduces the number of servers needed in data centers driven by unpredictable, time-varying load, while meeting response time SLAs. AutoScale scales the data center capacity, adding or removing servers as needed. AutoScale has two key features: (i) it autonomically maintains just the right amount of spare capacity to handle bursts in the request rate; and (ii) it is robust not just to changes in the request rate of real-world traces, but also request size and server efficiency. We evaluate our dynamic capacity management approach via implementation on a 38-server multi-tier data center, serving a web site of the type seen in Facebook or Amazon, with a key-value store workload. We demonstrate that AutoScale vastly improves upon existing dynamic capacity management policies with respect to meeting SLAs and robustness.

Original languageEnglish
Article number14
JournalACM Transactions on Computer Systems
Volume30
Issue number4
DOIs
StatePublished - Nov 2012

Keywords

  • Data centers
  • Power management
  • Resource provisioning

Fingerprint

Dive into the research topics of 'Autoscale: Dynamic, robust capacity management for multi-tier data centers'. Together they form a unique fingerprint.

Cite this