TY - GEN
T1 - Learning from optimal caching for content delivery
AU - Yan, Gang
AU - Li, Jian
AU - Towsley, Don
N1 - Publisher Copyright: © 2021 ACM.
PY - 2021/12/2
Y1 - 2021/12/2
N2 - Content delivery networks (CDNs) distribute much of today's Internet traffic by caching and serving users' contents requested. A major goal of a CDN is to improve hit probabilities of its caches, thereby reducing WAN traffic and user-perceived latency. In this paper, we develop a new approach for caching in CDNs that learns from optimal caching for decision making. To attain this goal, we first propose HRO to compute the upper bound on optimal caching in an online manner, and then leverage HRO to inform future content admission and eviction. We call this new cache design LHR. We show that LHR is efficient since it includes a detection mechanism for model update, an auto-tuned threshold-based model for content admission with a simple eviction rule. We have implemented an LHR simulator as well as a prototype within an Apache Traffic Server and the Caffeine, respectively. Our experimental results using four production CDN traces show that LHR consistently outperforms state of the arts with an increase in hit probability of up to 9% and a reduction in WAN traffic of up to 15% compared to a typical production CDN cache. Our evaluation of the LHR prototype shows that it only imposes a moderate overhead and can be deployed on today's CDN servers.
AB - Content delivery networks (CDNs) distribute much of today's Internet traffic by caching and serving users' contents requested. A major goal of a CDN is to improve hit probabilities of its caches, thereby reducing WAN traffic and user-perceived latency. In this paper, we develop a new approach for caching in CDNs that learns from optimal caching for decision making. To attain this goal, we first propose HRO to compute the upper bound on optimal caching in an online manner, and then leverage HRO to inform future content admission and eviction. We call this new cache design LHR. We show that LHR is efficient since it includes a detection mechanism for model update, an auto-tuned threshold-based model for content admission with a simple eviction rule. We have implemented an LHR simulator as well as a prototype within an Apache Traffic Server and the Caffeine, respectively. Our experimental results using four production CDN traces show that LHR consistently outperforms state of the arts with an increase in hit probability of up to 9% and a reduction in WAN traffic of up to 15% compared to a typical production CDN cache. Our evaluation of the LHR prototype shows that it only imposes a moderate overhead and can be deployed on today's CDN servers.
UR - https://www.scopus.com/pages/publications/85121589866
U2 - 10.1145/3485983.3494855
DO - 10.1145/3485983.3494855
M3 - Conference contribution
T3 - CoNEXT 2021 - Proceedings of the 17th International Conference on emerging Networking EXperiments and Technologies
SP - 344
EP - 358
BT - CoNEXT 2021 - Proceedings of the 17th International Conference on emerging Networking EXperiments and Technologies
PB - Association for Computing Machinery, Inc
T2 - 17th ACM International Conference on emerging Networking EXperiments and Technologies, CoNEXT 2021
Y2 - 7 December 2021 through 10 December 2021
ER -