Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks

Alireza Sadeghi; Gang Wang; Georgios B. Giannakis

doi:10.1109/TCCN.2019.2936193

Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks

Alireza Sadeghi, Gang Wang, Georgios B. Giannakis

Electrical and Computer Engineering

Research output: Contribution to journal › Article › peer-review

89 Scopus citations

Abstract

Caching is envisioned to play a critical role in next-generation content delivery infrastructure, cellular networks, and Internet architectures. By smartly storing the most popular contents at the storage-enabled network entities during off-peak demand instances, caching can benefit both network infrastructure as well as end users, during on-peak periods. In this context, distributing the limited storage capacity across network entities calls for decentralized caching schemes. Many practical caching systems involve a parent caching node connected to multiple leaf nodes to serve user file requests. To model the two-way interactive influence between caching decisions at the parent and leaf nodes, a reinforcement learning (RL) framework is put forth. To handle the large continuous state space, a scalable deep RL approach is pursued. The novel approach relies on a hyper-deep Q-network to learn the Q-function, and thus the optimal caching policy, in an online fashion. Reinforcing the parent node with ability to learn-and-adapt to unknown policies of leaf nodes as well as spatio-temporal dynamic evolution of file requests, results in remarkable caching performance, as corroborated through numerical tests.

Original language	English (US)
Article number	8807260
Pages (from-to)	1024-1033
Number of pages	10
Journal	IEEE Transactions on Cognitive Communications and Networking
Volume	5
Issue number	4
DOIs	https://doi.org/10.1109/TCCN.2019.2936193
State	Published - Dec 2019

Bibliographical note

Funding Information:
Manuscript received February 26, 2019; revised June 30, 2019; accepted August 10, 2019. Date of publication August 20, 2019; date of current version December 12, 2019. This work was supported in part by NSF grants 1711471, 1514056, and 1901134. The associate editor coordinating the review of this article and approving it for publication was H. T. Dinh. (Corresponding author: Gang Wang.) The authors are with the Digital Technology Center, University of Minnesota, Minneapolis, MN 55455 USA, and also with the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455 USA (e-mail: sadeghi@umn.edu; gangwang@umn.edu; georgios@umn.edu). Digital Object Identifier 10.1109/TCCN.2019.2936193

Publisher Copyright:
© 2015 IEEE.

Keywords

Caching
deep Q-network
deep RL
function approximation
next-generation networks

Access

10.1109/TCCN.2019.2936193

OpenUrl availability

Full text

Cite this

@article{299bcb59f6144c1e92ecc5b0f54d6a4c,

title = "Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks",

abstract = "Caching is envisioned to play a critical role in next-generation content delivery infrastructure, cellular networks, and Internet architectures. By smartly storing the most popular contents at the storage-enabled network entities during off-peak demand instances, caching can benefit both network infrastructure as well as end users, during on-peak periods. In this context, distributing the limited storage capacity across network entities calls for decentralized caching schemes. Many practical caching systems involve a parent caching node connected to multiple leaf nodes to serve user file requests. To model the two-way interactive influence between caching decisions at the parent and leaf nodes, a reinforcement learning (RL) framework is put forth. To handle the large continuous state space, a scalable deep RL approach is pursued. The novel approach relies on a hyper-deep Q-network to learn the Q-function, and thus the optimal caching policy, in an online fashion. Reinforcing the parent node with ability to learn-and-adapt to unknown policies of leaf nodes as well as spatio-temporal dynamic evolution of file requests, results in remarkable caching performance, as corroborated through numerical tests.",

keywords = "Caching, deep Q-network, deep RL, function approximation, next-generation networks",

author = "Alireza Sadeghi and Gang Wang and Giannakis, {Georgios B.}",

note = "Funding Information: Manuscript received February 26, 2019; revised June 30, 2019; accepted August 10, 2019. Date of publication August 20, 2019; date of current version December 12, 2019. This work was supported in part by NSF grants 1711471, 1514056, and 1901134. The associate editor coordinating the review of this article and approving it for publication was H. T. Dinh. (Corresponding author: Gang Wang.) The authors are with the Digital Technology Center, University of Minnesota, Minneapolis, MN 55455 USA, and also with the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455 USA (e-mail: sadeghi@umn.edu; gangwang@umn.edu; georgios@umn.edu). Digital Object Identifier 10.1109/TCCN.2019.2936193 Publisher Copyright: {\textcopyright} 2015 IEEE.",

year = "2019",

month = dec,

doi = "10.1109/TCCN.2019.2936193",

language = "English (US)",

volume = "5",

pages = "1024--1033",

journal = "IEEE Transactions on Cognitive Communications and Networking",

issn = "2332-7731",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "4",

}

TY - JOUR

T1 - Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks

AU - Sadeghi, Alireza

AU - Wang, Gang

AU - Giannakis, Georgios B.

N1 - Funding Information: Manuscript received February 26, 2019; revised June 30, 2019; accepted August 10, 2019. Date of publication August 20, 2019; date of current version December 12, 2019. This work was supported in part by NSF grants 1711471, 1514056, and 1901134. The associate editor coordinating the review of this article and approving it for publication was H. T. Dinh. (Corresponding author: Gang Wang.) The authors are with the Digital Technology Center, University of Minnesota, Minneapolis, MN 55455 USA, and also with the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455 USA (e-mail: sadeghi@umn.edu; gangwang@umn.edu; georgios@umn.edu). Digital Object Identifier 10.1109/TCCN.2019.2936193 Publisher Copyright: © 2015 IEEE.

PY - 2019/12

Y1 - 2019/12

N2 - Caching is envisioned to play a critical role in next-generation content delivery infrastructure, cellular networks, and Internet architectures. By smartly storing the most popular contents at the storage-enabled network entities during off-peak demand instances, caching can benefit both network infrastructure as well as end users, during on-peak periods. In this context, distributing the limited storage capacity across network entities calls for decentralized caching schemes. Many practical caching systems involve a parent caching node connected to multiple leaf nodes to serve user file requests. To model the two-way interactive influence between caching decisions at the parent and leaf nodes, a reinforcement learning (RL) framework is put forth. To handle the large continuous state space, a scalable deep RL approach is pursued. The novel approach relies on a hyper-deep Q-network to learn the Q-function, and thus the optimal caching policy, in an online fashion. Reinforcing the parent node with ability to learn-and-adapt to unknown policies of leaf nodes as well as spatio-temporal dynamic evolution of file requests, results in remarkable caching performance, as corroborated through numerical tests.

AB - Caching is envisioned to play a critical role in next-generation content delivery infrastructure, cellular networks, and Internet architectures. By smartly storing the most popular contents at the storage-enabled network entities during off-peak demand instances, caching can benefit both network infrastructure as well as end users, during on-peak periods. In this context, distributing the limited storage capacity across network entities calls for decentralized caching schemes. Many practical caching systems involve a parent caching node connected to multiple leaf nodes to serve user file requests. To model the two-way interactive influence between caching decisions at the parent and leaf nodes, a reinforcement learning (RL) framework is put forth. To handle the large continuous state space, a scalable deep RL approach is pursued. The novel approach relies on a hyper-deep Q-network to learn the Q-function, and thus the optimal caching policy, in an online fashion. Reinforcing the parent node with ability to learn-and-adapt to unknown policies of leaf nodes as well as spatio-temporal dynamic evolution of file requests, results in remarkable caching performance, as corroborated through numerical tests.

KW - Caching

KW - deep Q-network

KW - deep RL

KW - function approximation

KW - next-generation networks

UR - http://www.scopus.com/inward/record.url?scp=85071539159&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85071539159&partnerID=8YFLogxK

U2 - 10.1109/TCCN.2019.2936193

DO - 10.1109/TCCN.2019.2936193

M3 - Article

AN - SCOPUS:85071539159

SN - 2332-7731

VL - 5

SP - 1024

EP - 1033

JO - IEEE Transactions on Cognitive Communications and Networking

JF - IEEE Transactions on Cognitive Communications and Networking

IS - 4

M1 - 8807260

ER -

Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this