Multi-User Adaptive Video Delivery over Wireless Networks: A Physical Layer Resource-Aware Deep Reinforcement Learning Approach

Kexin Tang; Nuowen Kan; Junni Zou; Chenglin Li; Xiao Fu; Mingyi Hong; Hongkai Xiong

doi:10.1109/TCSVT.2020.2980587

Multi-User Adaptive Video Delivery over Wireless Networks: A Physical Layer Resource-Aware Deep Reinforcement Learning Approach

Kexin Tang, Nuowen Kan, Junni Zou, Chenglin Li, Xiao Fu, Mingyi Hong, Hongkai Xiong

Research output: Contribution to journal › Article › peer-review

15 Scopus citations

Abstract

In this paper, we investigate the adaptive video delivery for multiple users over time-varying and mutually interfering multi-cell wireless networks. The key research challenge is to jointly design the physical-layer resource allocation scheme and application-layer rate adaptation logic, such that the users' long-term fair quality of experience (QoE) can be maximized. Due to the timescale mismatch between these two layers and the asynchrony of user requests, however, it is difficult to directly model the cross-layer stochastic control problem by using a reinforcement learning framework. To address this difficulty, we propose a novel two-level decision framework where an optimization-based beamforming scheme (performed at the base stations) and a deep reinforcement learning (DRL)-based rate adaptation scheme (performed at the user terminals) are, respectively, developed, such that a highly complex long-term multi-user QoE fairness problem is decomposed into some relatively simple problems and solved effectively. Our strategy represents a significant departure from the existing schemes with consideration of either a short-term multi-user QoE maximization or a long-term single-user point-to-point QoE maximization. Extensive simulations demonstrate that the proposed cross-layer design is effective and promising.

Original language	English (US)
Article number	9035396
Pages (from-to)	798-815
Number of pages	18
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	31
Issue number	2
DOIs	https://doi.org/10.1109/TCSVT.2020.2980587
State	Published - Feb 2021
Externally published	Yes

Bibliographical note

Funding Information:
Manuscript received November 6, 2019; revised January 26, 2020 and February 28, 2020; accepted March 2, 2020. Date of publication March 13, 2020; date of current version February 4, 2021. This work was supported in part by the National Natural Science Foundation of China under Grant 61831018, Grant 61931023, Grant 61871267, Grant 61972256, and Grant 61720106001, and in part by the China Scholarship Council. The work of Mingyi Hong was supported in part by the National Science Foundation under Grant CIF-1910385 and in part by the Army Research Office under Grant 73202-CS. This article was recommended by Associate Editor C. Wu. (Corresponding author: Chenglin Li.) Kexin Tang, Nuowen Kan, Junni Zou, Chenglin Li, and Hongkai Xiong are with the School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China (e-mail: tkx1994-china@sjtu.edu.cn; kannw_1230@sjtu.edu.cn; zou-jn@cs.sjtu. edu.cn; lcl1985@sjtu.edu.cn; xionghongkai@sjtu.edu.cn).

Publisher Copyright:
© 1991-2012 IEEE.

Keywords

Wireless video streaming
beamforming
cross-layer design
rate adaptation
reinforcement learning

Access

10.1109/TCSVT.2020.2980587

OpenUrl availability

Full text

Cite this

@article{abf600a7f1024900a1a0ac8f0ba76b75,

title = "Multi-User Adaptive Video Delivery over Wireless Networks: A Physical Layer Resource-Aware Deep Reinforcement Learning Approach",

abstract = "In this paper, we investigate the adaptive video delivery for multiple users over time-varying and mutually interfering multi-cell wireless networks. The key research challenge is to jointly design the physical-layer resource allocation scheme and application-layer rate adaptation logic, such that the users' long-term fair quality of experience (QoE) can be maximized. Due to the timescale mismatch between these two layers and the asynchrony of user requests, however, it is difficult to directly model the cross-layer stochastic control problem by using a reinforcement learning framework. To address this difficulty, we propose a novel two-level decision framework where an optimization-based beamforming scheme (performed at the base stations) and a deep reinforcement learning (DRL)-based rate adaptation scheme (performed at the user terminals) are, respectively, developed, such that a highly complex long-term multi-user QoE fairness problem is decomposed into some relatively simple problems and solved effectively. Our strategy represents a significant departure from the existing schemes with consideration of either a short-term multi-user QoE maximization or a long-term single-user point-to-point QoE maximization. Extensive simulations demonstrate that the proposed cross-layer design is effective and promising.",

keywords = "Wireless video streaming, beamforming, cross-layer design, rate adaptation, reinforcement learning",

author = "Kexin Tang and Nuowen Kan and Junni Zou and Chenglin Li and Xiao Fu and Mingyi Hong and Hongkai Xiong",

note = "Funding Information: Manuscript received November 6, 2019; revised January 26, 2020 and February 28, 2020; accepted March 2, 2020. Date of publication March 13, 2020; date of current version February 4, 2021. This work was supported in part by the National Natural Science Foundation of China under Grant 61831018, Grant 61931023, Grant 61871267, Grant 61972256, and Grant 61720106001, and in part by the China Scholarship Council. The work of Mingyi Hong was supported in part by the National Science Foundation under Grant CIF-1910385 and in part by the Army Research Office under Grant 73202-CS. This article was recommended by Associate Editor C. Wu. (Corresponding author: Chenglin Li.) Kexin Tang, Nuowen Kan, Junni Zou, Chenglin Li, and Hongkai Xiong are with the School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China (e-mail: tkx1994-china@sjtu.edu.cn; kannw_1230@sjtu.edu.cn; zou-jn@cs.sjtu. edu.cn; lcl1985@sjtu.edu.cn; xionghongkai@sjtu.edu.cn). Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2021",

month = feb,

doi = "10.1109/TCSVT.2020.2980587",

language = "English (US)",

volume = "31",

pages = "798--815",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "2",

}

TY - JOUR

T1 - Multi-User Adaptive Video Delivery over Wireless Networks

T2 - A Physical Layer Resource-Aware Deep Reinforcement Learning Approach

AU - Tang, Kexin

AU - Kan, Nuowen

AU - Zou, Junni

AU - Li, Chenglin

AU - Fu, Xiao

AU - Hong, Mingyi

AU - Xiong, Hongkai

N1 - Funding Information: Manuscript received November 6, 2019; revised January 26, 2020 and February 28, 2020; accepted March 2, 2020. Date of publication March 13, 2020; date of current version February 4, 2021. This work was supported in part by the National Natural Science Foundation of China under Grant 61831018, Grant 61931023, Grant 61871267, Grant 61972256, and Grant 61720106001, and in part by the China Scholarship Council. The work of Mingyi Hong was supported in part by the National Science Foundation under Grant CIF-1910385 and in part by the Army Research Office under Grant 73202-CS. This article was recommended by Associate Editor C. Wu. (Corresponding author: Chenglin Li.) Kexin Tang, Nuowen Kan, Junni Zou, Chenglin Li, and Hongkai Xiong are with the School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China (e-mail: tkx1994-china@sjtu.edu.cn; kannw_1230@sjtu.edu.cn; zou-jn@cs.sjtu. edu.cn; lcl1985@sjtu.edu.cn; xionghongkai@sjtu.edu.cn). Publisher Copyright: © 1991-2012 IEEE.

PY - 2021/2

Y1 - 2021/2

N2 - In this paper, we investigate the adaptive video delivery for multiple users over time-varying and mutually interfering multi-cell wireless networks. The key research challenge is to jointly design the physical-layer resource allocation scheme and application-layer rate adaptation logic, such that the users' long-term fair quality of experience (QoE) can be maximized. Due to the timescale mismatch between these two layers and the asynchrony of user requests, however, it is difficult to directly model the cross-layer stochastic control problem by using a reinforcement learning framework. To address this difficulty, we propose a novel two-level decision framework where an optimization-based beamforming scheme (performed at the base stations) and a deep reinforcement learning (DRL)-based rate adaptation scheme (performed at the user terminals) are, respectively, developed, such that a highly complex long-term multi-user QoE fairness problem is decomposed into some relatively simple problems and solved effectively. Our strategy represents a significant departure from the existing schemes with consideration of either a short-term multi-user QoE maximization or a long-term single-user point-to-point QoE maximization. Extensive simulations demonstrate that the proposed cross-layer design is effective and promising.

AB - In this paper, we investigate the adaptive video delivery for multiple users over time-varying and mutually interfering multi-cell wireless networks. The key research challenge is to jointly design the physical-layer resource allocation scheme and application-layer rate adaptation logic, such that the users' long-term fair quality of experience (QoE) can be maximized. Due to the timescale mismatch between these two layers and the asynchrony of user requests, however, it is difficult to directly model the cross-layer stochastic control problem by using a reinforcement learning framework. To address this difficulty, we propose a novel two-level decision framework where an optimization-based beamforming scheme (performed at the base stations) and a deep reinforcement learning (DRL)-based rate adaptation scheme (performed at the user terminals) are, respectively, developed, such that a highly complex long-term multi-user QoE fairness problem is decomposed into some relatively simple problems and solved effectively. Our strategy represents a significant departure from the existing schemes with consideration of either a short-term multi-user QoE maximization or a long-term single-user point-to-point QoE maximization. Extensive simulations demonstrate that the proposed cross-layer design is effective and promising.

KW - Wireless video streaming

KW - beamforming

KW - cross-layer design

KW - rate adaptation

KW - reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85100550480&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85100550480&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2020.2980587

DO - 10.1109/TCSVT.2020.2980587

M3 - Article

AN - SCOPUS:85100550480

SN - 1051-8215

VL - 31

SP - 798

EP - 815

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 2

M1 - 9035396

ER -

Multi-User Adaptive Video Delivery over Wireless Networks: A Physical Layer Resource-Aware Deep Reinforcement Learning Approach

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this