TY - GEN
T1 - Restless streaming bandits
T2 - 55th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2017
AU - Hosseini, S. Amir
AU - Panwar, Shivendra S.
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/1
Y1 - 2017/7/1
N2 - In this paper, we consider the problem of optimal scalable video delivery to mobile users in wireless networks given arbitrary Quality Adaptation (QA) mechanisms. In current practical systems, QA and wireless channel scheduling are performed independently by the content provider and network operator, respectively. While most research has been focused on jointly optimizing these two tasks, the high complexity that comes with a joint approach makes the implementation impractical. Therefore, we present a scheduling mechanism that takes the QA logic of each user as input and optimizes the scheduling accordingly. Hence, there is no need for centralized QA and cross-layer interactions are minimized. We model the QA-adaptive scheduling and the jointly optimal problem as a Restless Bandit and a Multi-user Semi Markov Decision Process, respectively in order to compare the loss incurred by not employing a jointly optimal scheme. We then present heuristic algorithms in order to achieve the optimal outcome of the Restless Bandit solution, assuming the base station has knowledge of, but no control over, the underlying quality adaptation of each user (QA-Aware). We also present a simplified heuristic without the need for any higher layer knowledge at the base station (QA-Blind). We show that our QA-Aware strategy can achieve up to two times improvement in network utilization compared to popular baseline algorithms such as Proportional Fairness.
AB - In this paper, we consider the problem of optimal scalable video delivery to mobile users in wireless networks given arbitrary Quality Adaptation (QA) mechanisms. In current practical systems, QA and wireless channel scheduling are performed independently by the content provider and network operator, respectively. While most research has been focused on jointly optimizing these two tasks, the high complexity that comes with a joint approach makes the implementation impractical. Therefore, we present a scheduling mechanism that takes the QA logic of each user as input and optimizes the scheduling accordingly. Hence, there is no need for centralized QA and cross-layer interactions are minimized. We model the QA-adaptive scheduling and the jointly optimal problem as a Restless Bandit and a Multi-user Semi Markov Decision Process, respectively in order to compare the loss incurred by not employing a jointly optimal scheme. We then present heuristic algorithms in order to achieve the optimal outcome of the Restless Bandit solution, assuming the base station has knowledge of, but no control over, the underlying quality adaptation of each user (QA-Aware). We also present a simplified heuristic without the need for any higher layer knowledge at the base station (QA-Blind). We show that our QA-Aware strategy can achieve up to two times improvement in network utilization compared to popular baseline algorithms such as Proportional Fairness.
UR - http://www.scopus.com/inward/record.url?scp=85047910086&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85047910086&partnerID=8YFLogxK
U2 - 10.1109/ALLERTON.2017.8262794
DO - 10.1109/ALLERTON.2017.8262794
M3 - Conference contribution
AN - SCOPUS:85047910086
T3 - 55th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2017
SP - 620
EP - 628
BT - 55th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2017
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 3 October 2017 through 6 October 2017
ER -