research-article

PAAS: a preference-aware deep reinforcement learning approach for 360° video streaming

Authors:
Chenglei Wu

Beijing Key Lab of Networked Multimedia

Beijing Key Lab of Networked Multimedia
View Profile

,
Zhi Wang

Tsinghua University

Tsinghua University
View Profile

,
Lifeng Sun

Beijing Key Lab of Networked Multimedia and Tsinghua University and Ministry of Education, China

Beijing Key Lab of Networked Multimedia and Tsinghua University and Ministry of Education, China
View Profile

NOSSDAV '21: Proceedings of the 31st ACM Workshop on Network and Operating Systems Support for Digital Audio and VideoJuly 2021Pages 34–41https://doi.org/10.1145/3458306.3460995

Published:02 July 2021Publication History

NOSSDAV '21: Proceedings of the 31st ACM Workshop on Network and Operating Systems Support for Digital Audio and Video

Pages 34–41

ABSTRACT

Conventional tile-based 360° video streaming methods, including deep reinforcement learning (DRL) based, ignore the interactive nature of 360° video streaming and download tiles following fixed sequential orders, thus failing to respond to the user's head motion changes. We show that these existing solutions suffer from either the prefetch accuracy or the playback stability drop. Furthermore, these methods are constrained to serve only one fixed streaming preference, causing extra training overhead and the lack of generalization on unseen preferences. In this paper, we propose a dual-queue streaming framework, with accuracy and stability purposes respectively, to enable the DRL agent to determine and change the tile download order without incurring overhead. We also design a preference-aware DRL algorithm to incentivize the agent to learn preference-dependent ABR decisions efficiently. Compared with state-of-the-art DRL baselines, our method not only significantly improves the streaming quality, e.g., increasing the average streaming quality by 13.6% on a public dataset, but also demonstrates better performance and generalization under dynamic preferences, e.g., an average quality improvement of 19.9% on unseen preferences.

References

A. T. Nasrabadi, A. Mahzari, J. D. Beshay, and R. Prakash, "Adaptive 360-degree video streaming using scalable video coding," in Proceedings of the 2017 ACM on Multimedia Conference, ser. MM '17. New York, NY, USA: ACM, 2017, pp. 1689--1697. [Online]. Available Google ScholarDigital Library
C. Zhou, Z. Li, and Y. Liu, "A measurement study of oculus 360 degree video streaming," in Proceedings of the 8th ACM on Multimedia Systems Conference, ser. MMSys'17. New York, NY, USA: ACM, 2017, pp. 27--37. [Online]. Available Google ScholarDigital Library
L. Xie, X. Zhang, and Z. Guo, "Cls: A cross-user learning based system for improving qoe in 360-degree video adaptive streaming," in Proceedings of the 26th ACM international conference on Multimedia, 2018, pp. 564--572.Google Scholar
X. Corbillon, A. Devlic, G. Simon, and J. Chakareski, "Optimal set of 360-degree videos for viewport-adaptive streaming," in Proceedings of the 2017 ACM on Multimedia Conference, ser. MM '17. New York, NY, USA: ACM, 2017, pp. 943--951. [Online]. Available Google ScholarDigital Library
S. Petrangeli, V. Swaminathan, M. Hosseini, and F. De Turck, "An http/2-based adaptive streaming framework for 360 virtual reality videos," in Proceedings of the 2017 ACM on Multimedia Conference, ser. MM '17. New York, NY, USA: ACM, 2017, pp. 306--314. [Online]. Available Google ScholarDigital Library
M. Xiao, C. Zhou, Y. Liu, and S. Chen, "Optile: Toward optimal tiling in 360-degree video streaming," in Proceedings of the 2017 ACM on Multimedia Conference, ser. MM '17. New York, NY, USA: ACM, 2017, pp. 708--716. [Online]. Available Google ScholarDigital Library
L. Xie, Z. Xu, Y. Ban, X. Zhang, and Z. Guo, "360probdash: Improving qoe of 360 video streaming using tile-based http adaptive streaming," in Proceedings of the 2017 ACM on Multimedia Conference, ser. MM '17. New York, NY, USA: ACM, 2017, pp. 315--323. [Online]. Available Google ScholarDigital Library
C. Zhou, M. Xiao, and Y. Liu, "Clustile: Toward minimizing bandwidth in 360-degree video streaming," in IEEE INFOCOM 2018-IEEE Conference on Computer Communications. IEEE, 2018, pp. 962--970.Google Scholar
Y. Zhang, P. Zhao, K. Bian, Y. Liu, L. Song, and X. Li, "Drl360: 360-degree video streaming with deep reinforcement learning," in IEEE INFOCOM 2019-IEEE Conference on Computer Communications. IEEE, 2019.Google Scholar
J. Fu, X. Chen, Z. Zhang, S. Wu, and Z. Chen, "360srl: A sequential reinforcement learning approach for abr tile-based 360 video streaming," in 2019 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2019, pp. 290--295.Google Scholar
N. Kan, J. Zou, K. Tang, C. Li, N. Liu, and H. Xiong, "Deep reinforcement learning-based rate adaptation for adaptive 360-degree video streaming," in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2019, pp. 4030--4034.Google Scholar
M. Almquist, V. Almquist, V. Krishnamoorthi, N. Carlsson, and D. Eager, "The prefetch aggressiveness tradeoff in 360° video streaming," in Proceedings of the 9th ACM Multimedia Systems Conference, ser. MMSys '18. New York, NY, USA: ACM, 2018, pp. 258--269. [Online]. Available Google ScholarDigital Library
S. Natarajan and P. Tadepalli, "Dynamic preferences in multi-criteria reinforcement learning," in Proceedings of the 22nd international conference on Machine learning, 2005, pp. 601--608.Google Scholar
K. Van Moffaert and A. Nowé, "Multi-objective reinforcement learning using sets of pareto dominating policies," The Journal of Machine Learning Research, vol. 15, no. 1, pp. 3483--3512, 2014.Google ScholarDigital Library
R. Yang, X. Sun, and K. Narasimhan, "A generalized algorithm for multi-objective reinforcement learning and policy adaptation," in Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. deBuc, E. Fox, and R. Garnett, Eds. Curran Associates, Inc., 2019, pp. 14636--14647.Google Scholar
H. Mao, R. Netravali, and M. Alizadeh, "Neural adaptive video streaming with pensieve," in Proceedings of the Conference of the ACM Special Interest Group on Data Communication. ACM, 2017, pp. 197--210.Google Scholar
Z. Wang, T. Schaul, M. Hessel, H. Hasselt, M. Lanctot, and N. Freitas, "Dueling network architectures for deep reinforcement learning," in International conference on machine learning, 2016, pp. 1995--2003.Google Scholar
A. Abels, D. Roijers, T. Lenaerts, A. Nowé, and D. Steckelmacher, "Dynamic weights in multi-objective deep reinforcement learning," in International Conference on Machine Learning, 2019, pp. 11--20.Google Scholar
X. Yao, T. Huang, C. Wu, R.-X. Zhang, and L. Sun, "Adversarial feature alignment: Avoid catastrophic forgetting in incremental task lifelong learning," Neural computation, vol. 31, no. 11, pp. 2266--2291, 2019.Google ScholarDigital Library
I. J. Goodfellow, M. Mirza, D. Xiao, A. Courville, and Y. Bengio, "An empirical investigation of catastrophic forgetting in gradient-based neural networks," arXiv preprint arXiv:1312.6211, 2013.Google Scholar
T. Schaul, J. Quan, I. Antonoglou, and D. Silver, "Prioritized experience replay," arXiv preprint arXiv:1511.05952, 2015.Google Scholar
H. Riiser, P. Vigmostad, C. Griwodz, and P. Halvorsen, "Commute path bandwidth traces from 3g networks: analysis and applications," in Proceedings of the 4th ACM Multimedia Systems Conference, 2013, pp. 114--118.Google Scholar
C. Wu, Z. Tan, Z. Wang, and S. Yang, "A dataset for exploring user behaviors in vr spherical video streaming," in Proceedings of the 8th ACM on Multimedia Systems Conference, ser. MMSys'17. New York, NY, USA: ACM, 2017, pp. 193--198. [Online]. Available Google ScholarDigital Library
Y. Bao, H. Wu, T. Zhang, A. A. Ramli, and X. Liu, "Shooting a moving target: Motion-prediction-based transmission for 360-degree videos," in 2016 IEEE International Conference on Big Data (Big Data). IEEE, 2016, pp. 1161--1170.Google Scholar
M. Hessel, J. Modayil, H. Van Hasselt, T. Schaul, G. Ostrovski, W. Dabney, D. Horgan, B. Piot, M. Azar, and D. Silver, "Rainbow: Combining improvements in deep reinforcement learning," in Thirty-Second AAAI Conference on Artificial Intelligence, 2018.Google Scholar

Index Terms

PAAS: a preference-aware deep reinforcement learning approach for 360° video streaming
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia streaming

Recommendations

PARIMA: Viewport Adaptive 360-Degree Video Streaming
WWW '21: Proceedings of the Web Conference 2021

With increasing advancements in technologies for capturing 360° videos, advances in streaming such videos have become a popular research topic. However, streaming 360° videos require high bandwidth, thus escalating the need for developing optimized ...
Read More
Optimizing mobile video streaming through collaboration between content providers and 5G networks
MHV '24: Proceedings of the 3rd Mile-High Video Conference

The use of the internet and general-purpose networks to transport and consume media content is increasing over years. According to the Ericsson Mobility Report, by 2028, 5G networks will carry 69 percent of the world's smartphone traffic. The largest and ...
Read More
Design and Analysis of QoE-Aware Quality Adaptation for DASH: A Spectrum-Based Approach
Special Section on Deep Learning for Mobile Multimedia and Special Section on Best Papers from ACM MMSys/NOSSDAV 2016

The dynamics of the application-layer-based control loop of dynamic adaptive streaming over HTTP (DASH) make video bitrate selection for DASH a difficult problem. In this work, we provide a DASH quality adaptation algorithm, named SQUAD, that is ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
NOSSDAV '21: Proceedings of the 31st ACM Workshop on Network and Operating Systems Support for Digital Audio and Video
July 2021
128 pages
ISBN:9781450384353
DOI:10.1145/3458306
Program Chairs:
Andra Lutu
Telefonica, Spain
,
Gwendal Simon
Huawei, France
,
Mylene C. Q. Farias
University of Brasilia, Brazil
Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 July 2021
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
360° video streaming
adaptive bitrate streaming
Qualifiers
- research-article
Conference

Acceptance Rates
NOSSDAV '21 Paper Acceptance Rate15of52submissions,29%Overall Acceptance Rate118of363submissions,33%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 5
  Total Citations
  View Citations
- 327
  Total Downloads
- Downloads (Last 12 months)48
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

PAAS: a preference-aware deep reinforcement learning approach for 360° video streaming

NOSSDAV '21: Proceedings of the 31st ACM Workshop on Network and Operating Systems Support for Digital Audio and Video

ABSTRACT

References

Cited By

Index Terms

Recommendations

PARIMA: Viewport Adaptive 360-Degree Video Streaming

Optimizing mobile video streaming through collaboration between content providers and 5G networks

Design and Analysis of QoE-Aware Quality Adaptation for DASH: A Spectrum-Based Approach