skip to main content
10.1145/3625468.3647612acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article
Free Access

FovOptix: Human Vision-Compatible Video Encoding and Adaptive Streaming in VR Cloud Gaming

Authors Info & Claims
Published:17 April 2024Publication History

ABSTRACT

VR cloud gaming enables users to play high-end VR games on lightweight devices by offloading rendering tasks to cloud servers. Despite video compression, high-definition video streaming requires substantial data transfer rates. Foveated rendering (FR) and video encoding (FVE) leverage the non-uniform perception of the human visual system to reduce computing and bandwidth demand. They enhance visual quality in central gaze regions and reduce it in the periphery. However, bandwidth variation may hinder the provision of smooth VR gaming experiences. We present FovOptix, a system that combines FR with adaptive FVE to deliver video stream at a lower yet adaptive bitrate while not compromising the perceived video quality. FovOptix is based on a game-agnostic open-source to ensure reproducibility and compatibility with various games. We evaluate FovOptix against benchmarks using 5G mobile network traces. FovOptix achieves a latency reduction of 3% compared to the Google standard and a significant +100% reduction compared to other solutions. Additionally, it enhances the visual quality within the player's region of interest. Consequently, FovOptix attains the highest playability and gaming scores while minimizing the severity of motion sickness. FovOptix thus offers smooth and accessible VR cloud gaming for a wider range of players.

References

  1. 2018. SteamVR. https://store.steampowered.com/app/250820/SteamVR/Google ScholarGoogle Scholar
  2. 2023. Air Light VR (ALVR). https://github.com/alvr-org/ALVRGoogle ScholarGoogle Scholar
  3. 2023. NVENC Video Encoder API Programming Guide. https://docs.nvidia.com/video-technologies/video-codec-sdk/12.1/nvenc-video-encoder-api-prog-guide/index.htmlGoogle ScholarGoogle Scholar
  4. 2023. Real-time Communication for the Web (WebRTC). https://webrtc.org/Google ScholarGoogle Scholar
  5. Mehmet N. Akcay. 2021. Improving Server and Client-Side Algorithms for Adaptive Streaming of Non-Immersive and Immersive Media. In Proceedings of the 12th ACM Multimedia Systems Conference (Istanbul, Turkey) (MMSys '21). Association for Computing Machinery, New York, NY, USA, 383--387. https://doi.org/10.1145/3458305.3478461Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Rachel Albert, Anjul Patney, David Luebke, and Joohwan Kim. 2017. Latency Requirements for Foveated Rendering in Virtual Reality. ACM Trans. Appl. Percept. 14, 4, Article 25 (sep 2017), 13 pages. https://doi.org/10.1145/3127589Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Ahmad Alhilal, Tristan Braud, Bo Han, and Pan Hui. 2022. Nebula: Reliable Low-Latency Video Transmission for Mobile Cloud Gaming. In Proceedings of the ACM Web Conference 2022 (Virtual Event, Lyon, France) (WWW '22). Association for Computing Machinery, New York, NY, USA, 3407--3417. https://doi.org/10.1145/3485447.3512276Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D.A. Atchison. 2023. Optics of the Human Eye. CRC Press. https://books.google.com.hk/books?id=5-WtEAAAQBAJGoogle ScholarGoogle Scholar
  9. Gaetano Carlucci, Luca De Cicco, Stefan Holmer, and Saverio Mascolo. 2016. Analysis and Design of the Google Congestion Control for Web Real-Time Communication (WebRTC). In Proceedings of the 7th International Conference on Multimedia Systems (Klagenfurt, Austria) (MMSys '16). Association for Computing Machinery, New York, NY, USA, Article 13, 12 pages. https://doi.org/10.1145/2910017.2910605Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Matthias Dick, Oliver Wellnitz, and Lars Wolf. 2005. Analysis of Factors Affecting Players' Performance and Perception in Multiplayer Games. In Proceedings of 4th ACM SIGCOMM Workshop on Network and System Support for Games (Hawthorne, NY) (NetGames '05). Association for Computing Machinery, New York, NY, USA, 1--7. https://doi.org/10.1145/1103599.1103624Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Florian Frieß, Matthias Braun, Valentin Bruder, Steffen Frey, Guido Reina, and Thomas Ertl. 2021. Foveated Encoding for Large High-Resolution Displays. IEEE Transactions on Visualization and Computer Graphics 27, 2 (2021), 1850--1859. https://doi.org/10.1109/TVCG.2020.3030445Google ScholarGoogle ScholarCross RefCross Ref
  12. Sandra G Hart. 2006. NASA-task load index (NASA-TLX); 20 years later. In Proceedings of the human factors and ergonomics society annual meeting, Vol. 50. Sage publications Sage CA: Los Angeles, CA, 904--908.Google ScholarGoogle ScholarCross RefCross Ref
  13. Jan Horský. 2022. Crowdsourcing VR headset data VR headset database. https://www.infinite.cz/projects/HMD-tester-virtual-reality-headset-database-utilityGoogle ScholarGoogle Scholar
  14. Sudeng Hu, Hanli Wang, and Sam Kwong. 2012. Adaptive Quantization-Parameter Clip Scheme for Smooth Quality in H.264/AVC. IEEE Transactions on Image Processing 21 (04 2012), 1911--1919. https://doi.org/10.1109/TIP.2011.2176347Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Chun-Ying Huang, Cheng-Hsin Hsu, Yu-Chun Chang, and Kuan-Ta Chen. 2013. GamingAnywhere: An Open Cloud Gaming System. In Proceedings of the 4th ACM Multimedia Systems Conference (Oslo, Norway) (MMSys '13). Association for Computing Machinery, New York, NY, USA, 36--47. https://doi.org/10.1145/2483977.2483981Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Bert Hubert. [n.d.]. Linux TC Man Page. https://linux.die.net/man/8/tcGoogle ScholarGoogle Scholar
  17. Gazi Karam Illahi, Thomas Van Gemert, Matti Siekkinen, Enrico Masala, Antti Oulasvirta, and Antti Ylä-Jääski. 2020. Cloud Gaming with Foveated Video Encoding. ACM Trans. Multimedia Comput. Commun. Appl. 16, 1, Article 7 (feb 2020), 24 pages. https://doi.org/10.1145/3369110Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Gazi Karam Illahi, Matti Siekkinen, Teemu Kämäräinen, and Antti Ylä-Jääski. 2020. On the Interplay of Foveated Rendering and Video Encoding. In Proceedings of the 26th ACM Symposium on Virtual Reality Software and Technology (Virtual Event, Canada) (VRST '20). Association for Computing Machinery, New York, NY, USA, Article 66, 3 pages. https://doi.org/10.1145/3385956.3422126Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Gazi Karam Illahi, Matti Siekkinen, Teemu Kämäräinen, and Antti Ylä-Jääski. 2021. Foveated Streaming of Real-Time Graphics. In Proceedings of the 12th ACM Multimedia Systems Conference (Istanbul, Turkey) (MMSys '21). Association for Computing Machinery, New York, NY, USA, 214--226. https://doi.org/10.1145/3458305.3463383Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yoshio Ishiguro and Jun Rekimoto. 2011. Peripheral vision annotation: noninterference information presentation method for mobile augmented reality. In Proceedings of the 2nd Augmented Human International Conference. 1--5.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Nuwan Janaka, Chloe Haigh, Hyeongcheol Kim, Shan Zhang, and Shengdong Zhao. 2022. Paracentral and Near-Peripheral Visualizations: Towards Attention-Maintaining Secondary Information Presentation on OHMDs during in-Person Social Interactions. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI '22). Association for Computing Machinery, New York, NY, USA, Article 551, 14 pages. https://doi.org/10.1145/3491102.3502127Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Junchen Jiang, Vyas Sekar, and Hui Zhang. 2012. Improving Fairness, Efficiency, and Stability in HTTP-Based Adaptive Video Streaming with FESTIVE. In Proceedings of the 8th International Conference on Emerging Networking Experiments and Technologies (Nice, France) (CoNEXT '12). Association for Computing Machinery, New York, NY, USA, 97--108. https://doi.org/10.1145/2413176.2413189Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Xing Liu, Bo Han, Feng Qian, and Matteo Varvello. 2019. LIME: Understanding Commercial 360° Live Video Streaming Services. In Proceedings of the 10th ACM Multimedia Systems Conference (Amherst, Massachusetts) (MMSys '19). Association for Computing Machinery, New York, NY, USA, 154--164. https://doi.org/10.1145/3304109.3306220Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Vignesh V Menon, Christian Feldmann, Hadi Amirpour, Mohammad Ghanbari, and Christian Timmerer. 2022. VCA: Video Complexity Analyzer. In Proceedings of the 13th ACM Multimedia Systems Conference (Athlone, Ireland) (MMSys '22). Association for Computing Machinery, New York, NY, USA, 259--264. https://doi.org/10.1145/3524273.3532896Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Lothar Pantel and Lars C. Wolf. 2002. On the Impact of Delay on Real-Time Multiplayer Games. In Proceedings of the 12th International Workshop on Network and Operating Systems Support for Digital Audio and Video (Miami, Florida, USA) (NOSSDAV '02). Association for Computing Machinery, New York, NY, USA, 23--29. https://doi.org/10.1145/507670.507674Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Darijo Raca, Dylan Leahy, Cormac J. Sreenan, and Jason J. Quinlan. 2020. Beyond Throughput, the next Generation: A 5G Dataset with Channel and Context Metrics. In Proceedings of the 11th ACM Multimedia Systems Conference (Istanbul, Turkey) (MMSys '20). Association for Computing Machinery, New York, NY, USA, 303--308. https://doi.org/10.1145/3339825.3394938Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Jihoon Ryoo, Kiwon Yun, Dimitris Samaras, Samir R. Das, and Gregory Zelinsky. 2016. Design and Evaluation of a Foveated Video Streaming Service for Commodity Client Devices. In Proceedings of the 7th International Conference on Multimedia Systems (Klagenfurt, Austria) (MMSys '16). Association for Computing Machinery, New York, NY, USA, Article 6, 11 pages. https://doi.org/10.1145/2910017.2910592Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Jihoon Ryoo, Kiwon Yun, Dimitris Samaras, Samir R. Das, and Gregory Zelinsky. 2016. Design and Evaluation of a Foveated Video Streaming Service for Commodity Client Devices. In Proceedings of the 7th International Conference on Multimedia Systems (Klagenfurt, Austria) (MMSys '16). Association for Computing Machinery, New York, NY, USA, Article 6, 11 pages. https://doi.org/10.1145/2910017.2910592Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Andrea Scupola, Alessandra Mastrocola, Paola Sasso, Romina Fasciani, Lucrezia Montrone, Benedetto Falsini, and Edoardo Abed. 2013. Assessment of Retinal Function Before and After Idiopathic Macular Hole Surgery. American Journal of Ophthalmology 156, 1 (2013), 132--139.e1. https://doi.org/10.1016/j.ajo.2013.02.007Google ScholarGoogle ScholarCross RefCross Ref
  30. Hans Strasburger, Ingo Rentschler, and Martin Jüttner. 2011. Peripheral vision and pattern recognition: A review. Journal of Vision 11, 5 (Dec. 2011), 13--13. https://doi.org/10.1167/1L5.13_eprint: https://arvojournals.org/arvo/content_public/journal/jov/933487/jov-11-5-13.pdf.Google ScholarGoogle ScholarCross RefCross Ref
  31. M. Tun, K.K. Loo, and J. Cosmas. 2008. Rate control algorithm based on quality factor optimization for Dirac video codec. Signal Processing: Image Communication 23, 9 (2008), 649--664. https://doi.org/10.1016/j.image.2008.07.003Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. FovOptix: Human Vision-Compatible Video Encoding and Adaptive Streaming in VR Cloud Gaming

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in
                • Published in

                  cover image ACM Conferences
                  MMSys '24: Proceedings of the 15th ACM Multimedia Systems Conference
                  April 2024
                  557 pages
                  ISBN:9798400704123
                  DOI:10.1145/3625468

                  Copyright © 2024 ACM

                  Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 17 April 2024

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • research-article
                  • Research
                  • Refereed limited

                  Acceptance Rates

                  Overall Acceptance Rate176of530submissions,33%
                • Article Metrics

                  • Downloads (Last 12 months)60
                  • Downloads (Last 6 weeks)60

                  Other Metrics

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader