skip to main content
10.1145/3404397.3404466acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article
Public Access

A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost

Published:17 August 2020Publication History

ABSTRACT

Currently, many web applications are deployed on cloud storage service provided by cloud service providers (CSPs). A CSP offers different types of storage including hot, cold and archive storage and sets unit prices for these different types, which vary substantially. By properly assigning the data files of a web application to different types of storage based on their usage profiles and the CSP’s pricing policy, a cloud customer potentially can achieve substantial cost savings and minimize the payment to the CSP. However, no previous research handles this problem. Towards this goal, we present a Markov Decision Process formulation for the cost minimization problem, and then develop a reinforcement learning based approach to effectively solve the problem, which changes the type of storage of each data file periodically to minimize money cost in long term. We then propose a method to aggregate concurrently requested data files to further reduce the cloud storage service payment for a web application. Our experiments with Wikipedia traces show the effectiveness of the proposed methods for minimizing cloud customer cost in comparison with other methods.

References

  1. [n.d.]. Amazon S3. https://aws.amazon.com/cn/s3/, [accessed in Jan. 2020].Google ScholarGoogle Scholar
  2. [n.d.]. ARIMA model for Time Series Forecasting. https://machinelearningmastery.com/arima-for-time-series-forecasting-with-python/, [accessed in Jan. 2020].Google ScholarGoogle Scholar
  3. [n.d.]. Azure Storage Pricing Policy. https://azure.microsoft.com/en-us/pricing/details/storage/blobs/ , [accessed in Jan. 2020].Google ScholarGoogle Scholar
  4. [n.d.]. Google Cloud Storage. https://cloud.google.com/storage/, [accessed in Jan. 2020].Google ScholarGoogle Scholar
  5. [n.d.]. Microsoft Azure. https://azure.microsoft.com/en-us/, [accessed in Jan. 2020].Google ScholarGoogle Scholar
  6. [n.d.]. Page View statistics from Wikimedia Projects. https://dumps.wikimedia.org/other/pagecounts-ez/, [accessed in Jan. 2020].Google ScholarGoogle Scholar
  7. Martín A., Paul B., Jianmin C., Zhifeng C., Andy D., Jeffrey D., Matthieu D., Sanjay G., Geoffrey I., and Michael I.2016. Tensorflow: a system for large-scale machine learning.. In Proc. of OSDI.Google ScholarGoogle Scholar
  8. H. Abu-Libdeh, L. Princehouse, and H. Weatherspoon. 2010. RACS: a case for cloud storage diversity. In Proc. of SOCC.Google ScholarGoogle Scholar
  9. A. Adya, W. Bolosky, M. Castro, G. Cermak, R. Chaiken, J. Douceur, J. Howell, J. Lorch, M. Theimer, and R. Wattenhofer. 2002. FARSITE: Federated, available, and reliable storage for an incompletely trusted environment. ACM SIGOPS Operating Systems Review(2002).Google ScholarGoogle ScholarCross RefCross Ref
  10. G. Alvarez, E. Borowsky, S. Go, T. Romer, R. Becker-Szendy, R. Golding, A. Merchant, M. Spasojevic, A. Veitch, and J. Wilkes. 2001. Minerva: An automated resource provisioning tool for large-scale storage systems. Trans. on TOCS (2001).Google ScholarGoogle Scholar
  11. E. Anderson, M. Hobbs, K. Keeton, S. Spence, M. Uysal, and A. Veitch. 2002. Hippodrome: Running Circles Around Storage Administration.. In Proc. of FAST.Google ScholarGoogle Scholar
  12. L. Chen, J. Lingys, K. Chen, and F. Liu. 2018. Auto: Scaling deep reinforcement learning for datacenter-scale automatic traffic optimization. In Proc. of SIGCOM.Google ScholarGoogle Scholar
  13. J. E, Y. Cui, M. Ruan, Z. Li, and E. Zhai. 2019. HyCloud: Tweaking Hybrid Cloud Storage Services for Cost-Efficient Filesystem Hosting. In Proc. of INFOCOM.Google ScholarGoogle Scholar
  14. J. Gao, H. Wang, and H. Shen. 2020. Smartly Handling Renewable Energy Instability in Supporting A Cloud Datacenter. In Proc. of IPDPS.Google ScholarGoogle Scholar
  15. S. Hillmer and G. Tiao. 1982. An ARIMA-model-based approach to seasonal adjustment. J. Amer. Statist. Assoc.(1982).Google ScholarGoogle Scholar
  16. R. Howard. 1964. Dynamic programming and Markov processes. (1964).Google ScholarGoogle Scholar
  17. H. Jin, H. Guo, L. Su, K. Nahrstedt, and X. Wang. 2019. Dynamic Task Pricing in Multi-Requester Mobile Crowd Sensing with Markov Correlated Equilibrium. In Proc. of INFOCOM.Google ScholarGoogle Scholar
  18. Ana K., Heiner L., and Christos K.2018. Selecta: heterogeneous cloud storage configuration for data analytics. In Proc. of USENIX ATC.Google ScholarGoogle Scholar
  19. Leslieb K. and Andrew L., Michaeland M.1996. Reinforcement learning: A survey. Journal of artificial intelligence research(1996).Google ScholarGoogle Scholar
  20. R. Kotla, L. Alvisi, and M. Dahlin. 2007. SafeStore: A durable and practical storage system. In Proc. of ATC.Google ScholarGoogle Scholar
  21. Yang L., Li G., Akara S., and Yike G.2014. Enabling performance as a service for a cloud storage system. In Proc. of CLOUD.Google ScholarGoogle Scholar
  22. H. Li, L. Zhong, J. Liu, B. Li, and K. Xu. 2011. Cost-effective partial migration of VoD services to content clouds. In Proc. of Cloud.Google ScholarGoogle Scholar
  23. M. Li, C. Qin, J. Li, and P. Lee. 2016. CDStore: Toward reliable, secure, and cost-efficient cloud storage via convergent dispersal. Prof. of ATC (2016).Google ScholarGoogle Scholar
  24. G. Liu, H. Shen, and H. Wang. 2017. An economical and SLO-guaranteed cloud storage service across multiple cloud service providers. Trans. on TPDS (2017).Google ScholarGoogle Scholar
  25. H. Madhyastha, J. McCullough, G. Porter, R. Kapoor, S. Savage, A. Snoeren, and A. Vahdat. 2012. scc: cluster storage provisioning informed by application characteristics and SLAs.. In Proc. of FAST.Google ScholarGoogle Scholar
  26. H. Mao, M. Alizadeh, I. Menache, and S. Kandula. 2016. Resource management with deep reinforcement learning. In Proc. of HotNet.Google ScholarGoogle Scholar
  27. H. Mao, R. Netravali, and M. Alizadeh. 2017. Neural adaptive video streaming with pensieve. In Proc. of SIGCOM.Google ScholarGoogle Scholar
  28. W. Mao, Z. Zheng, and F. Wu. 2019. Pricing for revenue maximization in iot data markets: An information design perspective. In Proc. of INFOCOM.Google ScholarGoogle Scholar
  29. V. Mnih, P. Badia, M. Mirza, A. Graves, T. Lillicrap, T. Harley, D. Silver, and K. Kavukcuoglu. 2016. Asynchronous methods for deep reinforcement learning. In International conference on machine learning.Google ScholarGoogle Scholar
  30. V. Mnih, K. Kavukcuoglu, D. Silver, A. Rusu, J. Veness, M. Bellemare, A. Graves, M. Riedmiller, A. Fidjeland, and G. Ostrovski. 2015. Human-level control through deep reinforcement learning. Nature (2015).Google ScholarGoogle Scholar
  31. Di Niu, Hong Xu, and Baochun Li. 2012. Quality-assured cloud bandwidth auto-scaling for video-on-demand applications.. In Proc. of INFOCOM.Google ScholarGoogle ScholarCross RefCross Ref
  32. B. Plaza. 2011. Google Analytics for measuring website performance. Tourism Management (2011).Google ScholarGoogle Scholar
  33. Z. Pooranian, K. Chen, C. Yu, and M. Conti. 2018. RARE: Defeating side channels based on data-deduplication in cloud storage. In Proc. of INFOCOM workshop.Google ScholarGoogle Scholar
  34. H. Roh, C. Jung, W. Lee, and D. Du. 2013. Resource pricing game in geo-distributed clouds. In Proc. of INFOCOM.Google ScholarGoogle Scholar
  35. Richard S., Doina P., and Satinder S.1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. AI (1999).Google ScholarGoogle Scholar
  36. D. Silver, A. Huang, C. Maddison, A. Guez, L. Sifre, G. Van Den Driessche, J. Schrittwieser, I. Antonoglou, and V. Panneershelvam. 2016. Mastering the game of Go with deep neural networks and tree search. nature (2016).Google ScholarGoogle Scholar
  37. Y. Song, M. Zafer, and K. Lee. 2012. Optimal bidding in spot instance market. In Proc. of INFOCOM.Google ScholarGoogle Scholar
  38. R. Sutton, A. Barto, and F. Bach. 1998. Reinforcement learning: An introduction.Google ScholarGoogle Scholar
  39. R. Sutton, D. McAllester, S. Singh, and Y. Mansour. 2000. Policy gradient methods for reinforcement learning with function approximation. In Proc. of ANIPS.Google ScholarGoogle Scholar
  40. Zhe W., Curtis Y., and Harsha V M.2015. CosTLO: Cost-Effective Redundancy for Lower Latency Variance on Cloud Storage Services.. In Proc. of NSDI.Google ScholarGoogle Scholar
  41. F. Wang, J. Liu, and M. Chen. 2012. CALMS: Cloud-assisted live media streaming for globalized demands with time/region diversities. In Proc. of INFOCOM.Google ScholarGoogle Scholar
  42. H. Wang and H. Shen. 2018. Proactive incast congestion control in a datacenter serving web applications. In Proc. of INFOCOM.Google ScholarGoogle Scholar
  43. B. Wickremasinghe and R. Buyya. 2009. CloudAnalyst: A CloudSim-based tool for modelling and analysis of large scale cloud computing environments. Prof. of MEDC (2009).Google ScholarGoogle Scholar
  44. A. Wieder, P. Bhatotia, A. Post, and R. Rodrigues. 2012. Orchestrating the Deployment of Computations in the Cloud with Conductor.. In Proc. of NSDI.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Z. Wu, M. Butkiewicz, D. Perkins, E. Katz-Bassett, and H. Madhyastha. 2013. Spanstore: Cost-effective geo-replicated storage spanning multiple cloud services. In Proc. of SOSP.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICPP '20: Proceedings of the 49th International Conference on Parallel Processing
    August 2020
    844 pages
    ISBN:9781450388160
    DOI:10.1145/3404397

    Copyright © 2020 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 17 August 2020

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate91of313submissions,29%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format