skip to main content
10.1145/3605181.3626191acmconferencesArticle/Chapter ViewAbstractPublication PagessospConference Proceedingsconference-collections
short-paper
Open Access

Enabling In-Vitro Serverless Systems Research

Published:23 October 2023Publication History

ABSTRACT

Serverless is an increasingly popular cloud computing paradigm that has stimulated new systems research opportunities. However, developing and evaluating serverless systems in a research setting (i.e., "in-vitro", without access to a large-scale production cluster and real workloads) is challenging yet vital for innovation. Recently, several serverless providers have released production traces consisting of large sets of functions with their invocation inter-arrival time, execution time, and memory footprint distributions. However, executing the workload synthesized from these traces requires a massive cluster, making experiments expensive and time-consuming.

In this work, we show how to use the data available in production traces to construct workload summaries of configurable scales that remain highly representative of the original trace characteristics and can be used to evaluate serverless systems in-vitro. Compared to random sampling of functions from the original trace, our method can generate summaries of up to 10X higher representativity, measured as the average of the Wasserstein distances of the distributions of interest (e.g., function execution time and invocation inter-arrival time) from the respective distributions in the original trace. We release our toolchain that enables researchers to synthesize representative workload summaries and show how it can be used to evaluate the performance of serverless systems at diverse load scale factors.

References

  1. Additional Autoscaling Configuration for Knative Pod Autoscaler. Available at https://knative.dev/docs/serving/autoscaling/kpa-specific/#stable-window.Google ScholarGoogle Scholar
  2. Fission: Open Source, Kubernetes-Native Serverless Framework. Available at https://fission.io.Google ScholarGoogle Scholar
  3. Fn project. Available at https://fnproject.io.Google ScholarGoogle Scholar
  4. Knative. Available at https://knative.dev.Google ScholarGoogle Scholar
  5. Prometheus. Available at https://prometheus.io.Google ScholarGoogle Scholar
  6. Transaction Processing Performance Council. Available at https://www.tpc.org.Google ScholarGoogle Scholar
  7. What are Industry 4.0, the Fourth Industrial Revolution, and 4IR? Available at https://www.mckinsey.com/featured-insights/mckinsey-explainers/what-are-industry-4-0-the-fourth-industrial-revolution-and-4ir.Google ScholarGoogle Scholar
  8. Apache. OpenWhisk. Available at https://openwhisk.apache.org/.Google ScholarGoogle Scholar
  9. Datadog. The State of Serverless 2021. Available at https://www.datadoghq.com/state-of-serverless-2021.Google ScholarGoogle Scholar
  10. Deep, S., Gruenheid, A., Koutris, P., Naughton, J. F., and Viglas, S. Comprehensive and Efficient Workload Compression. Proc. VLDB Endow. 14, 3 (2020), 418--430.Google ScholarGoogle Scholar
  11. Duplyakin, D., Ricci, R., Maricq, A., Wong, G., Duerig, J., Eide, E., Stoller, L., Hibler, M., Johnson, D., Webb, K., Akella, A., Wang, K.-C., Ricart, G., Landweber, L., Elliott, C., Zink, M., Cecchet, E., Kar, S., and Mishra, P. The Design and Operation of CloudLab. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC) (2019), pp. 1--14.Google ScholarGoogle Scholar
  12. Fuerst, A., and Sharma, P. FaasCache: Keeping Serverless Computing Alive with Greedy-Dual Caching. In Proceedings of the 26th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-XXVI) (2021), pp. 386--400.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Hendrickson, S., Sturdevant, S., Harter, T., Venkataramani, V., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. Serverless Computation with OpenLambda. In 8th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud) (2016).Google ScholarGoogle Scholar
  14. Kaffes, K., Yadwadkar, N. J., and Kozyrakis, C. Hermod: Principled and Practical Scheduling for Serverless Functions. In Proceedings of the 2022 ACM Symposium on Cloud Computing (SOCC) (2022), pp. 289--305.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Kogias, M., Mallon, S., and Bugnion, E. Lancet: A Self-Correcting Latency Measuring Tool. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC) (2019), pp. 881--896.Google ScholarGoogle Scholar
  16. Kubeless. Kubeless: The Kubernetes Native Serverless Framework. Available at https://kubeless.io.Google ScholarGoogle Scholar
  17. Leverich, J., and Kozyrakis, C. Reconciling High Server Utilization and Sub-Millisecond Quality-of-service. In Proceedings of the 2014 EuroSys Conference (2014), pp. 4:1--4:14.Google ScholarGoogle Scholar
  18. Li, Z., Zheng, L., Zhong, Y., Liu, V., Sheng, Y., Jin, X., Huang, Y., Chen, Z., Zhang, H., Gonzalez, J. E., and Stoica, I. AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving. In 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI) (2023), USENIX Association.Google ScholarGoogle Scholar
  19. Liang, M., Fu, W., Feng, L., Lin, Z., Panakanti, P., Zheng, S., Sridharan, S., and Delimitrou, C. Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks. In Proceedings of the 50th Annual International Symposium on Computer Architecture, ISCA 2023, Orlando, FL, USA, June 17--21, 2023 (2023), Y. Solihin and M. A. Heinrich, Eds., ACM, pp. 37:1--37:13.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Panda, R., Zheng, X., Gerstlauer, A., and John, L. K. CAMP: Accurate Modeling of Core and Memory Locality for Proxy Generation of Big-Data Applications. In 2018 Design, Automation & Test in Europe Conference & Exhibition, DATE 2018, Dresden, Germany, March 19--23, 2018 (2018), J. Madsen and A. K. Coskun, Eds., IEEE, pp. 337--342.Google ScholarGoogle ScholarCross RefCross Ref
  21. Saxena, D., Ji, T., Singhvi, A., Khalid, J., and Akella, A. Memory Deduplication for Serverless Computing with Medes. In Proceedings of the 2022 EuroSys Conference (2022), pp. 714--729.Google ScholarGoogle Scholar
  22. Shahrad, M., Fonseca, R., Goiri, I., Chaudhry, G., Batum, P., Cooke, J., Laureano, E., Tresness, C., Russinovich, M., and Bianchini, R. Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider. In Proceedings of the 2020 USENIX Annual Technical Conference (ATC) (2020), pp. 205--218.Google ScholarGoogle Scholar
  23. Singhvi, A., Balasubramanian, A., Houck, K., Shaikh, M. D., Venkataraman, S., and Akella, A. Atoll: A Scalable Low-Latency Serverless Platform. In Proceedings of the 2021 ACM Symposium on Cloud Computing (SOCC) (2021), p. 138--152.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Ustiugov, D., Amariucai, T., and Grot, B. Analyzing Tail Latency in Serverless Clouds with STeLLAR. In Proceedings of the 2021 IEEE International Symposium on Workload Characterization (IISWC) (2021), pp. 51--62.Google ScholarGoogle Scholar
  25. Ustiugov, D., Amariucai, T., and Grot, B. Analyzing Tail Latency in Serverless Clouds with STeLLAR. In IEEE International Symposium on Workload Characterization (IISWC) (2021).Google ScholarGoogle Scholar
  26. Ustiugov, D., Petrov, P., Kogias, M., Bugnion, E., and Grot, B. Benchmarking, Analysis, and Optimization of Serverless Function Snapshots. In Proceedings of the 26th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-XXVI) (2021), pp. 559--572.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Wang, A., Chang, S., Tian, H., Wang, H., Yang, H., Li, H., Du, R., and Cheng, Y. FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute. In Proceedings of the 2021 USENIX Annual Technical Conference (ATC) (2021), pp. 443--457.Google ScholarGoogle Scholar
  28. Yu, Z., Eeckhout, L., Goswami, N., Li, T., John, L. K., Jin, H., Xu, C., and Wu, J. GPGPU-MiniBench: Accelerating GPGPU Micro-Architecture Simulation. IEEE Trans. Computers 64, 11 (2015), 3153--3166.Google ScholarGoogle Scholar
  29. Zhang, J., Jin, C., Huang, Y., Yi, L., Ding, Y., and Guo, F. KOLE: Breaking the Scalability Barrier for Managing Far Edge Nodes in Cloud. In Proceedings of the 2022 ACM Symposium on Cloud Computing (SOCC) (2022).Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    WORDS '23: Proceedings of the 4th Workshop on Resource Disaggregation and Serverless
    October 2023
    60 pages
    ISBN:9798400702501
    DOI:10.1145/3605181

    Copyright © 2023 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 23 October 2023

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • short-paper

    Upcoming Conference

    SOSP '24
  • Article Metrics

    • Downloads (Last 12 months)301
    • Downloads (Last 6 weeks)29

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader