skip to main content
research-article

Inter-datacenter bulk transfers with netstitcher

Published:15 August 2011Publication History
Skip Abstract Section

Abstract

Large datacenter operators with sites at multiple locations dimension their key resources according to the peak demand of the geographic area that each site covers. The demand of specific areas follows strong diurnal patterns with high peak to valley ratios that result in poor average utilization across a day. In this paper, we show how to rescue unutilized bandwidth across multiple datacenters and backbone networks and use it for non-real-time applications, such as backups, propagation of bulky updates, and migration of data. Achieving the above is non-trivial since leftover bandwidth appears at different times, for different durations, and at different places in the world.

For this purpose, we have designed, implemented, and validated NetStitcher, a system that employs a network of storage nodes to stitch together unutilized bandwidth, whenever and wherever it exists. It gathers information about leftover resources, uses a store-and-forward algorithm to schedule data transfers, and adapts to resource fluctuations.

We have compared NetStitcher with other bulk transfer mechanisms using both a testbed and a live deployment on a real CDN. Our testbed evaluation shows that NetStitcher outperforms all other mechanisms and can rescue up to five times additional datacenter bandwidth thus making it a valuable tool for datacenter providers. Our live CDN deployment demonstrates that our solution can perform large data transfers at a much lower cost than naive end-to-end or store-and-forward schemes.

Skip Supplemental Material Section

Supplemental Material

sigcomm_3_1.mp4

mp4

141.2 MB

References

  1. Amazon Simple Storage Service. aws.amazon.com/s3/.Google ScholarGoogle Scholar
  2. Datacenter map. www.datacentermap.com/datacenters.html.Google ScholarGoogle Scholar
  3. Equinix datacenter map. www.equinix.com/data-center-locations/map/.Google ScholarGoogle Scholar
  4. Facebook Statistics. www.facebook.com/press/info.php?statistics.Google ScholarGoogle Scholar
  5. James Hamilton's blog: Inter-datacenter replication & geo-redundancy. perspectives.mvdirona.com/2010/05/10/InterDatacenterReplicationGeoRedundancy.aspx.Google ScholarGoogle Scholar
  6. Personal communication.Google ScholarGoogle Scholar
  7. Riverbed Products & Services. http://www.riverbed.com/us/products/index.php.Google ScholarGoogle Scholar
  8. Signiant Products. www.signiant.com/Products-content-distribution-management/.Google ScholarGoogle Scholar
  9. Telefonica International Wholesale Services Network Map. http://www.e-mergia.com/en/mapaRed.html.Google ScholarGoogle Scholar
  10. Forrester Research. The Future of Data Center Wide-Area Networking. info.infineta.com/l/5622/2011-01-27/Y26, 2010.Google ScholarGoogle Scholar
  11. Ovum. CDNs The Carrier Opportunity. reports.innovaro.com/reports/cdns-the-carrier-opportunity, 2010.Google ScholarGoogle Scholar
  12. B. Ager, F. Schneider, J. Kim, and A. Feldmann. Revisiting Cacheability in Times of User Generated Content. In Global Internet'10.Google ScholarGoogle Scholar
  13. D. Andersen, H. Balakrishnan, F. Kaashoek, and R. Morris. Resilient Overlay Networks. In SOSP'01. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Beaver, S. Kumar, H. C. Li, J. Sobel, and P. Vajgel. Finding a Needle in Haystack: Facebook's Photo Storage. In OSDI'10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Breitgand, D. Raz, and Y. Shavitt. The Traveling Miser Problem. In IEEE/ACM Transactions on Networking, '06. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Chen, W. He, J. Liu, S. Nath, L. Rigas, L. Xiao, and F. Zhao. Energy-Aware Server Provisioning and Load Dispatching for Connection-Intensive Internet Services. In NSDI'08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Y. Chen, S. Jain, V. K. Adhikari, Z.-L. Zhang, and K. Xu. A First Look at Inter-Data Center Traffic Characteristics via Yahoo! Datasets. In INFOCOM '11.Google ScholarGoogle Scholar
  18. B. Cohen. Incentives Build Robustness in BitTorrent. In P2P Econ'03.Google ScholarGoogle Scholar
  19. T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. MIT Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Gray. Long Term Storage Trends and You, 2006. http://research.microsoft.com/~Gray/talks/IO_talk_2006.ppt.Google ScholarGoogle Scholar
  21. A. Greenberg, J. Hamilton, D. A. Maltz, and P. Patel. The Cost of a Aloud: Research Problems in Data Center Networks. SIGCOMM Comput. Commun. Rev., 39(1), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Greenberg, J. R. Hamilton, N. Jain, S. Kandula, C. Kim, P. Lahiri, D. A. Maltz, P. Patel, and S. Sengupta. VL2: a Scalable and Flexible Data Center Network. In SIGCOMM'09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Grossglauser and D. Tse. Mobility Increases the Capacity of Ad-hoc Wireless Networks. In IEEE/ACM Transactions on Networking, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Jain, K. Fall, and R. Patra. Routing in a Delay Tolerant Network. In SIGCOMM'04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Katabi, I. Bazzi, and X. Yang. A Passive Approach for Detecting Shared Bottlenecks. In Proc. of IEEE ICCN '02.Google ScholarGoogle Scholar
  26. R. Kohavi, R. M. Henne, and D. Sommerfield. Practical Guide to Controlled Experiments on the Web: Listen to Your Customers not to the HiPPO. In KDD'07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. R. Kokku, A. Bohra, S. Ganguly, and A. Venkataramani. A Multipath Background Network Architecture. In INFOCOM'07.Google ScholarGoogle Scholar
  28. N. Laoutaris and P. Rodriguez. Good Things Come to Those Who (Can) Wait or How to Handle Delay Tolerant Traffic and Make Peace on the Internet. In HotNets-VII'08.Google ScholarGoogle Scholar
  29. N. Laoutaris, M. Sirivianos, X. Yang, and P. Rodriguez. Inter-Datacenter Bulk Transfers with NetSticher. Technical report, 2011. Available at the authors' website.Google ScholarGoogle Scholar
  30. N. Laoutaris, G. Smaragdakis, P. Rodriguez, and R. Sundaram. Delay Tolerant Bulk Data Transfers on the Internet. In SIGMETRICS'09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. Leighton, C. Stein, F. Makedon, E. Tardos, S. Plotkin, and S. Tragoudas. Fast Approximation Algorithms for Multicommodity Flow Problems. In STOC'91. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. S. Mitchell and J. Roy. Pulp Python LP Modeler. code.google.com/p/pulp-or/.Google ScholarGoogle Scholar
  33. H. Pucha, D. G. Andersen, and M. Kaminsky. Exploiting Similarity for Multi-Source Downloads using File Handprints. In NSDI'07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. H. Pucha, M. Kaminsky, D. Andersen, and M. Kozuch. Adaptive file transfers for diverse environments. In USENIX ATC'08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. F. Shahrokhi and D. Matula. The Maximum Concurrent Flow Problem. In Journal of the ACM, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. S. Shalunov and B. Teitelbaum. Qbone Scavenger Service (QBSS) Definition. Internet2 technical report, Mar. 2001.Google ScholarGoogle Scholar
  37. D. Ziegler. Distributed Peta-Scale Data Transfer. www.cs.huji.ac.il/ dhay/IND2011.html.Google ScholarGoogle Scholar

Index Terms

  1. Inter-datacenter bulk transfers with netstitcher

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM SIGCOMM Computer Communication Review
      ACM SIGCOMM Computer Communication Review  Volume 41, Issue 4
      SIGCOMM '11
      August 2011
      480 pages
      ISSN:0146-4833
      DOI:10.1145/2043164
      Issue’s Table of Contents
      • cover image ACM Conferences
        SIGCOMM '11: Proceedings of the ACM SIGCOMM 2011 conference
        August 2011
        502 pages
        ISBN:9781450307970
        DOI:10.1145/2018436

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 August 2011

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader