skip to main content
10.1145/3424978.3425051acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsaeConference Proceedingsconference-collections
research-article

Parallel Computing Framework Based on MapReduce and GPU Clusters

Authors Info & Claims
Published:20 October 2020Publication History

ABSTRACT

In recent years, driven by hardware technology, the computing power and programmability of GPUs have been rapidly developed. With the characteristics of highly parallel computing, GPUs are no longer limited to daily graphics processing tasks. It begins to involve a wider range of high-performance generalpurpose computing field. One of the hotspots in the field of highperformance parallel computing is MapReduce, a massive data processing framework. Through inexpensive ordinary computer clusters, we can obtain large-scale data computing capabilities that were previously only owned by expensive large servers. However, most existing MapReduce systems run on CPU clusters, and the computing performance of a single node is limited. Therefore, this paper proposes a parallel computing framework based on GPU cluster and MapReduce, and validates the effectiveness of the framework through experiments. Experiments have proven that our framework can complete the work, and it has a significant speedup for large-scale applications.

References

  1. Mittal S (2016). A survey of techniques for approximate computing[J]. ACM Computing Surveys (CSUR), 48(4), 62.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. CUDA NVIDIA. CUDA Programming Guide.Google ScholarGoogle Scholar
  3. Sanjay Ghemawat, Howard Gobioff and Shun-Tak Leung (2003). The Google File System, SOSP 03, New York, NY, USA, pp. 29--43.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Jeffrey Dean and Sanjay Ghemawat (2008). MapReduce: Simplyfied Data Processing on Large Clusters. Communications of the ACM, vol. 51, pp. 107--113.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Apache Hadoop, http://hadoop.apache.org/.Google ScholarGoogle Scholar
  6. L Shvachko, Hairong Kuang, S Radia, et al. (2010). The Hadoop Distributed File System. IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1--10.Google ScholarGoogle Scholar
  7. HBase, http://hbase.apache.org/.Google ScholarGoogle Scholar
  8. Ranger C, Raghuraman R, Penmetsa A, et al. (2007). Evaluating mapreduce for multi-core and multiprocessor systems[C]. High Performance Computer Architure HPCA. IEEE 13th International Symposium on IEEE, 13--24.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Yang C T, Huang C L and Lin C F (2011). Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters[J]. Computer Physics Communications, 182(1), 266--269.Google ScholarGoogle ScholarCross RefCross Ref
  10. He B, Fang W, Luo Q, et al. (2008). Mars: a MapReduce framework on graphics processors[C]. Proceedings of the 17th international conference on Parallel architectures and compilation techniques ACM, 260--269.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hong C, Chen D, Chen W, et al. (2010). MapCG:writing parallel program protable between CPU and GPU[C]. Proceedings of the 19th international conference on Parallel architectures and compilation techniques. ACM, 217--226.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Xin Miao and Li Hao (2012). An Implementation of GPU Accelerated MapReduce: Using Hadoop with OpenCL for Data- and Compute-Intensive Jobs. In IJCSS, pp. 6--11.Google ScholarGoogle Scholar
  13. Styart J A and Qwens J D (2011). Multi-GPU MapReduce on GPU clusters[C]. Parallel & Distributed Processing Symposium (IPDPS), IEEE International. IEEE, 1068--1079.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Heng Gao, Jie Tang and Gangshan Wu (2013). A MapReduce Computing Framework Based on GPU Cluster. IEEE Conference, High Performance Computing and Communications & Embedded and Ubiquitous Computing, pages 1902--1907.Google ScholarGoogle Scholar
  15. Mengjun Xie, Kyoung-Don Kang and Basaran C (2013). Moim: A Multi-GPU MapReduce Framework. IEEE 16th International Conference on Computational Science and Engineering (CSE), pp.1279, 1286, 3-5 Dec.Google ScholarGoogle Scholar
  16. Yiru Guo, Weiguo Liu, Gong B, Voss G and Muller-Wittig W (2013). GCMR: A GPU Cluster-Based MapReduce Framework for Large- Scale Data Processing. High Performance Computing and Communications & IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), IEEE 10th International Conference on, pp. 580, 586, 13-15 Nov.Google ScholarGoogle Scholar

Index Terms

  1. Parallel Computing Framework Based on MapReduce and GPU Clusters

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        CSAE '20: Proceedings of the 4th International Conference on Computer Science and Application Engineering
        October 2020
        1038 pages
        ISBN:9781450377720
        DOI:10.1145/3424978

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 October 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        CSAE '20 Paper Acceptance Rate179of387submissions,46%Overall Acceptance Rate368of770submissions,48%
      • Article Metrics

        • Downloads (Last 12 months)17
        • Downloads (Last 6 weeks)2

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader