Standard Toolkits for Hadoop and Analytics

Koitzsch, Kerry

doi:10.1007/978-1-4842-1910-2_3

Kerry Koitzsch²

4702 Accesses

Abstract

In this chapter, we take a look at the necessary ingredients for a BDA system: the standard libraries and toolkits most useful for building BDAs. We describe an example system (which we develop throughout the remainder of the book) using standard toolkits from the Hadoop and Spark ecosystems. We also use other analytical toolkits, such as R and Weka, with mainstream development components such as Ant, Maven, npm, pip, Bower, and other system building tools. "Glueware components" such as Apache Camel, Spring Framework, Spring Data, Apache Kafka, Apache Tika, and others can be used to create a Hadoop-based system appropriate for a variety of applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 16.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Sunnyvale, California, USA
Kerry Koitzsch

Authors

Kerry Koitzsch
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Koitzsch, K. (2017). Standard Toolkits for Hadoop and Analytics. In: Pro Hadoop Data Analytics . Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-1910-2_3

Download citation

DOI: https://doi.org/10.1007/978-1-4842-1910-2_3
Published: 30 December 2016
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-1909-6
Online ISBN: 978-1-4842-1910-2
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)

Publish with us

Policies and ethics