Abstract
In this chapter, we take a look at the necessary ingredients for a BDA system: the standard libraries and toolkits most useful for building BDAs. We describe an example system (which we develop throughout the remainder of the book) using standard toolkits from the Hadoop and Spark ecosystems. We also use other analytical toolkits, such as R and Weka, with mainstream development components such as Ant, Maven, npm, pip, Bower, and other system building tools. "Glueware components" such as Apache Camel, Spring Framework, Spring Data, Apache Kafka, Apache Tika, and others can be used to create a Hadoop-based system appropriate for a variety of applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2017 Kerry Koitzsch
About this chapter
Cite this chapter
Koitzsch, K. (2017). Standard Toolkits for Hadoop and Analytics. In: Pro Hadoop Data Analytics . Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-1910-2_3
Download citation
DOI: https://doi.org/10.1007/978-1-4842-1910-2_3
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-1909-6
Online ISBN: 978-1-4842-1910-2
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)