Apple Academic Press, 2021. — 426 p. — ISBN13: 978-0-42932-173-3.
The authors provide an understanding of big data and MapReduce by clearly presenting the basic terminologies and concepts. They have employed over 100 illustrations and many worked-out examples to convey the concepts and methods used in big data, the inner workings of MapReduce, and single node/multi-node installation on physical/virtual machines. This book covers almost all the necessary information on Hadoop MapReduce for most online certification exams. Upon completing this book, readers will find it easy to understand other big data processing tools such as Spark, Storm, etc.
Ultimately, readers will be able to
understand what big data is and the factors that are involved
understand the inner workings of MapReduce, which is essential for certification exams
learn the features and weaknesses of MapReduce
set up Hadoop clusters with 100s of physical/virtual machines
create a virtual machine in AWS
write MapReduce with Eclipse in a simple way
understand other big data processing tools and their applications
Big Data
Hadoop Framework
Hadoop 1.2.1 Installation
Hadoop Ecosystem
Hadoop 2.7.0
Hadoop 2.7.0 Installation
Data Science
AppendixesPublic Datasets
MapReduce Exercise
Case Study: Application Development for NYSE Dataset
Web References