Big Data Analytics with Hadoop 3
Dive deep into Big Data concepts, platforms, analytics and their applications using the power of Hadoop 3 About This Book * Leverage the power of Hadoop 3 to build effective big data analytics solutions on-premise and on cloud * Integrate Hadoop with other big data tools such as R, Python, Apache Spark and Apache Flink * Get deep insights from your Big Data using Hadoop 3 with the help of real-world examples Who This Book Is For If you are looking to build high-performance analytics solutions for your enterprise or business using Hadoop 3's powerful features, this book is for you. If you're new to Big Data analytics, this book will also help you. A basic understanding of the Java programming language is required for this book. What You Will Learn * Explore the new features of Hadoop 3 along with HDFS, YARN and MapReduce. * Get well-versed with the analytical capabilities of Hadoop ecosystem using practical examples * Integrate Hadoop with R and Python for more efficient big data processing * Learn to use Hadoop with Apache Spark and Apache Flink for real-time data analytics * Setup a Hadoop cluster on AWS cloud * Perform Big Data Analytics on AWS using Elastic Map Reduce In Detail Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. This book shows you how to do just that, with the help of practical examples. You will start with getting a quick overview of the new features introduced in Hadoop 3 along with HDFS, MapReduce and YARN , and how they enable faster, more efficient big data processing. Further, you will learn how to integrate Hadoop with the open source tools such as Python and R to analyse and visualise data and to perform statistical computing on Big Data. The book will also show you how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing, and demonstrates how to use Hadoop to build analytics solutions on the cloud. Finally, you will learn to build an end to end pipeline to perform Big Data Analytics using practical use cases. By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform Big Data analytics and get insights from your Big Data without any hassle.