Big Data Analytics with R and Hadoop (Paperback)

Vignesh Prajapati

Product Description

If you're an R developer looking to harness the power of big data analytics with Hadoop, then this book tells you everything you need to integrate the two. You'll end up capable of building a data analytics engine with huge potential.

Overview

  • Write Hadoop MapReduce within R
  • Learn data analytics with R and the Hadoop platform
  • Handle HDFS data within R
  • Understand Hadoop streaming with R
  • Encode and enrich datasets in R

In Detail

Big data analytics is the process of examining large amounts of data of a variety of types to uncover hidden patterns, unknown correlations, and other useful information. Such information can provide competitive advantages over rival organizations and result in business benefits, such as more effective marketing and increased revenue. New methods of working with big data, such as Hadoop and MapReduce, offer alternatives to traditional data warehousing.

Big Data Analytics with R and Hadoop focuses on techniques for integrating R and Hadoop using tools such as RHIPE and RHadoop. With these tools you can build a powerful data analytics engine that runs analytics algorithms over large-scale datasets in a scalable manner, combining the data analytics operations of R with Hadoop's MapReduce and HDFS.

You will start with the installation and configuration of R and Hadoop. Next, you will work through various practical data analytics examples with R and Hadoop. Finally, you will learn how to import and export data between R and various data sources. Big Data Analytics with R and Hadoop will also give you a clear understanding of the R and Hadoop connectors RHIPE, RHadoop, and Hadoop streaming.
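For readers new to the model, the MapReduce flow behind Hadoop streaming can be sketched roughly as follows. This is a minimal local simulation written in Python rather than R, and it is not taken from the book; the function names `mapper`, `reducer`, and `run_local` are hypothetical. In an actual Hadoop streaming job, the mapper and reducer would be separate scripts that read standard input and write tab-separated key/value lines, with Hadoop performing the sort between the two phases.

```python
from itertools import groupby
from operator import itemgetter


def mapper(lines):
    """Emit one (word, 1) pair per word, as a streaming mapper would."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def reducer(pairs):
    """Sum the counts for each key; assumes pairs arrive sorted by key."""
    for word, group in groupby(pairs, key=itemgetter(0)):
        yield word, sum(count for _, count in group)


def run_local(lines):
    """Simulate map -> shuffle/sort -> reduce in a single process."""
    shuffled = sorted(mapper(lines), key=itemgetter(0))
    return dict(reducer(shuffled))


if __name__ == "__main__":
    # Hypothetical sample input, standing in for lines read from HDFS.
    sample = ["big data with R", "big data with Hadoop"]
    for word, count in run_local(sample).items():
        print(f"{word}\t{count}")
```

The sort in `run_local` stands in for Hadoop's shuffle phase, which guarantees that all values for a given key reach the same reducer together.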

What you will learn from this book

  • Integrate R and Hadoop via RHIPE, RHadoop, and Hadoop streaming
  • Develop and run a MapReduce application with R and Hadoop
  • Handle HDFS data from within R using RHIPE and RHadoop
  • Run Hadoop streaming and MapReduce with R
  • Import and export data between R and various data sources

Approach

Big Data Analytics with R and Hadoop is a tutorial-style book that focuses on the powerful big data tasks that can be achieved by integrating R and Hadoop.

Who this book is written for

This book is ideal for R developers who are looking for a way to perform big data analytics with Hadoop. It is also aimed at those who know Hadoop and want to build intelligent applications over big data with R packages. A basic knowledge of R is helpful.
