Cloudera Administration Handbook (Paperback)

Rohit Menon

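  • Publisher: Packt Publishing
  • Publication date: 2014-07-19
  • Price: $2,180
  • VIP price: $2,071 (5% off)
  • Language: English
  • Pages: 255
  • Binding: Paperback
  • ISBN: 1783558962
  • ISBN-13: 9781783558964
  • Related categories: Hadoop, Big Data
  • Overseas special-order title (must be checked out separately)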



A complete, hands-on guide to building and maintaining large Apache Hadoop clusters using Cloudera Manager and CDH5


  • Understand the CDH architecture and its components and successfully set up a Hadoop cluster
  • Maintain, troubleshoot, and secure your cluster using Cloudera Manager
  • Easy-to-follow administrator’s guide with step-by-step explanations to help you master Apache Hadoop

In Detail

Apache Hadoop is an open-source distributed computing technology that helps users process large volumes of data with relative ease and draw insights from it. Cloudera, with its open-source distribution of Hadoop, has made big data analytics accessible to anyone interested.

This book fully prepares you to be a Hadoop administrator, with special emphasis on Cloudera's CDH. It provides step-by-step instructions on setting up and managing a robust Hadoop cluster running CDH5. This book will also equip you with an understanding of tools such as Cloudera Manager, which is currently being used by many companies to manage Hadoop clusters with hundreds of nodes. You will learn how to set up security using Kerberos. You will also use Cloudera Manager to set up alerts and events that will help you monitor and troubleshoot cluster issues.

What you will learn from this book

  • Understand the Apache Hadoop architecture and the future of distributed processing frameworks
  • Use HDFS and MapReduce for all file-related operations
  • Install and configure CDH to bring up an Apache Hadoop cluster
  • Configure HDFS High Availability and HDFS Federation to prevent single points of failure
  • Install and configure Cloudera Manager to perform administrator operations
  • Implement security by installing and configuring Kerberos for all services in the cluster
  • Add, remove, and rebalance nodes in a cluster using cluster management tools
  • Understand and configure the different backup options to back up your HDFS


An easy-to-follow Apache Hadoop administrator’s guide filled with practical screenshots and explanations for each step and configuration.

Who this book is written for

This book is great for administrators interested in setting up and managing a large Hadoop cluster. If you are an administrator, or want to be an administrator, and you are ready to build and maintain a production-level cluster running CDH5, then this book is for you.

