HBase Administration Cookbook

Yifeng Jiang

  • 出版商: Packt Publishing
  • 出版日期: 2012-08-17
  • 售價: $1,900
  • 貴賓價: 9.5$1,805
  • 語言: 英文
  • 頁數: 332
  • 裝訂: Paperback
  • ISBN: 1849517142
  • ISBN-13: 9781849517140
  • 相關分類: NoSQL
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

Master HBase configuration and administration for optimum database performance

  • Move large amounts of data into HBase and learn how to manage it efficiently
  • Set up HBase on the cloud, get it ready for production, and run it smoothly with high performance
  • Maximize the ability of HBase with the Hadoop eco-system including HDFS, MapReduce, Zookeeper, and Hive

In Detail

As an Open Source distributed big data store, HBase scales to billions of rows, with millions of columns and sits on top of the clusters of commodity machines. If you are looking for a way to store and access a huge amount of data in real-time, then look no further than HBase.

HBase Administration Cookbook provides practical examples and simple step-by-step instructions for you to administrate HBase with ease. The recipes cover a wide range of processes for managing a fully distributed, highly available HBase cluster on the cloud. Working with such a huge amount of data means that an organized and manageable process is key and this book will help you to achieve that.

The recipes in this practical cookbook start from setting up a fully distributed HBase cluster and moving data into it. You will learn how to use all of the tools for day-to-day administration tasks as well as for efficiently managing and monitoring the cluster to achieve the best performance possible. Understanding the relationship between Hadoop and HBase will allow you to get the best out of HBase so the book will show you how to set up Hadoop clusters, configure Hadoop to cooperate with HBase, and tune its performance.

What you will learn from this book

  • Set up a fully distributed, highly available HBase cluster and load data into it using the normal client API or your own MapReduce job
  • Access data in HBase via HBase Shell or Hive using its SQL-like query language
  • Backup and restore HBase table, along with its data distribution, and move or replicate data between different HBase clusters
  • Gather metrics then show them in graphs, monitor the cluster's status, and get notified if thresholds are exceeded
  • Tune your kernel settings with JVM GC, Hadoop, and HBase configuration to maximize the performance
  • Discover troubleshooting tools and tips in order to avoid the most commonly-found problems with HBase
  • Gain optimum performance with data compression, region splits, and by manually managing compaction
  • Learn advanced configuration and tuning for read and write-heavy clusters

Approach

As part of Packt's cookbook series, each recipe offers a practical, step-by-step solution to common problems found in HBase administration.

Who this book is written for

This book is for HBase administrators, developers, and will even help Hadoop administrators. You are not required to have HBase experience, but are expected to have a basic understanding of Hadoop and MapReduce.

商品描述(中文翻譯)

《HBase管理食譜》是一本實用的書籍,提供了一系列實例和簡單的逐步指示,讓您輕鬆管理HBase。這些食譜涵蓋了在雲端上管理完全分散且高可用的HBase集群的各種流程。處理如此大量的數據意味著組織和可管理的過程至關重要,而本書將幫助您實現這一目標。

這本實用食譜的內容從設置完全分散的HBase集群並將數據移入其中開始。您將學習如何使用各種工具進行日常管理任務,以及有效管理和監控集群以實現最佳性能。了解Hadoop和HBase之間的關係將使您能夠充分利用HBase,因此本書將向您展示如何設置Hadoop集群,配置Hadoop與HBase的協作,並調整其性能。

本書將教您以下內容:
- 設置完全分散且高可用的HBase集群,並使用常規客戶端API或自己的MapReduce作業將數據加載到其中
- 通過HBase Shell或Hive使用類似SQL的查詢語言訪問HBase中的數據
- 備份和還原HBase表,以及其數據分佈,並在不同的HBase集群之間移動或複製數據
- 收集指標並以圖形方式顯示,監視集群的狀態,並在超過閾值時收到通知
- 通過JVM GC、Hadoop和HBase配置調整內核設置,以最大化性能
- 探索故障排除工具和技巧,以避免HBase中最常見的問題
- 通過數據壓縮、區域分割和手動管理合併來獲得最佳性能
- 學習讀寫密集型集群的高級配置和調整

作為Packt食譜系列的一部分,本書中的每個食譜都提供了HBase管理中常見問題的實用、逐步解決方案。

本書適合HBase管理員、開發人員,甚至可以幫助Hadoop管理員。您不需要具備HBase經驗,但需要基本了解Hadoop和MapReduce。