Apache Hive Essentials

Dayong Du

  • 出版商: Packt Publishing
  • 出版日期: 2015-02-28
  • 售價: $1,710
  • 貴賓價: 9.5$1,625
  • 語言: 英文
  • 頁數: 145
  • 裝訂: Paperback
  • ISBN: 1783558571
  • ISBN-13: 9781783558575
  • 相關分類: Hadoop
  • 海外代購書籍(需單獨結帳)

買這商品的人也買了...

商品描述

Immerse yourself on a fantastic journey to discover the attributes of big data by using Hive

About This Book

  • Discover how Hive can coexist and work with other tools in the Hadoop ecosystem to create big data solutions
  • Grasp the skills needed, learn the best practices, and avoid the pitfalls in writing efficient Hive queries to analyze the big data
  • Create an environment to analyze big data using practical, example-oriented scenarios

Who This Book Is For

If you are a data analyst, developer, or simply someone who wants to use Hive to explore and analyze data in Hadoop, this is the book for you. Whether you are new to big data or an expert, with this book, you will be able to master both the basic and the advanced features of Hive. Since Hive is an SQL-like language, some previous experience with the SQL language and databases is useful to have a better understanding of this book.

What You Will Learn

  • Create and set up the Hive environment
  • Discover how to use Hive's definition language to describe data
  • Discover interesting data by joining and filtering datasets in Hive
  • Transform data by using Hive sorting, ordering, and functions
  • Aggregate and sample data in different ways
  • Boost Hive query performance and enhance data security in Hive
  • Customize Hive to your needs by using user-defined functions and integrate it with other tools

In Detail

In this book, we prepare you for your journey into big data by firstly introducing you to backgrounds in the big data domain along with the process of setting up and getting familiar with your Hive working environment. Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skill in using the Hive language in an efficient manner. Towards the end, the book focuses on advanced topics such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey.

By the end of the book, you will be familiar with Hive and able to work efficiently to find solutions to big data problems.

商品描述(中文翻譯)

深入探索使用Hive發現大數據屬性的奇妙旅程。

關於本書:
- 發現Hive如何與Hadoop生態系統中的其他工具共存並協同工作,創建大數據解決方案。
- 掌握所需技能,學習最佳實踐,避免在撰寫高效Hive查詢以分析大數據時遇到的問題。
- 通過實際的以例為導向的場景創建一個分析大數據的環境。

本書適合對象:
- 如果您是數據分析師、開發人員或只是想使用Hive在Hadoop中探索和分析數據的人,這本書適合您。
- 無論您是新手還是專家,通過本書,您將能夠掌握Hive的基本和高級功能。
- 由於Hive是一種類似SQL的語言,具有SQL語言和數據庫的一些先前經驗將有助於更好地理解本書。

您將學到什麼:
- 創建和設置Hive環境。
- 使用Hive的定義語言描述數據。
- 通過在Hive中聯接和過濾數據集來發現有趣的數據。
- 使用Hive的排序、排序和函數來轉換數據。
- 以不同方式聚合和抽樣數據。
- 提高Hive查詢性能並增強Hive中的數據安全性。
- 通過使用自定義函數自定義Hive並將其與其他工具集成。

詳細內容:
本書首先介紹了大數據領域的背景,並引導您建立和熟悉Hive工作環境,為您進入大數據之旅做好準備。接下來,本書通過示例指導您發現和轉換大數據的值。它還提高了您以高效方式使用Hive語言的技能。最後,本書專注於Hive中的性能、安全性和擴展等高級主題,將引導您在這個有價值的大數據之旅中進行令人興奮的冒險。

通過閱讀本書,您將熟悉Hive並能夠高效地解決大數據問題。