Polybase Revealed: Data Virtualization with SQL Server, Hadoop, Apache Spark, and Beyond (Paperback)

Feasel, Kevin

  • 出版商: Apress
  • 出版日期: 2019-12-21
  • 定價: $1,100
  • 售價: 9.5$1,045
  • 語言: 英文
  • 頁數: 311
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1484254600
  • ISBN-13: 9781484254608
  • 相關分類: HadoopMSSQLSparkSQL
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

Harness the power of PolyBase data virtualization software to make data from a variety of sources easily accessible through SQL queries while using the T-SQL skills you already know and have mastered.

PolyBase Revealed shows you how to use the PolyBase feature of SQL Server 2019 to integrate SQL Server with Azure Blob Storage, Apache Hadoop, other SQL Server instances, Oracle, Cosmos DB, Apache Spark, and more. You will learn how PolyBase can help you reduce storage and other costs by avoiding the need for ETL processes that duplicate data in order to make it accessible from one source. PolyBase makes SQL Server into that one source, and T-SQL is your golden ticket. The book also covers PolyBase scale-out clusters, allowing you to distribute PolyBase queries among several SQL Server instances, thus improving performance.

With great flexibility comes great complexity, and this book shows you where to look when queries fail, complete with coverage of internals, troubleshooting techniques, and where to find more information on obscure cross-platform errors. Data virtualization is a key target for Microsoft with SQL Server 2019. This book will help you keep your skills current, remain relevant, and build new business and career opportunities around Microsoft’s product direction.

 

What You Will Learn

  • Install and configure PolyBase as a stand-alone service, or unlock its capabilities with a scale-out cluster
  • Understand how PolyBase interacts with outside data sources while presenting their data as regular SQL Server tables
  • Write queries combining data from SQL Server, Apache Hadoop, Oracle, Cosmos DB, Apache Spark, and more
  • Troubleshoot PolyBase queries using SQL Server Dynamic Management Views
  • Tune PolyBase queries using statistics and execution plans
  • Solve common business problems, including "cold storage" of infrequently accessed data and simplifying ETL jobs

商品描述(中文翻譯)

利用 PolyBase 數據虛擬化軟體的強大功能,透過 SQL 查詢輕鬆存取來自各種來源的數據,同時使用您已經熟悉且掌握的 T-SQL 技能。

《PolyBase 揭秘》展示了如何使用 SQL Server 2019 的 PolyBase 功能,將 SQL Server 與 Azure Blob Storage、Apache Hadoop、其他 SQL Server 實例、Oracle、Cosmos DB、Apache Spark 等整合。您將學習到 PolyBase 如何幫助您減少存儲和其他成本,避免需要 ETL 過程來複製數據以便從一個來源存取。PolyBase 將 SQL Server 變成了這個唯一的來源,而 T-SQL 則是您的金鑰。本書還介紹了 PolyBase 的擴展集群,使您能夠在多個 SQL Server 實例之間分佈 PolyBase 查詢,從而提高性能。

隨著極大的靈活性,也帶來了極大的複雜性,本書將向您展示查詢失敗時應該從何處入手,包括內部結構、故障排除技巧以及在不同平台之間出現的複雜錯誤的更多資訊。數據虛擬化是 Microsoft 在 SQL Server 2019 中的一個重要目標。本書將幫助您保持技能的時效性,保持相關性,並圍繞 Microsoft 的產品方向建立新的商業和職業機會。

您將學到以下內容:

- 安裝和配置 PolyBase 作為獨立服務,或通過擴展集群發揮其功能
- 了解 PolyBase 如何與外部數據源交互,同時將其數據呈現為常規的 SQL Server 表格
- 撰寫結合來自 SQL Server、Apache Hadoop、Oracle、Cosmos DB、Apache Spark 等的數據的查詢
- 使用 SQL Server 動態管理視圖來排除 PolyBase 查詢的問題
- 使用統計數據和執行計劃調整 PolyBase 查詢
- 解決常見的業務問題,包括對不經常訪問的數據進行「冷存儲」和簡化 ETL 作業。

作者簡介

Kevin Feasel is a Microsoft Data Platform MVP and CTO at Envizage where he specializes in T-SQL and R development, forcing Spark clusters to do his bidding, fighting with Kafka, and pulling rabbits out of hats on demand. He is the lead curator at Curated SQL (curatedsql.com). A resident of Durham, North Carolina, USA, Kevin can be found cycling the trails along the Triangle whenever the weather is nice enough.

 

 

 

作者簡介(中文翻譯)

Kevin Feasel 是一位微軟數據平台 MVP,也是 Envizage 的首席技術官。他專注於 T-SQL 和 R 開發,並擅長控制 Spark 集群、使用 Kafka 進行數據處理,以及根據需求隨時拿出驚人的解決方案。他是 Curated SQL (curatedsql.com) 的首席編輯。Kevin 目前居住在美國北卡羅來納州的杜倫市,只要天氣允許,你可以在三角洲的自行車道上找到他。