Azure Data Factory by Example: Practical Implementation for Data Engineers

Swinbank, Richard

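  • Publisher: Apress
  • Publication date: 2024-03-23
  • List price: $2,170
  • VIP price: $2,062 (5% off)
  • Language: English
  • Pages: 421
  • Binding: Quality Paper (also called trade paper)
  • ISBN: 9798868802171
  • ISBN-13: 9798868802171
  • Related categories: Power BI
  • Overseas special-order title (must be checked out separately)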

Product Description

Data engineers who need to hit the ground running will use this book to build skills in Azure Data Factory v2 (ADF). The tutorial-first approach to ADF taken in this book gets you working from the first chapter, explaining key ideas naturally as you encounter them. From creating your first data factory to building complex, metadata-driven nested pipelines, the book guides you through essential concepts in Microsoft's cloud-based ETL/ELT platform. It introduces components indispensable for the movement and transformation of data in the cloud. Then it demonstrates the tools necessary to orchestrate, monitor, and manage those components.
This edition, updated for 2024, includes the latest developments to the Azure Data Factory service:
  • Enhancements to existing pipeline activities such as Execute Pipeline, along with the introduction of new activities such as Script, and activities designed specifically to interact with Azure Synapse Analytics.
  • Improvements to flow control provided by activity deactivation and the Fail activity.
  • The introduction of reusable data flow components such as user-defined functions and flowlets.
  • Extensions to integration runtime capabilities including Managed VNet support.
  • The ability to trigger pipelines in response to custom events.
  • Tools for implementing boilerplate processes such as change data capture and metadata-driven data copying.

What You Will Learn
  • Create pipelines, activities, datasets, and linked services
  • Build reusable components using variables, parameters, and expressions
  • Move data into and around Azure services automatically
  • Transform data natively using ADF data flows and Power Query data wrangling
  • Master flow-of-control and triggers for tightly orchestrated pipeline execution
  • Publish and monitor pipelines easily and with confidence
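
To make the first two items concrete, here is a minimal sketch of how a pipeline, an activity, dataset references, and a parameter expression fit together in ADF's JSON authoring format. It is not taken from the book: the pipeline, activity, dataset, and parameter names (`IngestRawFiles`, `CopyRawFiles`, `RawFilesDataset`, `StagingTableDataset`, `sourceFolder`, `folder`) are hypothetical, and the datasets and their linked services are assumed to be defined elsewhere in the factory.

```python
import json


def make_copy_pipeline(name: str) -> dict:
    """Build a dict shaped like an ADF pipeline definition with one Copy activity.

    All resource names below are illustrative assumptions, not real resources.
    """
    return {
        "name": name,
        "properties": {
            # Pipeline parameter supplied by the caller or trigger at run time.
            "parameters": {"sourceFolder": {"type": "String"}},
            "activities": [
                {
                    "name": "CopyRawFiles",
                    "type": "Copy",
                    "inputs": [
                        {
                            "referenceName": "RawFilesDataset",
                            "type": "DatasetReference",
                            # An expression passes the pipeline parameter
                            # through to a (hypothetical) dataset parameter.
                            "parameters": {
                                "folder": {
                                    "value": "@pipeline().parameters.sourceFolder",
                                    "type": "Expression",
                                }
                            },
                        }
                    ],
                    "outputs": [
                        {
                            "referenceName": "StagingTableDataset",
                            "type": "DatasetReference",
                        }
                    ],
                    "typeProperties": {
                        "source": {"type": "DelimitedTextSource"},
                        "sink": {"type": "AzureSqlSink"},
                    },
                }
            ],
        },
    }


pipeline = make_copy_pipeline("IngestRawFiles")
print(json.dumps(pipeline, indent=2))
```

The script only prints the JSON document; publishing it to a data factory (through the authoring UI, an ARM template, or an SDK) is a separate step not shown here.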

Who This Book Is For
Data engineers and ETL developers taking their first steps in Azure Data Factory, SQL Server Integration Services users making the transition toward doing ETL in Microsoft's Azure cloud, and SQL Server database administrators involved in data warehousing and ETL operations.

About the Author

Richard Swinbank is a data engineer and Microsoft Data Platform MVP. He specializes in building and automating analytics platforms using Microsoft technologies from the SQL Server stack to the Azure cloud. He is a fervent advocate of DataOps, with a technical focus on bringing automation to both analytics development and operations. An active member of the data community and keen knowledge-sharer, Richard is a volunteer, organizer, speaker, blogger, open source contributor, and author. He holds a PhD in computer science from the University of Birmingham (UK).

