ETL with Azure Cookbook: Practical recipes for building modern ETL solutions to load and transform data from any source

Christian Coté, Madina Saitakhmetova, Matija Lah

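  • Publisher: Packt Publishing
  • Publication date: 2020-09-30
  • List price: $1,830
  • VIP price: $1,739 (5% off)
  • Language: English
  • Pages: 446
  • Binding: Quality Paper (also called trade paper)
  • ISBN: 1800203314
  • ISBN-13: 9781800203310
  • Related categories: Microsoft Azure
  • Availability: ordered from the supplier upon purchase (about 3 to 4 weeks)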

Product Description

ETL is one of the most common and tedious procedures for moving and processing data from one database to another. With the help of this book, you will be able to speed up the process by designing effective ETL solutions using the Azure services available for handling and transforming any data to suit your requirements.

 

With this cookbook, you'll become well versed in all the features of SQL Server Integration Services (SSIS) to perform data migration and ETL tasks that integrate with Azure. You'll learn how to transform data in Azure and understand how legacy systems perform ETL on-premises using SSIS. Later chapters will get you up to speed with connecting and retrieving data from SQL Server 2019 Big Data Clusters, and even show you how to extend and customize the SSIS toolbox using custom-developed tasks and transforms. This ETL book also contains practical recipes for moving and transforming data with Azure services, such as Data Factory and Azure Databricks, and lets you explore various options for migrating SSIS packages to Azure. Toward the end, you'll find out how to profile data in the cloud and automate service creation with Business Intelligence Markup Language (BIML).

 

By the end of this book, you'll have developed the skills you need to create and automate ETL solutions on-premises as well as in Azure.

About the Authors

Christian Coté is an IT professional with more than 15 years of experience working on data warehouse, big data, and business intelligence projects. Christian has developed expertise in data warehousing and data lakes over the years and has designed many ETL/BI processes using a range of tools on multiple platforms. He has presented at several conferences and code camps. He currently co-leads the SQL Server PASS chapter. He is also a Microsoft Data Platform Most Valuable Professional (MVP).

Matija Lah has more than 18 years of experience working with Microsoft SQL Server, mostly architecting data-centric solutions in the legal domain. His contributions to the SQL Server community earned him the Microsoft MVP award (Data Platform) from 2007 to 2017/2018. He spends most of his time on projects involving advanced information management and natural language processing, but often finds time to speak at Microsoft SQL Server events, where he loves to share his experience with the SQL Server platform.

Madina Saitakhmetova is a developer specializing in BI. She has been in IT for 15 years, working with Microsoft SQL, .NET, Microsoft BI, and Azure, and building BI solutions for medical, educational, and engineering companies. Her adventure with Microsoft BI began with Analysis Services and SSIS, and in later years she has been building her expertise in ETL/ELT, both on-premises and in the cloud. Finding patterns, automating processes, and making BI teams work more efficiently are the challenges that drive her. During the past few years, BIML has become an important part of her work, improving both the efficiency and the quality of that work.

Table of Contents

  1. Getting Started with Azure and SSIS 2019
  2. Introducing ETL
  3. Creating and Using SQL Server 2019 Big Data Clusters
  4. Azure Data Integration
  5. Extending SSIS with Custom Tasks and Transformations
  6. Azure Data Factory
  7. Azure Databricks
  8. SSIS Cloud Migration Strategies
  9. Profiling Data in Azure
  10. Manage SSIS and Azure Data Factory with BIML
