Data Engineering with Alteryx: Helping data engineers apply DataOps practices with Alteryx

Houghton, Paul

  • 出版商: Packt Publishing
  • 出版日期: 2022-06-30
  • 售價: $1,800
  • 貴賓價: 9.5$1,710
  • 語言: 英文
  • 頁數: 366
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1803236485
  • ISBN-13: 9781803236483
  • 下單後立即進貨 (約3~4週)

商品描述

Build and deploy data pipelines with Alteryx by applying practical DataOps principles

Key Features

• Learn DataOps principles to build data pipelines with Alteryx
• Build robust data pipelines with Alteryx Designer
• Use Alteryx Server and Alteryx Connect to share and deploy your data pipelines

Book Description

Alteryx is a GUI-based development platform for data analytic applications.

Data Engineering with Alteryx will help you leverage Alteryx's code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have.

This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You'll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you'll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process.

By the end of this Alteryx book, you'll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.

What you will learn

• Build a working pipeline to integrate an external data source
• Develop monitoring processes for the pipeline example
• Understand and apply DataOps principles to an Alteryx data pipeline
• Gain skills for data engineering with the Alteryx software stack
• Work with spatial analytics and machine learning techniques in an Alteryx workflow Explore Alteryx workflow deployment strategies using metadata validation and continuous integration
• Organize content on Alteryx Server and secure user access

Who this book is for

If you're a data engineer, data scientist, or data analyst who wants to set up a reliable process for developing data pipelines using Alteryx, this book is for you. You'll also find this book useful if you are trying to make the development and deployment of datasets more robust by following the DataOps principles. Familiarity with Alteryx products will be helpful but is not necessary.

商品描述(中文翻譯)

使用Alteryx建立和部署數據管道,並應用實用的DataOps原則。

主要特點:
- 學習使用Alteryx建立數據管道的DataOps原則
- 使用Alteryx Designer建立強大的數據管道
- 使用Alteryx Server和Alteryx Connect共享和部署數據管道

書籍描述:
Alteryx是一個基於GUI的數據分析應用開發平台。

《使用Alteryx進行數據工程》將幫助您充分利用Alteryx的無代碼特性,提高開發速度,同時仍能充分發揮您的代碼技能。

本書將教您DataOps的原則以及如何在Alteryx軟件堆棧中應用這些原則。您將使用Alteryx Designer建立數據管道,並加入錯誤處理和數據驗證,以確保數據集的可靠性。接下來,您將從原始數據中提取數據管道,將其轉換為強大的數據集,並按照持續集成流程發布到Alteryx Server。

通過閱讀本書,您將能夠建立驗證數據集、監控工作流程性能、管理訪問權限並推廣數據源使用的系統。

您將學到:
- 建立一個工作管道,以整合外部數據源
- 開發管道示例的監控流程
- 理解並應用DataOps原則到Alteryx數據管道
- 獲得使用Alteryx軟件堆棧進行數據工程的技能
- 在Alteryx工作流程中使用空間分析和機器學習技術
- 使用元數據驗證和持續集成探索Alteryx工作流程部署策略
- 在Alteryx Server上組織內容並保護用戶訪問權限

本書適合對使用Alteryx建立可靠的數據管道的數據工程師、數據科學家或數據分析師。如果您希望按照DataOps原則使數據集的開發和部署更加強大,本書也對您有所幫助。熟悉Alteryx產品將有所幫助,但不是必需的。

目錄大綱

1. Getting Started with Alteryx
2. Data Engineering with Alteryx
3. DataOps and Its Benefits
4. Sourcing the Data
5. Data Processing and Transformations
6. Destination Management
7. Extracting Value
8. Beginning Advanced Analytics
9. Testing Workflows and Outputs
10. Monitoring DataOps and Managing Changes
11. Securing and Managing Access
12. Making Data Easy to Use and Discoverable with Alteryx
13. Conclusion

目錄大綱(中文翻譯)

1. Alteryx 入門指南
2. 使用 Alteryx 進行資料工程
3. DataOps 及其優勢
4. 資料來源
5. 資料處理和轉換
6. 目的地管理
7. 提取價值
8. 開始進階分析
9. 測試工作流程和輸出
10. 監控 DataOps 和管理變更
11. 保護和管理存取權限
12. 使用 Alteryx 輕鬆使用和發現資料
13. 結論