Data Quality Fundamentals: A Practitioner's Guide to Building Trustworthy Data Pipelines (Paperback)
暫譯: 數據質量基礎:建立可信數據管道的實務指南 (平裝本)
Moses, Barr, Gavish, Lior, Vorwerck, Molly
- 出版商: O'Reilly
- 出版日期: 2022-10-11
- 定價: $2,290
- 售價: 9.5 折 $2,176
- 貴賓價: 9.0 折 $2,061
- 語言: 英文
- 頁數: 308
- 裝訂: Quality Paper - also called trade paper
- ISBN: 1098112040
- ISBN-13: 9781098112042
-
相關分類:
Data-visualization
-
相關翻譯:
資料品質管理:資料可靠性與資料品質問題解決之道 (簡中版)
立即出貨
買這商品的人也買了...
-
無瑕的程式碼-敏捷軟體開發技巧守則 (Clean Code: A Handbook of Agile Software Craftsmanship)$580$452 -
無瑕的程式碼 番外篇-專業程式設計師的生存之道 (The Clean Coder: A Code of Conduct for Professional Programmers)
$360$281 -
Specification by Example 中文版:團隊如何交付正確的軟體 (Specification by Example: How Successful Teams Deliver the Right Software)$420$357 -
The Hundred-Page Machine Learning Book (paperback)$1,520$1,490 -
$2,124Architecture Patterns with Python: Enabling Test-Driven Development, Domain-Driven Design, and Event-Driven Microservices -
$1,935Fighting Churn with Data: The Science and Strategy of Customer Retention (Paperback) -
Learn Kubernetes in a Month of Lunches$2,150$2,043 -
大規模重構|奪回源碼庫的控制權 (Refactoring at Scale: Regaining Control of Your Codebase)$580$458 -
行銷 5.0:科技與人性完美融合時代的全方位戰略,運用 MarTech,設計顧客旅程,開啟數位消費新商機$450$383 -
GA 到 GA4: 掌握網站數據分析新工具的技術原理與商業思維(書況差限門市銷售)$500$350 -
Notion 人生管理術:從0開始,打造專屬自己的 All in One 高效數位系統$330$281 -
ACCELERATE:精益軟體與 DevOps 背後的科學 (Accelerate: The Science of Lean Software and DevOps: Building and Scaling High Performing Technology Organizations)$499$394 -
Introduction to Algorithms, 4/e (Hardcover)$2,190$2,146 -
億萬社長高獲利經營術:電商老闆賣愈少、賺愈多,還能活過零營收的祕密$380$323 -
發黑的香蕉怎麼賣?:從「不需要」變「好想要」!看見、讀完立刻買單的文字技巧$460$391 -
深入淺出 Swift 程式設計 (Head First Swift)$780$616 -
Google 的軟體工程之道|從程式設計經驗中吸取教訓 (Software Engineering at Google)$880$695 -
$2,520Building Evolutionary Architectures: Automated Software Governance, 2/e -
Aws Certified Solutions Architect Study Guide with Online Labs: Associate Saa-C03 Exam$4,750$4,513 -
ClickHouse 性能之巔從架構設計解讀性能之謎$534$507 -
領域驅動設計學習手冊 (Learning Domain-Driven Design)$580$458 -
Notion 最強效應用:卡片盒筆記法 × GTD 時間管理 × 電子手帳 × 數位履歷 × Notion AI$499$394 -
Communication Patterns: A Guide for Developers and Architects (Paperback)$2,052$1,944 -
Notion 全方位管理術:任務管理 × 收支記帳 × 知識筆記 × ChatGPT × Notion AI(iThome鐵人賽系列書)【軟精裝】$650$507 -
Databricks Certified Data Engineer Associate Study Guide: In-Depth Guidance and Practice (Paperback)$2,518$2,385
相關主題
商品描述
Do your product dashboards look funky? Are your quarterly reports stale? Is the dataset you're using broken or just plain wrong? These problems affect almost every team, yet they're usually addressed on an ad hoc basis and in a reactive manner. If you answered yes to any of the questions above, this book is for you.
Many data engineering teams today face the "good pipelines, bad data" problem. It doesn't matter how advanced your data infrastructure is if the data you're piping is bad. In this book, Barr Moses, Lior Gavish, and Molly Vorwerck from the data reliability company Monte Carlo explain how to tackle data quality and trust at scale by leveraging best practices and technologies used by some of the world's most innovative companies.
- Build more trustworthy and reliable data pipelines
- Write scripts to make data checks and identify broken pipelines with data observability
- Program your own data quality monitors from scratch
- Develop and lead data quality initiatives at your company
- Generate a dashboard to highlight your company's key data assets
- Automate data lineage graphs across your data ecosystem
- Build anomaly detectors for your critical data assets
商品描述(中文翻譯)
您的產品儀表板看起來奇怪嗎?您的季度報告過時了嗎?您使用的數據集是壞的還是完全錯誤的?這些問題影響幾乎每個團隊,但通常是以臨時的方式和反應性的方式來解決。如果您對上述任何問題回答是,那麼這本書就是為您而寫的。
如今,許多數據工程團隊面臨著「良好的管道,壞的數據」問題。無論您的數據基礎設施多麼先進,如果您傳輸的數據是壞的,那都沒有意義。在這本書中,來自數據可靠性公司 Monte Carlo 的 Barr Moses、Lior Gavish 和 Molly Vorwerck 解釋了如何利用一些世界上最具創新性的公司的最佳實踐和技術來解決大規模的數據質量和信任問題。
- 建立更值得信賴和可靠的數據管道
- 編寫腳本以進行數據檢查並識別壞的管道,實現數據可觀察性
- 從零開始編寫自己的數據質量監控器
- 在您的公司開發和主導數據質量計劃
- 生成儀表板以突出您公司的關鍵數據資產
- 自動化整個數據生態系統中的數據血緣圖
- 為您的關鍵數據資產建立異常檢測器