Delta Lake: The Definitive Guide: Modern Data Lakehouse Architectures with Data Lakes
暫譯: Delta Lake:權威指南:現代數據湖屋架構與數據湖
Lee, Denny, Wentling, Tristen, Haines, Scott
- 出版商: O'Reilly
- 出版日期: 2024-12-10
- 定價: $2,800
- 售價: 9.5 折 $2,660
- 貴賓價: 9.0 折 $2,520
- 語言: 英文
- 頁數: 380
- 裝訂: Quality Paper - also called trade paper
- ISBN: 1098151941
- ISBN-13: 9781098151942
-
相關分類:
大數據 Big-data
立即出貨 (庫存 < 3)
買這商品的人也買了...
-
$454InfluxDB 原理與實戰 -
Kubeflow for Machine Learning: From Lab to Production$1,786$1,692 -
$500事件流實戰 -
MongoDB 技術手冊, 3/e (MongoDB: The Definitive Guide: Powerful and Scalable Data Storage, 3/e)$780$616 -
從 OS 等級探究:Redis 運作原理程式逐行講解$880$695 -
Advanced Python Programming : Accelerate your Python programs using proven techniques and design patterns, 2/e (Paperback)$1,800$1,710 -
Time Series Analysis with Python Cookbook: Practical recipes for exploratory data analysis, data preparation, forecasting, and model evaluation (Paperback)$1,960$1,862 -
Cloud Finops: Collaborative, Real-Time Cloud Value Decision Making (Paperback)$2,641$2,502 -
$449跨數據中心機器學習:賦能多雲智能數算融合 -
使用 GitOps 實現 Kubernetes 的持續部署:模式、流程及工具$714$678 -
Elasticsearch 數據搜索與分析實戰$599$569 -
客戶留存數據分析與預測$768$730 -
Kafka 實戰$539$512 -
資料科學 SQL 工作術 – 以 MySQL 為例與情境式 ChatGPT 輔助學習 (SQL for Data Scientists - A Beginner’s Guide for Building Datasets for Analysis)$630$498 -
Learning Github Actions: Automation and Integration of CI/CD with Github (Paperback)$2,185$2,070 -
資料視覺化|使用 Python 與 JavaScript, 2/e (Data Visualization with Python and JavaScript: Scrape, Clean, Explore, and Transform Your Data, 2/e)$880$695 -
Practical Machine Learning on Databricks: Seamlessly transition ML models and MLOps on Databricks (Paperback)$1,750$1,663 -
資料治理技術手冊 (Data Governance: The Definitive Guide)$580$458 -
基於 GPT-3、ChatGPT、GPT-4 等 Transformer 架構的自然語言處理$599$569 -
數據湖倉$299$284 -
Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale (Paperback)$2,356$2,232 -
Apache Airflow Best Practices: A practical guide to orchestrating data workflow with Apache Airflow (Paperback)$1,700$1,615 -
CI/CD Design Patterns: Design and implement CI/CD using proven design patterns (Paperback)$1,650$1,568 -
$469GitHub Copilot 編程指南 -
本地端 Ollama × LangChain × LangGraph × LangSmith 開發手冊:打造 RAG、Agent、SQL 應用$750$593
相關主題
商品描述
Ready to simplify the process of building data lakehouses and data pipelines at scale? In this practical guide, learn how Delta Lake is helping data engineers, data scientists, and data analysts overcome key data reliability challenges with modern data engineering and management techniques.
Authors Denny Lee, Tristen Wentling, Scott Haines, and Prashanth Babu (with contributions from Delta Lake maintainer R. Tyler Croy) share expert insights on all things Delta Lake--including how to run batch and streaming jobs concurrently and accelerate the usability of your data. You'll also uncover how ACID transactions bring reliability to data lakehouses at scale.
This book helps you:
- Understand key data reliability challenges and how Delta Lake solves them
- Explain the critical role of Delta transaction logs as a single source of truth
- Learn the Delta Lake ecosystem with technologies like Apache Flink, Kafka, and Trino
- Architect data lakehouses with the medallion architecture
- Optimize Delta Lake performance with features like deletion vectors and liquid clustering
商品描述(中文翻譯)
準備好簡化大規模建構資料湖屋和資料管道的過程了嗎?在這本實用指南中,了解 Delta Lake 如何幫助資料工程師、資料科學家和資料分析師克服現代資料工程和管理技術中的關鍵資料可靠性挑戰。
作者 Denny Lee、Tristen Wentling、Scott Haines 和 Prashanth Babu(並有 Delta Lake 維護者 R. Tyler Croy 的貢獻)分享了有關 Delta Lake 的專家見解,包括如何同時運行批次和串流作業,並加速資料的可用性。您還將發現 ACID 交易如何在大規模資料湖屋中帶來可靠性。
這本書幫助您:
- 理解關鍵的資料可靠性挑戰以及 Delta Lake 如何解決這些挑戰
- 解釋 Delta 交易日誌作為單一真相來源的關鍵角色
- 學習 Delta Lake 生態系統,了解 Apache Flink、Kafka 和 Trino 等技術
- 使用獎牌架構設計資料湖屋
- 利用刪除向量和液態聚類等功能優化 Delta Lake 的性能