Practical Data Science: A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets
暫譯: 實用數據科學:將數據湖轉化為商業資產的技術堆疊建設指南

Andreas François Vermeulen

  • 出版商: Apress
  • 出版日期: 2018-02-22
  • 售價: $2,450
  • 貴賓價: 9.5$2,328
  • 語言: 英文
  • 頁數: 805
  • 裝訂: Paperback
  • ISBN: 1484230531
  • ISBN-13: 9781484230534
  • 相關分類: Data Science
  • 海外代購書籍(需單獨結帳)

商品描述

Learn how to build a data science technology stack and perform good data science with repeatable methods. You will learn how to turn data lakes into business assets.

The data science technology stack demonstrated in Practical Data Science is built from components in general use in the industry. Data scientist Andreas Vermeulen demonstrates in detail how to build and provision a technology stack to yield repeatable results. He shows you how to apply practical methods to extract actionable business knowledge from data lakes consisting of data from a polyglot of data types and dimensions.

What You'll Learn
  • Become fluent in the essential concepts and terminology of data science and data engineering 
  • Build and use a technology stack that meets industry criteria
  • Master the methods for retrieving actionable business knowledge
  • Coordinate the handling of polyglot data types in a data lake for repeatable results
Who This Book Is For

Data scientists and data engineers who are required to convert data from a data lake into actionable knowledge for their business, and students who aspire to be data scientists and data engineers

商品描述(中文翻譯)

學習如何建立數據科學技術堆疊,並使用可重複的方法進行良好的數據科學。您將學會如何將數據湖轉化為商業資產。

在《實用數據科學》中展示的數據科學技術堆疊是由業界普遍使用的組件構成。數據科學家 Andreas Vermeulen 詳細演示了如何構建和配置技術堆疊,以產生可重複的結果。他向您展示如何應用實用方法,從由多種數據類型和維度組成的數據湖中提取可行的商業知識。

您將學到的內容:
- 熟悉數據科學和數據工程的基本概念和術語
- 建立並使用符合業界標準的技術堆疊
- 掌握檢索可行商業知識的方法
- 協調在數據湖中處理多種數據類型以獲得可重複的結果

本書適合對象:
需要將數據湖中的數據轉化為可行知識的數據科學家和數據工程師,以及渴望成為數據科學家和數據工程師的學生。