Data Simplification: Taming Information With Open Source Tools

Jules J Berman

  • 出版商: Morgan Kaufmann
  • 出版日期: 2016-03-16
  • 售價: $2,100
  • 貴賓價: 9.5$1,995
  • 語言: 英文
  • 頁數: 398
  • 裝訂: Paperback
  • ISBN: 0128037814
  • ISBN-13: 9780128037812
  • 相關分類: Data Science
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

Data Simplification: Taming Information With Open Source Tools addresses the simple fact that modern data is too big and complex to analyze in its native form. Data simplification is the process whereby large and complex data is rendered usable. Complex data must be simplified before it can be analyzed, but the process of data simplification is anything but simple, requiring a specialized set of skills and tools.

This book provides data scientists from every scientific discipline with the methods and tools to simplify their data for immediate analysis or long-term storage in a form that can be readily repurposed or integrated with other data.

Drawing upon years of practical experience, and using numerous examples and use cases, Jules Berman discusses the principles, methods, and tools that must be studied and mastered to achieve data simplification, open source tools, free utilities and snippets of code that can be reused and repurposed to simplify data, natural language processing and machine translation as a tool to simplify data, and data summarization and visualization and the role they play in making data useful for the end user.

  • Discusses data simplification principles, methods, and tools that must be studied and mastered
  • Provides open source tools, free utilities, and snippets of code that can be reused and repurposed to simplify data
  • Explains how to best utilize indexes to search, retrieve, and analyze textual data
  • Shows the data scientist how to apply ontologies, classifications, classes, properties, and instances to data using tried and true methods

商品描述(中文翻譯)

《資料簡化:利用開源工具掌握資訊》這本書討論了一個簡單的事實,即現代資料過於龐大和複雜,無法以其原始形式進行分析。資料簡化是將大型和複雜的資料轉換為可用的過程。在進行分析之前,必須對複雜的資料進行簡化,但資料簡化的過程絕非簡單,需要一套專業的技能和工具。

本書為來自各個科學領域的資料科學家提供了簡化資料的方法和工具,以便進行即時分析或長期存儲,並能夠輕鬆地重新運用或與其他資料整合。

作者Jules Berman根據多年的實踐經驗,並使用了眾多的例子和應用案例,討論了必須學習和掌握的資料簡化原則、方法和工具,開源工具、免費工具和程式碼片段可供重複使用和重新運用以簡化資料,自然語言處理和機器翻譯作為簡化資料的工具,以及資料摘要和可視化在使資料對最終用戶有用方面的作用。

本書還討論了必須學習和掌握的資料簡化原則、方法和工具,提供了開源工具、免費工具和程式碼片段,可供重複使用和重新運用以簡化資料,解釋了如何最佳利用索引來搜索、檢索和分析文本資料,並向資料科學家展示如何應用本體論、分類、類別、屬性和實例等傳統方法將資料應用於資料中。