Hands-On Data Science with the Command Line: Automate everyday data science tasks using command-line tools
Jason Morris, Chris McCubbin, Raymond Page
- Publisher: Packt Publishing
- Publication date: 2019-01-31
- Price: $1,460
- VIP price: 5% off, $1,387
- Language: English
- Pages: 124
- Binding: Paperback
- ISBN: 1789132983
- ISBN-13: 9781789132984
- Related categories: Command Line, Data Science
Overseas-order title (requires separate checkout)
Product description
Big data processing and analytics at speed and scale using command-line tools.
Key Features
- Perform string processing, numerical computations, and more using CLI tools
- Understand the essential components of data science development workflow
- Automate data pipeline scripts and visualization with the command line
Book Description
The command line has existed on UNIX-based operating systems in the form of the Bash shell for over three decades. However, few developers know how command-line tools can be OSEMN (pronounced "awesome" and standing for Obtaining, Scrubbing, Exploring, Modeling, and iNterpreting data) for carrying out simple-to-advanced data science tasks at speed.
This book starts with the requisite concepts and installation steps for carrying out data science tasks using the command line. You will learn to create a data pipeline to solve the problem of working with small- to medium-sized files on a single machine. You will understand the power of the command line and learn how to edit files using a text-based editor. You will learn not only how to automate jobs and scripts, but also how to visualize data using the command line.
By the end of this book, you will know how to speed up your workflow and automate tasks using command-line tools.
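As a rough illustration of the kind of single-machine data pipeline the description refers to, the sketch below counts HTTP status codes in a few log-style lines. The sample lines and the variable names (`log_lines`, `status_counts`) are invented for this example and do not come from the book; only standard `awk`, `sort`, and `uniq` are assumed.

```shell
# Hypothetical sample input: method, path, status code (not taken from the book).
log_lines='GET /index.html 200
GET /about.html 404
POST /login 200
GET /index.html 200
GET /missing 404'

# Scrub: keep only the third column (the status code).
# Explore: count occurrences of each code, most frequent first,
# then reformat each line as "code: count".
status_counts="$(printf '%s\n' "$log_lines" \
  | awk '{ print $3 }' \
  | sort \
  | uniq -c \
  | sort -rn \
  | awk '{ print $2 ": " $1 }')"

echo "$status_counts"
```

Each stage reads from the previous one through a pipe, so no intermediate files are needed for inputs of this size.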
What you will learn
- Understand how to set up the command line for data science
- Use AWK commands to search large datasets quickly
- Work with files and APIs using the command line
- Share and collect data with CLI tools
- Perform visualization with commands and functions
- Uncover machine-level programming practices with a modern approach to data science
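To make the AWK item in the list above more concrete, here is a minimal sketch of searching and aggregating a small CSV file with `awk`. The file contents, the column layout, and the names `data_file`, `north_rows`, and `north_total` are assumptions made for this example and are not taken from the book.

```shell
# Write a tiny, hypothetical CSV to a temporary file.
data_file="$(mktemp)"
cat > "$data_file" <<'EOF'
region,year,sales
north,2018,120
south,2018,80
north,2019,150
south,2019,95
EOF

# Search: print data rows (skipping the header) whose first field is "north".
north_rows="$(awk -F',' 'NR > 1 && $1 == "north"' "$data_file")"

# Aggregate: sum the third field (sales) over the same rows.
north_total="$(awk -F',' 'NR > 1 && $1 == "north" { total += $3 } END { print total }' "$data_file")"

echo "$north_rows"
echo "Total: $north_total"

# Clean up the temporary file.
rm -f "$data_file"
```

Because `awk` reads its input one line at a time, the same pattern applies to much larger files without loading them into memory.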
Who this book is for
This book is for data scientists and data analysts who have little to no knowledge of the command line but have an understanding of data science. Perform everyday data science tasks using the power of command-line tools.
Table of Contents
- Data Science at the Command Line and Setting It Up
- Essential Commands
- Obtaining and Working with Data, Detached Processing and Terminal Multiplexers
- Bash Functions and Data Visualization
- Loops, Functions and String Processing
- The Command Line as a Database, Math in Bash, and Bringing It All Together