Building Python Real-Time Applications with Storm

Kartik Bhatnagar, Barry Hart

  • 出版商: Packt Publishing
  • 出版日期: 2015-11-30
  • 售價: $1,340
  • 貴賓價: 9.5$1,273
  • 語言: 英文
  • 頁數: 122
  • 裝訂: Paperback
  • ISBN: 1784392855
  • ISBN-13: 9781784392857
  • 相關分類: Python程式語言Storm
  • 下單後立即進貨 (約3~4週)

商品描述

Learn to process massive real-time data streams using Storm and Python―no Java required!

About This Book

  • Learn to use Apache Storm and the Python Petrel library to build distributed applications that process large streams of data
  • Explore sample applications in real-time and analyze them in the popular NoSQL databases MongoDB and Redis
  • Discover how to apply software development best practices to improve performance, productivity, and quality in your Storm projects

Who This Book Is For

This book is intended for Python developers who want to benefit from Storm's real-time data processing capabilities. If you are new to Python, you'll benefit from the attention to key supporting tools and techniques such as automated testing, virtual environments, and logging. If you're an experienced Python developer, you'll appreciate the thorough and detailed examples

What You Will Learn

  • Install Storm and learn about the prerequisites
  • Get to know the components of a Storm topology and how to control the flow of data between them
  • Ingest Twitter data directly into Storm
  • Use Storm with MongoDB and Redis
  • Build topologies and run them in Storm
  • Use an interactive graphical debugger to debug your topology as it's running in Storm
  • Test your topology components outside of Storm
  • Configure your topology using YAML

In Detail

Big data is a trending concept that everyone wants to learn about. With its ability to process all kinds of data in real time, Storm is an important addition to your big data “bag of tricks.”

At the same time, Python is one of the fastest-growing programming languages today. It has become a top choice for both data science and everyday application development. Together, Storm and Python enable you to build and deploy real-time big data applications quickly and easily.

You will begin with some basic command tutorials to set up storm and learn about its configurations in detail. You will then go through the requirement scenarios to create a Storm cluster. Next, you'll be provided with an overview of Petrel, followed by an example of Twitter topology and persistence using Redis and MongoDB. Finally, you will build a production-quality Storm topology using development best practices.

Style and approach

This book takes an easy-to-follow and a practical approach to help you understand all the concepts related to Storm and Python.

商品描述(中文翻譯)

學習使用Storm和Python處理大規模的即時數據流,無需Java!

關於本書
- 學習使用Apache Storm和Python Petrel庫構建處理大量數據流的分佈式應用程序
- 在流行的NoSQL數據庫MongoDB和Redis中探索實時應用程序並進行分析
- 發現如何應用軟件開發最佳實踐,提高Storm項目的性能、生產力和質量

本書適合對Storm的實時數據處理能力感興趣的Python開發人員。如果你是Python新手,你將受益於對關鍵支持工具和技術(如自動化測試、虛擬環境和日誌記錄)的關注。如果你是有經驗的Python開發人員,你將欣賞到詳細的例子。

你將學到什麼
- 安裝Storm並了解先決條件
- 了解Storm拓撲結構的組件以及如何控制它們之間的數據流
- 將Twitter數據直接輸入Storm
- 使用Storm與MongoDB和Redis
- 構建拓撲結構並在Storm中運行
- 使用交互式圖形調試器在Storm中運行時調試拓撲結構
- 在Storm之外測試拓撲結構組件
- 使用YAML配置拓撲結構

詳細內容
大數據是一個熱門的概念,每個人都想學習。Storm能夠實時處理各種數據,是你大數據工具包中的重要組成部分。

同時,Python是當今增長最快的編程語言之一。它已成為數據科學和日常應用程序開發的首選。Storm和Python一起,讓你能夠快速、輕鬆地構建和部署實時大數據應用程序。

你將從一些基本的命令教程開始,設置Storm並詳細了解其配置。然後,你將通過需求場景來創建一個Storm集群。接下來,你將獲得Petrel的概述,並通過Twitter拓撲結構和使用Redis和MongoDB進行持久化的示例。最後,你將使用開發最佳實踐構建一個生產質量的Storm拓撲結構。

風格和方法
本書採用易於理解和實用的方法,幫助你理解Storm和Python相關概念。