Real-Time Streaming with Apache Kafka, Spark, and Storm: Create Platforms That Can Quickly Crunch Data and Deliver Real-Time Analytics to Users (Engli

Jeyaraman, Brindha Priyadarshini

  • Publisher: BPB Publications
  • Publication date: 2021-08-21
  • Price: $950
  • VIP price: $903 (5% off)
  • Language: English
  • Pages: 182
  • Binding: Quality Paper - also called trade paper
  • ISBN: 9390684595
  • ISBN-13: 9789390684595
  • Related categories: Spark
  • Overseas special-order book (requires separate checkout)

Product Description

Build a platform using Apache Kafka, Spark, and Storm to generate real-time data insights and view them through dashboards.

Key Features

● Extensive practical demonstration of Apache Kafka concepts, including producer and consumer examples.

● Includes graphical examples and explanations of implementing Kafka Producer and Kafka Consumer commands and methods.

● Covers integration and implementation of Spark-Kafka and Kafka-Storm architectures.

Description

Real-Time Streaming with Apache Kafka, Spark, and Storm provides an overview of real-time streaming concepts and the architectures of Apache Kafka, Storm, and Spark. Readers will learn how to build systems that process data streams in real time using these technologies, handling large volumes of real-time data to perform analytics and generate insights.

The book describes the architecture of Kafka and its components in detail and demonstrates the installation and configuration of a Kafka cluster. The Kafka publisher-subscriber system is implemented both from the command line and in Java using the Eclipse IDE. The book then discusses the architecture of Apache Storm and the concepts of Spouts and Bolts, and applies them in a Transaction Alert System. It also describes Spark's core concepts, its applications, and the use of Spark to implement a microservice. Finally, it covers the integration of Kafka with Storm and discusses two approaches to integrating Spark with Kafka.

This book helps software engineers transition into Big Data engineer and Big Data architect roles by providing knowledge of big data processing and the architectures of Kafka, Storm, and Spark Streaming.

What you will learn

● Create Kafka producers, consumers, and brokers using the command line.

● Implement a Kafka messaging system end to end with Java in Eclipse.

● Install and create a Storm cluster and execute Storm management commands.

● Implement Spouts, Bolts, and a Topology in Storm for a Transaction Alert application.

Who this book is for

This book is intended for Software Developers, Data Scientists, and Big Data Architects who want to build software systems that process data streams in real time. To follow the concepts in this book, readers need working knowledge of at least one programming language, such as Java or Python.

Table of Contents

1. Introduction to Kafka

2. Installing Kafka

3. Kafka Messaging

4. Kafka Producers

5. Kafka Consumers

6. Introduction to Storm

7. Installation and Configuration

8. Spouts and Bolts

9. Introduction to Spark

10. Spark Streaming

11. Kafka Integration with Storm

12. Kafka Integration with Spark

About the Author

Brindha Priyadarshini Jeyaraman has more than 12 years of work experience in software development and in building data analytics systems. She completed her M.Tech in Knowledge Engineering at the National University of Singapore, where she was awarded a gold medal. She is an expert in understanding business problems and in designing and implementing solutions using Machine Learning. She has a strong software development background with extensive experience in implementing data analytics systems. She has worked on several Data Science projects in the Transportation, E-commerce, Healthcare, Insurance, Banking, and Finance domains. She holds the SCJP and SCWCD certifications.

LinkedIn Profile: https://www.linkedin.com/in/brindha-jeyaraman-75347922/
