Kafka Troubleshooting in Production: Stabilizing Kafka Clusters in the Cloud and On-Premises (Paperback)
暫譯: 生產環境中的 Kafka 故障排除:穩定雲端及本地的 Kafka 集群 (平裝本)
Eldor, Elad
- 出版商: Apress
- 出版日期: 2023-11-30
- 售價: $1,550
- 貴賓價: 9.5 折 $1,473
- 語言: 英文
- 頁數: 216
- 裝訂: Quality Paper - also called trade paper
- ISBN: 1484294890
- ISBN-13: 9781484294895
-
相關分類:
Message Queue
海外代購書籍(需單獨結帳)
買這商品的人也買了...
-
無瑕的程式碼-敏捷完整篇-物件導向原則、設計模式與 C# 實踐 (Agile principles, patterns, and practices in C#)$790$616 -
Working Effectively with Legacy Code : 管理、修改、重構遺留程式碼的藝術 (中文版)$720$562 -
$477Rust 權威指南 (The Rust Programming Language (Covers Rust 2018)) -
再強一點:用 Go語言完成六個大型專案$780$616 -
$1,400Network Programming with Go: Learn to Code Secure and Reliable Network Services from Scratch -
黑帽 Python|給駭客與滲透測試者的 Python 開發指南, 2/e (Black Hat Python : Python Programming for Hackers and Pentesters, 2/e)$450$356 -
Spring REST API 開發與測試指南|使用 Swagger、HATEOAS、JUnit、Mockito、PowerMock、Spring Test$580$493 -
Linux 網路內功修煉 - 徹底了解底層原理及高性能架構$780$663 -
演算法生存指南(書況差限門市銷售)$800$632 -
從 Hooks 開始,讓你的網頁 React 起來 (第二版)(iT邦幫忙鐵人賽系列書)$720$562 -
哎呀!不小心刻了一套 React UI 元件庫 : 從無到有輕鬆上手(iThome鐵人賽系列書)$650$507 -
The Rust Programming Language, 2/e (Paperback)$1,800$1,710 -
哎呀!原來 React 這麼有趣好玩:圈叉、貪吃蛇、記憶方塊三款經典遊戲實戰練習(iThome鐵人賽系列書)$620$484 -
Smaller C|用於小型機器之精實程式碼 (Smaller C: Lean Code for Small Machines)$680$537 -
白話機器學習$780$616 -
React 思維進化:一次打破常見的觀念誤解,躍升專業前端開發者(iThome鐵人賽系列書)【軟精裝】$790$616 -
Python 風格徹底研究|超詳實、好理解的 Python 必學主題 (Dead Simple Python)$980$774 -
遞迴演算法大師親授面試心法:Python 與 JavaScript 解題全攻略 (The Recursive Book of Recursion)$680$530 -
建構機器學習系統實踐指南$620$490 -
機器學習的訓練資料 (Training Data for Machine Learning)$780$616 -
資料工程基礎|規劃和建構強大、穩健的資料系統 (Fundamentals of Data Engineering)$980$774 -
讓 AI 好好說話!從頭打造 LLM (大型語言模型) 實戰秘笈$680$537 -
日式 RPG 編年史:從 DQ 到 FF,角色扮演遊戲敘事手法完全剖析$380$300 -
軟體工程師的英語使用守則:English for Developers$420$357 -
內行人才知道的系統設計面試指南 第二輯 (System Design Interview – An Insider's Guide: Volume 2)$820$648
商品描述
This book provides Kafka administrators, site reliability engineers, and DataOps and DevOps practitioners with a list of real production issues that can occur in Kafka clusters and how to solve them. The production issues covered are assembled into a comprehensive troubleshooting guide for those engineers who are responsible for the stability and performance of Kafka clusters in production, whether those clusters are deployed in the cloud or on-premises. This book teaches you how to detect and troubleshoot the issues, and eventually how to prevent them.
Kafka stability is hard to achieve, especially in high throughput environments, and the purpose of this book is not only to make troubleshooting easier, but also to prevent production issues from occurring in the first place. The guidance in this book is drawn from the author's years of experience in helping clients and internal customers diagnose and resolve knotty production problems and stabilize their Kafka environments. The book is organized into recipe-style troubleshooting checklists that field engineers can easily follow when under pressure to fix an unstable cluster. This is the book you will want by your side when the stakes are high, and your job is on the line.
What You Will Learn
- Monitor and resolve production issues in your Kafka clusters
- Provision Kafka clusters with the lowest costs and still handle the required loads
- Perform root cause analyses of issues affecting your Kafka clusters
- Know the ways in which your Kafka cluster can affect its consumers and producers
- Prevent or minimize data loss and delays in data streaming
- Forestall production issues through an understanding of common failure points
- Create checklists for troubleshooting your Kafka clusters when problems occur
Who This Book Is For
Site reliability engineers tasked with maintaining stability of Kafka clusters, Kafka administrators who troubleshoot production issues around Kafka, DevOps and DataOps experts who are involved with provisioning Kafka (whether on-premises or in the cloud), developers of Kafka consumers and producers who wish to learn more about Kafka
商品描述(中文翻譯)
這本書為 Kafka 管理員、網站可靠性工程師以及 DataOps 和 DevOps 實踐者提供了一份可能在 Kafka 集群中發生的實際生產問題清單及其解決方案。所涵蓋的生產問題被整理成一個全面的故障排除指南,供那些負責生產環境中 Kafka 集群穩定性和性能的工程師使用,無論這些集群是部署在雲端還是本地。這本書教你如何檢測和排除問題,最終如何防止它們的發生。
Kafka 的穩定性難以實現,尤其是在高吞吐量的環境中,這本書的目的不僅是讓故障排除變得更容易,還是為了防止生產問題的發生。書中的指導來自於作者多年來幫助客戶和內部用戶診斷和解決棘手的生產問題以及穩定其 Kafka 環境的經驗。這本書以食譜式的故障排除檢查清單組織,現場工程師在面對不穩定集群的壓力下可以輕鬆遵循。當風險高、工作岌岌可危時,這本書將是你身邊必備的參考資料。
你將學到什麼
- 監控並解決你的 Kafka 集群中的生產問題
- 以最低成本配置 Kafka 集群,同時處理所需的負載
- 對影響你的 Kafka 集群的問題進行根本原因分析
- 了解你的 Kafka 集群如何影響其消費者和生產者
- 防止或最小化數據丟失和數據流延遲
- 通過了解常見故障點來預防生產問題
- 在問題發生時為你的 Kafka 集群創建故障排除檢查清單
本書適合誰閱讀
負責維護 Kafka 集群穩定性的網站可靠性工程師、針對 Kafka 生產問題進行故障排除的 Kafka 管理員、參與 Kafka 配置的 DevOps 和 DataOps 專家(無論是在本地還是雲端),希望深入了解 Kafka 的 Kafka 消費者和生產者開發者
作者簡介
Elad Eldor is a DataOps team leader in the Grow division of Unity (formerly ironSource), working on handling stability issues, improving performance, and reducing the cost of high-scale Kafka, Druid, Presto, and Spark clusters on AWS. He has 12 years of experience as a backend software engineer and six years handling DataOps of big data Linux-based clusters.
Prior to working at Unity, Elad was a Site Reliability Engineer (SRE) at Cognyte, where he developed big data applications and handled the reliability and scalability of Spark and Kafka clusters in production. His main interests are performance tuning and cost reduction of big data clusters.
作者簡介(中文翻譯)
Elad Eldor 是 Unity(前身為 ironSource)Grow 部門的 DataOps 團隊負責人,專注於處理穩定性問題、改善性能以及降低在 AWS 上運行的大規模 Kafka、Druid、Presto 和 Spark 集群的成本。他擁有 12 年的後端軟體工程師經驗,以及 6 年處理基於 Linux 的大數據集群的 DataOps 經驗。
在加入 Unity 之前,Elad 曾在 Cognyte 擔任網站可靠性工程師(SRE),負責開發大數據應用程式並處理生產環境中 Spark 和 Kafka 集群的可靠性和可擴展性。他的主要興趣是大數據集群的性能調優和成本降低。