Generative AI on Kubernetes: Operationalizing Large Language Models
暫譯: 在 Kubernetes 上的生成式 AI:大型語言模型的運營化

Huß, Roland, Zonca, Daniele

  • 出版商: O'Reilly
  • 出版日期: 2026-04-07
  • 售價: $2,110
  • 貴賓價: 9.5$2,004
  • 語言: 英文
  • 頁數: 404
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1098171926
  • ISBN-13: 9781098171926
  • 相關分類: Large language modelKubernetes
  • 尚未上市,無法訂購

商品描述

Generative AI is revolutionizing industries, and Kubernetes has fast become the backbone for deploying and managing these resource-intensive workloads. This book serves as a practical, hands-on guide for MLOps engineers, software developers, Kubernetes administrators, and AI professionals ready to unlock AI innovation with the power of cloud native infrastructure. Authors Roland Hu and Daniele Zonca provide a clear road map for training, fine-tuning, deploying, and scaling GenAI models on Kubernetes, addressing challenges like resource optimization, automation, and security along the way.

With actionable insights with real-world examples, readers will learn to tackle the opportunities and complexities of managing GenAI applications in production environments. Whether you're experimenting with large-scale language models or facing the nuances of AI deployment at scale, you'll uncover expertise you need to operationalize this exciting technology effectively.

  • Learn to run GenAI models on Kubernetes for efficient scalability
  • Get techniques to train and fine-tune LLMs within Kubernetes environments
  • See how to deploy production-ready AI systems with automation and resource optimization
  • Discover how to monitor and scale GenAI applications to handle real-world demand
  • Uncover the best tools to operationalize your GenAI workloads
  • Learn how to run agent-based and AI-driven applications

商品描述(中文翻譯)

生成式人工智慧正在革新各行各業,而 Kubernetes 已迅速成為部署和管理這些資源密集型工作負載的基礎。本書作為一本實用的操作指南,適合 MLOps 工程師、軟體開發人員、Kubernetes 管理員和準備利用雲原生基礎設施釋放 AI 創新的 AI 專業人士。作者 Roland Hu 和 Daniele Zonca 提供了一個清晰的路線圖,涵蓋在 Kubernetes 上訓練、微調、部署和擴展生成式 AI 模型的過程,同時解決資源優化、自動化和安全性等挑戰。

透過實用的見解和真實案例,讀者將學會如何應對在生產環境中管理生成式 AI 應用的機會與複雜性。無論您是在實驗大規模語言模型,還是面對大規模 AI 部署的細微差別,您都將發現有效運用這項令人興奮的技術所需的專業知識。

- 學習如何在 Kubernetes 上運行生成式 AI 模型以實現高效擴展
- 獲取在 Kubernetes 環境中訓練和微調大型語言模型(LLMs)的技術
- 了解如何以自動化和資源優化的方式部署生產就緒的 AI 系統
- 探索如何監控和擴展生成式 AI 應用以應對現實世界的需求
- 發現最佳工具以實現您的生成式 AI 工作負載的運營化
- 學習如何運行基於代理和 AI 驅動的應用程式