Accelerating Deep Neural Networks
暫譯: 加速深度神經網絡

Sato, Ryoma

  • 出版商: Cambridge
  • 出版日期: 2026-06-04
  • 售價: $2,120
  • 貴賓價: 9.8$2,077
  • 語言: 英文
  • 頁數: 310
  • 裝訂: Hardcover - also called cloth, retail trade, or trade
  • ISBN: 1009687085
  • ISBN-13: 9781009687089
  • 相關分類: DeepLearning
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

Deep learning models are powerful, but are often large, slow, and expensive to run. This book is a practical guide to accelerating and compressing neural networks using proven techniques such as quantization, pruning, distillation, and fast architectures. It explains how and why these methods work, fostering a comprehensive understanding. Written for engineers, researchers, and advanced students, the book combines clear theoretical insights with hands-on PyTorch implementations and numerical results. Readers will learn how to reduce inference time and memory usage, lower deployment costs, and select the right acceleration strategy for their task. Whether you're working with large language models, vision systems, or edge devices, this book gives you the tools and intuition needed to build faster, leaner AI systems, without sacrificing performance. It is perfect for anyone who wants to go beyond intuition and take a principled approach to optimizing AI systems

商品描述(中文翻譯)

深度學習模型功能強大,但通常體積龐大、運行緩慢且成本高昂。本書是一本實用指南,介紹如何使用經過驗證的技術來加速和壓縮神經網絡,例如量化(quantization)、剪枝(pruning)、蒸餾(distillation)和快速架構(fast architectures)。本書解釋了這些方法的運作原理及其背後的原因,促進全面的理解。書中針對工程師、研究人員和高級學生,結合了清晰的理論見解與實作的 PyTorch 實現及數值結果。讀者將學會如何減少推理時間和內存使用、降低部署成本,並為其任務選擇合適的加速策略。無論您是在處理大型語言模型、視覺系統還是邊緣設備,本書都提供了構建更快、更精簡的 AI 系統所需的工具和直覺,而不會犧牲性能。這本書非常適合希望超越直覺,並採取原則性方法來優化 AI 系統的任何人。