Apache Spark 2.x Cookbook
            
暫譯: Apache Spark 2.x 食譜
        
        Rishi Yadav
- 出版商: Packt Publishing
- 出版日期: 2017-05-31
- 定價: $1,650
- 售價: 8.0 折 $1,320
- 語言: 英文
- 頁數: 294
- 裝訂: Paperback
- ISBN: 1787127265
- ISBN-13: 9781787127265
- 
    相關分類:
    
      Spark
 
立即出貨 (庫存=1)
買這商品的人也買了...
- 
                
                   Machine Learning With R Cookbook - 110 Recipes for Building Powerful Predictive Models with R (Paperback) Machine Learning With R Cookbook - 110 Recipes for Building Powerful Predictive Models with R (Paperback)$1,900$1,805
- 
                
                   Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark (Paperback) Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark (Paperback)$1,730$1,644
- 
                
                   Python Deep Learning (Paperback) Python Deep Learning (Paperback)$2,120$2,014
- 
                
                   TensorFlow + Keras 深度學習人工智慧實務應用 TensorFlow + Keras 深度學習人工智慧實務應用$590$460
- 
                
                   $2,144Deep Learning: Practical Neural Networks with Java $2,144Deep Learning: Practical Neural Networks with Java
商品描述
Key Features
- Contains recipes on solving real-time data-processing problems with Apache Spark
- Utilize core Spark modules such as Spark SQL, Spark MLlib, Spark Streaming, and GraphX processing
- A practical guide to help you master Apache Spark as your single big data computing platform
Book Description
While Apache Spark 1.x gained lot of traction and adoption in the early years, Spark 2.0 delivers very notable improvements in the areas of API, Performance, Structured Streaming, and simplifying building blocks to build better, faster, smarter, and accessible big data applications. This book uncovers all these features in the form of structured recipes to analyze and mature large and complex sets of data.
Starting with installing and configuring Apache Spark with various cluster managers, you will learn to set up development environments. Furthermore, you will be introduced to working with RDD's, Data Frames to operate on data with schemas, and real-time streaming with various sources such as Twitter Stream and Apache Kafka. You will also work through recipes on machine learning, including supervised learning, unsupervised learning, recommendation engines, deep learning algorithms, and GPU implementations on Spark.
Last but not the least, the final few chapters will help you delve more deeply into the concepts of graph processing using GraphX, securing your implementations, cluster optimization, and troubleshooting.
What you will learn
- Install and configure Apache Spark with various cluster managers
- Set up a development environment for Apache Spark
- Learn to operate on data in Spark with schemas
- Get to grips with real-time streaming analytics using Spark Streaming
- Master supervised learning and unsupervised learning using MLlib
- Build a recommendation engine using MLlib
- Use Tensorframes to manipulate Spark's DataFrames with TensorFlow programs for deep learning
- Develop a set of common applications or project types, and solutions that solve complex big data problems
商品描述(中文翻譯)
**主要特點**
- 包含使用 Apache Spark 解決即時數據處理問題的範例
- 利用核心 Spark 模組,如 Spark SQL、Spark MLlib、Spark Streaming 和 GraphX 處理
- 實用指南,幫助您掌握 Apache Spark 作為單一的大數據計算平台
**書籍描述**
雖然 Apache Spark 1.x 在早期獲得了大量的關注和採用,但 Spark 2.0 在 API、性能、結構化流處理以及簡化構建模塊方面提供了顯著的改進,以便構建更好、更快、更智能且可訪問的大數據應用程式。本書以結構化的範例形式揭示了所有這些特性,以分析和成熟大型且複雜的數據集。
從安裝和配置 Apache Spark 及各種叢集管理器開始,您將學習如何設置開發環境。此外,您將接觸到使用 RDD 和 Data Frames 操作具有結構的數據,以及使用 Twitter Stream 和 Apache Kafka 等各種來源進行即時流處理。您還將學習機器學習的範例,包括監督式學習、非監督式學習、推薦引擎、深度學習算法以及在 Spark 上的 GPU 實現。
最後幾章將幫助您更深入地探討使用 GraphX 的圖形處理概念、保護您的實現、叢集優化和故障排除。
**您將學到的內容**
- 安裝和配置 Apache Spark 及各種叢集管理器
- 為 Apache Spark 設置開發環境
- 學習如何在 Spark 中操作具有結構的數據
- 熟悉使用 Spark Streaming 進行即時流分析
- 精通使用 MLlib 的監督式學習和非監督式學習
- 使用 MLlib 構建推薦引擎
- 使用 Tensorframes 操作 Spark 的 DataFrames,並結合 TensorFlow 程式進行深度學習
- 開發一組常見應用程式或專案類型,以及解決複雜大數據問題的解決方案























































 
     
     
     
     
     
     
     
     
     
     
    
 
    
 
     
     
     
     
     
     
     
     
     
     
     
     
     
     
     
     
    