商品描述
      Apache Spark is a popular open-source big-data processing framework that's built around speed, ease of use, and unified distributed computing architecture. Not only it supports developing applications in different languages like Java, Scala, Python, and R, it's also hundred times faster in memory and ten times faster even when running on disk compared to traditional data processing frameworks. Whether you are currently working on a big data project or interested in learning more about topics like machine learning, streaming data processing, and graph data analytics, this book is for you. You can learn about Apache Spark and develop Spark programs for various use cases in big data analytics using the code examples provided. This book covers all the libraries in Spark ecosystem: Spark Core, Spark SQL, Spark Streaming, Spark ML, and Spark GraphX.
    
      商品描述(中文翻譯)
Apache Spark 是一個受歡迎的開源大數據處理框架,專注於速度、易用性和統一的分散式計算架構。它不僅支持使用 Java、Scala、Python 和 R 等不同語言開發應用程式,還在記憶體中比傳統數據處理框架快一百倍,即使在磁碟上運行也快十倍。無論您目前是否正在進行大數據專案,或是對機器學習、串流數據處理和圖形數據分析等主題感興趣,本書都適合您。您可以學習 Apache Spark,並使用提供的程式碼範例為各種大數據分析的使用案例開發 Spark 程式。本書涵蓋了 Spark 生態系統中的所有庫:Spark Core、Spark SQL、Spark Streaming、Spark ML 和 Spark GraphX。

 
     
     
     
     
     
     
     
     
     
     
     
     
     
     
     
     
     
    