Parallel R (Paperback)

Q. Ethan McCallum, Stephen Weston

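  • Publisher: O'Reilly
  • Publication date: 2011-11-29
  • List price: $660
  • Sale price: $627 (5% off)
  • VIP price: $594 (10% off)
  • Language: English
  • Pages: 126
  • Binding: Paperback
  • ISBN: 1449309925
  • ISBN-13: 9781449309923
  • Related category: R language
  • Ships immediately (fewer than 3 in stock)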

Product description

It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product—unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets. You’ll learn the basics of Snow, Multicore, Parallel, and some Hadoop-related tools, including how to find them, how to use them, when they work well, and when they don’t.

With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.

  • Snow: works well in a traditional cluster environment
  • Multicore: popular for multiprocessor and multicore computers
  • Parallel: part of the upcoming R 2.14.0 release
  • R+Hadoop: provides low-level access to a popular form of cluster computing
  • RHIPE: uses Hadoop’s power with R’s language and interactive shell
  • Segue: lets you use Elastic MapReduce as a backend for lapply-style operations
