Random Forests with R
暫譯: 使用 R 的隨機森林

Genuer, Robin, Poggi, Jean-Michel

  • 出版商: Springer
  • 出版日期: 2020-09-11
  • 售價: $2,480
  • 貴賓價: 9.5$2,356
  • 語言: 英文
  • 頁數: 98
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 3030564843
  • ISBN-13: 9783030564841
  • 相關分類: R 語言Data ScienceMachine Learning
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

This book offers an application-oriented guide to random forests: a statistical learning method extensively used in many fields of application, thanks to its excellent predictive performance, but also to its flexibility, which places few restrictions on the nature of the data used. Indeed, random forests can be adapted to both supervised classification problems and regression problems. In addition, they allow us to consider qualitative and quantitative explanatory variables together, without pre-processing. Moreover, they can be used to process standard data for which the number of observations is higher than the number of variables, while also performing very well in the high dimensional case, where the number of variables is quite large in comparison to the number of observations. Consequently, they are now among the preferred methods in the toolbox of statisticians and data scientists. The book is primarily intended for students in academic fields such as statistical education, but also for practitioners in statistics and machine learning. A scientific undergraduate degree is quite sufficient to take full advantage of the concepts, methods, and tools discussed. In terms of computer science skills, little background knowledge is required, though an introduction to the R language is recommended.

Random forests are part of the family of tree-based methods; accordingly, after an introductory chapter, Chapter 2 presents CART trees. The next three chapters are devoted to random forests. They focus on their presentation (Chapter 3), on the variable importance tool (Chapter 4), and on the variable selection problem (Chapter 5), respectively. After discussing the concepts and methods, we illustrate their implementation on a running example. Then, various complements are provided before examining additional examples. Throughout the book, each result is given together with the code (in R) that can be used to reproduce it. Thus, the book offers readers essential information and concepts, together with examples and the software tools needed to analyse data using random forests.

商品描述(中文翻譯)

這本書提供了一個以應用為導向的隨機森林指南:這是一種統計學習方法,因其卓越的預測性能而廣泛應用於許多領域,同時也因其靈活性,對所使用數據的性質幾乎沒有限制。事實上,隨機森林可以適應監督式分類問題和回歸問題。此外,它們允許我們同時考慮定性和定量的解釋變數,而無需進行預處理。此外,隨機森林可以用於處理觀察數量多於變數數量的標準數據,同時在高維情況下表現良好,這種情況下變數的數量相對於觀察數量來說是相當大的。因此,隨機森林現在已成為統計學家和數據科學家工具箱中首選的方法之一。本書主要針對統計教育等學術領域的學生,但也適合統計和機器學習的從業者。擁有科學本科學位就足以充分利用所討論的概念、方法和工具。在計算機科學技能方面,所需的背景知識不多,但建議對 R 語言有基本的介紹。

隨機森林屬於基於樹的方法家族;因此,在介紹章節之後,第二章介紹了 CART 樹。接下來的三章專注於隨機森林,分別聚焦於其介紹(第三章)、變數重要性工具(第四章)和變數選擇問題(第五章)。在討論概念和方法之後,我們通過一個運行示例來說明它們的實現。然後,提供各種補充內容,接著檢視其他示例。在整本書中,每個結果都附有可以用來重現該結果的代碼(使用 R 語言)。因此,本書為讀者提供了必要的信息和概念,並附有示例和使用隨機森林分析數據所需的軟體工具。

作者簡介

Robin Genuer is an Assistant Professor of Statistics at the University of Bordeaux and a member of the Inserm U1219 and Inria Bordeaux Sud-Ouest research centres.

Jean-Michel Poggi is a Professor of Statistics at the University of Paris and member of the LMO, the Orsay Mathematics Laboratory (University of Paris Saclay).

They have both produced various research works on random forests and have given numerous lectures and talks on the subject. They have also taught postgraduate and doctoral courses for a variety of audiences. Lastly, they have developed the R package VSURF


作者簡介(中文翻譯)

Robin Genuer 是波爾多大學的統計學助理教授,也是 Inserm U1219 和 Inria Bordeaux Sud-Ouest 研究中心的成員。

Jean-Michel Poggi 是巴黎大學的統計學教授,也是 LMO(奧爾塞數學實驗室,巴黎薩克雷大學)的成員。

他們都在隨機森林方面產出了多項研究成果,並就此主題進行了多次演講和報告。他們還為各種受眾教授研究生和博士課程。最後,他們開發了 R 套件 VSURF。