Algorithms for Reinforcement Learning (Paperback)
暫譯: 強化學習的演算法 (平裝本)
Csaba Szepesvari
- 出版商: Morgan & Claypool
- 出版日期: 2010-06-25
- 售價: $1,430
- 貴賓價: 9.5 折 $1,359
- 語言: 英文
- 頁數: 104
- 裝訂: Paperback
- ISBN: 1608454924
- ISBN-13: 9781608454921
-
相關分類:
Reinforcement
立即出貨 (庫存=1)
買這商品的人也買了...
-
SQL 語法範例辭典$550$468 -
Ruby 學習手冊 (Learning Ruby)$620$490 -
C++ Primer, 4/e (中文版)$990$891 -
新商業語言 SOA 與 Web 2.0 (The New Language of Business: SOA & Web 2.0)$350$315 -
Introduction to Algorithms, 3/e (IE-Paperback)$1,590$1,558 -
Computer Organization and Design: The Hardware/Software Interface, 4/e (ARM Edition) (Paperback)$1,200$1,176 -
Windows Server 2008 R2 Active Directory 建置實務$620$490 -
計算機組織與設計 (Computer Organization and Design: The Hardware/Software Interface, 4/e)$900$855 -
王者歸來 Java Web 整合開發─ JSP + Servlet + Struts + Hibernate + Spring$980$833 -
抓住你的 Photoshop CS5$590$502 -
Google Android SDK 開發範例大全, 3/e$950$751 -
學徒模式-優秀軟體開發者的養成之路 (Apprenticeship Patterns: Guidance for the Aspiring Software Craftsman)$420$332 -
Google!Android 3 手機應用程式設計入門, 4/e$550$435 -
Eclipse 完全攻略-從基礎 Java 到 PDE 外掛開發$600$468 -
鳥哥的 Linux 私房菜-伺服器架設篇, 3/e$800$632 -
深入淺出 Python (Head First Python)$780$616 -
SQL Server 2012 T-SQL 資料庫設計
$690$545 -
MongoDB 技術手冊 (MongoDB: The Definitive Guide)$450$356 -
Reinforcement and Systemic Machine Learning for Decision Making (Hardcover)$3,980$3,781 -
Ruby on Rails 最佳程式設計指南(Ruby on Rails 程式設計技術詳解)$650$553 -
無瑕的程式碼 - 敏捷軟體開發技巧守則 (Clean Code: A Handbook of Agile Software Craftsmanship)$580$452 -
超圖解 Arduino 互動設計入門 (附 Arduino UNO R3 開發板)$1,130$961 -
SAP ABAP 開發從入門到精通$714$678 -
Effective JavaScript 中文版 | 駕馭 JavaScript 的 68 個具體作法 (Effective JavaScript: 68 Specific Ways to Harness the Power of JavaScript)$450$356 -
Special Edition Data Science Interview Questions Solved in Python and Spark: with Deep Learning and Reinforcement Learning bonus topics in Keras (Paperback)$1,480$1,406
商品描述
Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective.What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming.We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations.
商品描述(中文翻譯)
強化學習是一種學習範式,專注於學習如何控制系統,以最大化表達長期目標的數值性能指標。強化學習與監督學習的區別在於,學習者僅獲得有關其預測的部分反饋。此外,這些預測可能會通過影響受控系統的未來狀態而產生長期影響。因此,時間在此過程中扮演著特殊的角色。強化學習的目標是開發高效的學習演算法,以及理解這些演算法的優點和限制。強化學習引起了廣泛的興趣,因為它可以應用於許多實際問題,範圍從人工智慧到運籌學或控制工程。在本書中,我們專注於那些基於動態規劃強大理論的強化學習演算法。我們提供了一個相當全面的學習問題目錄,描述核心思想,列舉大量最先進的演算法,並討論它們的理論特性和限制。
