Книга: Md. Abdus Samad Kamal «Efficient Reinforcement Learning in High Dimensional Domains»

Efficient Reinforcement Learning in High Dimensional Domains

Производитель: "LAP Lambert Academic Publishing"

This book presents development of efficient reinforcement learning methods in a postgraduate research. A reinforcement learning agent tries every state-action pair to find the optimal policy without prior knowledge about the domain. In large domains visiting every state-action pair is not feasible by an agent, therefore standard reinforcement learning approach is not applicable in solving many real world problems. Three new methods are proposed to make the learning efficient according to the characteristics of the problems: Task-Oriented Reinforcement Learning reduces the problem size by viewing it from the task`s viewpoint that clarifies task relevant state variables. Symmetrical-Actions Reinforcement Leaning reduces the size of a learning problem by exploiting partial symmetry over action relevant state variables and representing actions values by a single function. Coordinated Multiagent Reinforcement Learning technique uses coordinator-agent hierarchy to keep the size of... ISBN:9783846555712

Издательство: "LAP Lambert Academic Publishing" (2011)

ISBN: 9783846555712

См. также в других словарях:

  • Genetic algorithm — A genetic algorithm (GA) is a search heuristic that mimics the process of natural evolution. This heuristic is routinely used to generate useful solutions to optimization and search problems. Genetic algorithms belong to the larger class of… …   Wikipedia

  • Business and Industry Review — ▪ 1999 Introduction Overview        Annual Average Rates of Growth of Manufacturing Output, 1980 97, Table Pattern of Output, 1994 97, Table Index Numbers of Production, Employment, and Productivity in Manufacturing Industries, Table (For Annual… …   Universalium

  • Artificial intelligence — AI redirects here. For other uses, see Ai. For other uses, see Artificial intelligence (disambiguation). TOPIO, a humanoid robot, played table tennis at Tokyo International Robot Exhibition (IREX) 2009.[1] Artificial intelligence ( …   Wikipedia

  • Cellular neural network — Cellular neural networks (CNN) are a parallel computing paradigm similar to neural networks, with the difference that communication is allowed between neighbouring units only. Typical applications include image processing, analyzing 3D surfaces,… …   Wikipedia

  • Applied behavior analysis — (ABA) is the science of applying experimentally derived principles of behavior to improve socially significant behavior. ABA takes what we know about behavior and uses it to bring about positive change (Applied). Behaviors are defined in… …   Wikipedia

Поделиться ссылкой на выделенное

Прямая ссылка:
Нажмите правой клавишей мыши и выберите «Копировать ссылку»