NODOVOX | Discovery

Reinforcement Learning: A Survey

Artículo

Material complementario disponible Artículo OpenAlex

Leslie Pack Kaelbling; Michael L. Littman; Andrew Moore · Journal of Artificial Intelligence Research · 1996

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad...

Idioma en

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Reinforcement Learning: An Introduction

Artículo

Página del recurso disponible Artículo OpenAlex

Reinforcement Learning: An Introduction

Richard S. Sutton; Andrew G. Barto · IEEE Transactions on Neural Networks · 2005

An account of key ideas and algorithms in reinforcement learning. The discussion ranges from the history of the field's intellectual foundations to recent developments and applications. Areas studied include reinforcemen...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Reinforcement Learning: An Introduction

Artículo

Página del recurso disponible Artículo OpenAlex

Reinforcement Learning: An Introduction

Jeffrey D. Johnson; Jinghong Li; Zengshi Chen · Neurocomputing · 2000

Materias / palabras clave: Reinforcement learning; Computer science; Reinforcement; Artificial intelligence; Machine learning; Psychology; Social psychology

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Reinforcement Learning: An Introduction

Artículo

Página del recurso disponible Artículo OpenAlex

Reinforcement Learning: An Introduction

Richard S. Sutton; Andy Barto · IEEE Transactions on Neural Networks · 1998

Materias / palabras clave: Reinforcement learning; Computer science; Reinforcement; Artificial intelligence; Engineering; Structural engineering

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Human-level control through deep reinforcement learning

Artículo

Página del recurso disponible Artículo OpenAlex

Human-level control through deep reinforcement learning

Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A. Rusu; Joel Veness; Marc G. Bellemare; Alex Graves; Martin Riedmiller · Nature · 2015

Materias / palabras clave: Reinforcement learning; Computer science; Artificial intelligence; Variety (cybernetics); Deep learning; Control (management); Perception; Human–computer interaction; Machine learning; Neuroscience; Biology

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Using combination of actions in reinforcement learning

Artículo

Acceso abierto Artículo SEDICI UNLP

Using combination of actions in reinforcement learning

Karanik, Marcelo J. et al · SEDICI UNLP · 2010

Software agents are programs that can observe their environment and act in an attempt to reach their design goals. In most cases the selection of particular agent architecture determines the behaviour in response to the ...

Idioma en

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)

Texto / recurso

Página del recurso disponible Texto / recurso OpenAlex

Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)

Natan, Avraham; Stern, Roni; Kalech, Meir · arXiv (Cornell University) · 2017

Due to the safety risks and training sample inefficiency, it is often preferred to develop controllers in simulation. However, minor differences between the simulation and the real world can cause a significant sim-to-re...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Learning by knowledge sharing in autonomous intelligent systems

Texto / recurso

Acceso abierto Texto / recurso RI ITBA

Learning by knowledge sharing in autonomous intelligent systems

García Martínez, Ramón et al · RI ITBA · 2018 · ISSN 0302-9743

"Very few learning systems applied to problem solving have focused on learning operator definitions from the interaction with a completely unknown environment. In order to achieve better learning convergence, several age...

Idioma en

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Discovering sensing capability in multi-agent systems

Texto / recurso

Acceso abierto Texto / recurso RI ITBA

Discovering sensing capability in multi-agent systems

Parpaglione, María Cristina et al · RI ITBA · 2022 · ISSN 7695-4400

"What should be the sensing capabilities of agents in a Multi-Agent System be to solve a problem efficiently, quickly and economicly? This question often appears when trying to solve a problem using Multi-Agent Systems. ...

Idioma en

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

A parallel implementation of Q-learning based on communication with cache

Artículo

Material complementario disponible Artículo SEDICI UNLP

A parallel implementation of Q-learning based on communication with cache

Printista, Alicia Marcela et al · SEDICI UNLP · 2002

Q-Learning is a Reinforcement Learning method for solving sequential decision problems, where the utility of actions depends on a sequence of decisions and there exists uncertainty about the dynamics of the environment t...

Idioma en

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Deep learning in neural networks: An overview

Texto / recurso

Página del recurso disponible Texto / recurso OpenAlex

Deep learning in neural networks: An overview

Jürgen Schmidhuber · Neural Networks · 2014

Materias / palabras clave: Artificial intelligence; Deep learning; Computer science; Artificial neural network; Backpropagation; Reinforcement learning; Machine learning; Deep neural networks; Unsupervised learning; Recurrent neural network; Encod...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Advances in Neural Information Processing Systems 14

Libro electrónico

Página del recurso disponible Libro electrónico OpenAlex

Advances in Neural Information Processing Systems 14

The MIT Press eBooks · 2002

The proceedings of the 2001 Neural Information Processing Systems (NIPS) Conference. The annual conference on Neural Information Processing Systems (NIPS) is the flagship conference on neural computation. The conference ...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Artificial intelligence: a modern approach

Artículo

Página del recurso disponible Artículo OpenAlex

Artificial intelligence: a modern approach

Dr. Anil Kumar; Sivasubramanian Balasubramanian; Dr. Haewon Byeon; Prof. Ganesh Vasudeo Manerkar · Choice Reviews Online · 1995

The long-anticipated revision of this #1 selling book offers the most comprehensive, state of the art introduction to the theory and practice of artificial intelligence for modern applications. Intelligent Agents. Solvin...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Techniques for improving the perfomance and scalability of directory-based shared-memory multiprocessors : A survey

Artículo

Acceso abierto Artículo SEDICI UNLP

Techniques for improving the perfomance and scalability of directory-based shared-memory multiprocessors : A survey

Acacio Sánchez, Manuel et al · SEDICI UNLP · 2003

Cache-coherent, nonumiform memory acces or cc-NUMA is an attractive architecture for building a spectrum of shared memory multiprocessors (whic are socing widespread use in commercial, technical and scientific applicatio...

Idioma en

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Mastering the game of Go with deep neural networks and tree search

Artículo

Página del recurso disponible Artículo OpenAlex

Mastering the game of Go with deep neural networks and tree search

David Silver; Aja Huang; Chris J. Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou · Nature · 2016

Materias / palabras clave: Monte Carlo tree search; Computer science; Champion; Artificial neural network; Reinforcement learning; Artificial intelligence; Game tree; Value (mathematics); Search algorithm; Tree (set theory); Deep neural networks; ...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Mastering the game of Go without human knowledge

Artículo

Página del recurso disponible Artículo OpenAlex

Mastering the game of Go without human knowledge

David Silver; Julian Schrittwieser; Karen Simonyan; Ioannis Antonoglou; Aja Huang; Arthur Guez; Thomas Hubert; Lucas Baker · Nature · 2017

Materias / palabras clave: Champion; Artificial intelligence; Artificial neural network; Reinforcement learning; Computer science; Selection (genetic algorithm); Tree (set theory); Machine learning; Domain (mathematical analysis); Mathematics; Pol...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Improved automatic discovery of subgoals for options in hierarchical

Artículo

Acceso abierto Artículo SEDICI UNLP

Improved automatic discovery of subgoals for options in hierarchical

Kretchmar, R. Matthew et al · SEDICI UNLP · 2003

Options have been shown to be a key step in extending reinforcement learning beyond low-level reactionary systems to higher-level, planning systems. Most of the options research involves hand-crafted options; there has b...

Idioma en

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Training a gaming agent on brainwaves

Artículo

Material complementario disponible Artículo RI ITBA

Training a gaming agent on brainwaves

Bartolomé, Francisco et al · RI ITBA · 2022

"Error-related potential (ErrP) are a particular type of Event-Related Potential (ERP) elicited by a person attending a recognizable error. These Electroencephalographic (EEG) signals can be used to train a gaming agent ...

Idioma en

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Advances in Neural Information Processing Systems 19

Libro electrónico

Página del recurso disponible Libro electrónico OpenAlex

Advances in Neural Information Processing Systems 19

The MIT Press eBooks · 2007

Papers from the 2006 flagship meeting on neural computation, with contributions from physicists, neuroscientists, mathematicians, statisticians, and computer scientists. The annual Neural Information Processing Systems (...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

Artículo

Página del recurso disponible Artículo OpenAlex

Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

Ramprasaath R. Selvaraju; Michael Cogswell; Abhishek Das; Ramakrishna Vedantam; Devi Parikh; Dhruv Batra · OpenAlex · 2017

We propose a technique for producing `visual explanations' for decisions from a large class of Convolutional Neural Network (CNN)-based models, making them more transparent. Our approach - Gradient-weighted Class Activat...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Buscar recursos académicos

Resultados