SARSA

Results: 43



#Item
21Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda Carlton Downey Victoria University of Wellington, Wellington, New Zealand

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda Carlton Downey Victoria University of Wellington, Wellington, New Zealand

Add to Reading List

Source URL: users.cecs.anu.edu.au

Language: English - Date: 2010-05-05 13:10:11
22Learning to Follow Navigational Directions Adam Vogel and Dan Jurafsky Department of Computer Science Stanford University {acvogel,jurafsky}@stanford.edu

Learning to Follow Navigational Directions Adam Vogel and Dan Jurafsky Department of Computer Science Stanford University {acvogel,jurafsky}@stanford.edu

Add to Reading List

Source URL: nlp.stanford.edu

Language: English - Date: 2010-05-17 17:59:35
23Consistent exploration improves convergence of reinforcement learning on POMDPs Paul A. Crook Gillian Hayes

Consistent exploration improves convergence of reinforcement learning on POMDPs Paul A. Crook Gillian Hayes

Add to Reading List

Source URL: homepages.inf.ed.ac.uk

Language: English - Date: 2007-07-04 12:19:49
24LOGO_frontiersinpsychology

LOGO_frontiersinpsychology

Add to Reading List

Source URL: www.cs.utexas.edu

Language: English - Date: 2013-03-19 13:42:21
25Playing Atari with Deep Reinforcement Learning  Volodymyr Mnih Koray Kavukcuoglu

Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2013-12-19 20:23:45
26A neural reinforcement learning model for tasks with unknown time delays Daniel Rasmussen ([removed]) Chris Eliasmith ([removed]) Centre for Theoretical Neuroscience, University of Waterloo Wate

A neural reinforcement learning model for tasks with unknown time delays Daniel Rasmussen ([removed]) Chris Eliasmith ([removed]) Centre for Theoretical Neuroscience, University of Waterloo Wate

Add to Reading List

Source URL: mindmodeling.org

Language: English - Date: 2013-07-15 14:54:54
27Automatic Task Decomposition and State Abstraction from Demonstration Luis C. Cobo Charles L. Isbell Jr.

Automatic Task Decomposition and State Abstraction from Demonstration Luis C. Cobo Charles L. Isbell Jr.

Add to Reading List

Source URL: www.cc.gatech.edu

Language: English - Date: 2012-03-31 18:30:01
28Object Focused Q-learning for Autonomous Agents Luis C. Cobo Charles L. Isbell Jr.  Andrea L. Thomaz

Object Focused Q-learning for Autonomous Agents Luis C. Cobo Charles L. Isbell Jr. Andrea L. Thomaz

Add to Reading List

Source URL: www.cc.gatech.edu

Language: English - Date: 2013-04-16 11:54:07
29Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning Paulina Varshavskaya, Leslie Pack Kaelbling and Daniela Rus Computer Science and AI Laboratory Massachusetts Institute of Technolog

Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning Paulina Varshavskaya, Leslie Pack Kaelbling and Daniela Rus Computer Science and AI Laboratory Massachusetts Institute of Technolog

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2007-09-07 17:21:08
30Reinforcement learning with Gaussian processes  Yaakov Engel Dept. of Computing Science, University of Alberta, Edmonton, Canada Shie Mannor Dept. of Electrical and Computer Engineering, McGill University, Montreal, Cana

Reinforcement learning with Gaussian processes Yaakov Engel Dept. of Computing Science, University of Alberta, Edmonton, Canada Shie Mannor Dept. of Electrical and Computer Engineering, McGill University, Montreal, Cana

Add to Reading List

Source URL: www.machinelearning.org

Language: English - Date: 2008-12-01 11:15:01