SARSA - PDFSEARCH.IO - Document Search Engine

SARSA
Results: 43

#	Item
21	Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda Carlton Downey Victoria University of Wellington, Wellington, New Zealand Add to Reading List Source URL: users.cecs.anu.edu.au Language: English - Date: 2010-05-05 13:10:11 Markov chain Q-learning Artificial intelligence Learning Statistics SARSA Temporal difference learning
22	Learning to Follow Navigational Directions Adam Vogel and Dan Jurafsky Department of Computer Science Stanford University {acvogel,jurafsky}@stanford.edu Add to Reading List Source URL: nlp.stanford.edu Language: English - Date: 2010-05-17 17:59:35 SARSA Q-learning Reinforcement learning Temporal difference learning Machine learning Algorithm Apprenticeship learning Spatial memory Artificial intelligence Learning Mathematics
23	Consistent exploration improves convergence of reinforcement learning on POMDPs Paul A. Crook Gillian Hayes Add to Reading List Source URL: homepages.inf.ed.ac.uk Language: English - Date: 2007-07-04 12:19:49 Stochastic control SARSA Markov models Theoretical computer science Reinforcement learning Q-learning Council on Environmental Quality Temporal difference learning Partially observable Markov decision process Statistics Markov processes Dynamic programming
24	LOGO_frontiersinpsychology Add to Reading List Source URL: www.cs.utexas.edu Language: English - Date: 2013-03-19 13:42:21 Cognitive architecture Reinforcement learning Q-learning Temporal difference learning Motivation Action selection Modularity SARSA ACT-R Artificial intelligence Behavior Mind
25	Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu Add to Reading List Source URL: arxiv.org Language: English - Date: 2013-12-19 20:23:45 Computational neuroscience Cybernetics Reinforcement learning Q-learning Temporal difference learning SARSA Markov decision process Unsupervised learning Recurrent neural network Machine learning Neural networks Statistics
26	A neural reinforcement learning model for tasks with unknown time delays Daniel Rasmussen ([removed]) Chris Eliasmith ([removed]) Centre for Theoretical Neuroscience, University of Waterloo Wate Add to Reading List Source URL: mindmodeling.org Language: English - Date: 2013-07-15 14:54:54 Statistics Cybernetics Computational statistics Artificial intelligence Learning Q-learning Artificial neural network Reinforcement learning SARSA Computational neuroscience Neural networks Machine learning
27	Automatic Task Decomposition and State Abstraction from Demonstration Luis C. Cobo Charles L. Isbell Jr. Add to Reading List Source URL: www.cc.gatech.edu Language: English - Date: 2012-03-31 18:30:01 Thought Machine learning Apprenticeship learning Computational neuroscience Reinforcement learning Abstraction Q-learning Ada SARSA Computing Mathematics Cognition
28	Object Focused Q-learning for Autonomous Agents Luis C. Cobo Charles L. Isbell Jr. Andrea L. Thomaz Add to Reading List Source URL: www.cc.gatech.edu Language: English - Date: 2013-04-16 11:54:07 Q-learning Markov decision process Theoretical computer science SARSA Algorithm Function Statistics Mathematics Reinforcement learning
29	Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning Paulina Varshavskaya, Leslie Pack Kaelbling and Daniela Rus Computer Science and AI Laboratory Massachusetts Institute of Technolog Add to Reading List Source URL: people.csail.mit.edu Language: English - Date: 2007-09-07 17:21:08 Stochastic control Reinforcement learning Computational neuroscience Partially observable Markov decision process Machine learning Markov decision process Q-learning Self-reconfiguring modular robot SARSA Statistics Dynamic programming Markov processes
30	Reinforcement learning with Gaussian processes Yaakov Engel Dept. of Computing Science, University of Alberta, Edmonton, Canada Shie Mannor Dept. of Electrical and Computer Engineering, McGill University, Montreal, Cana Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2008-12-01 11:15:01 Control theory Linear filters Stochastic differential equations Kalman filter Markov decision process Normal distribution Gaussian process Q-learning SARSA Statistics Markov models Stochastic processes