Smooth functions / Distribution / Functional analysis / Universal property


GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces
Hamid Reza Maei and Richard S. Sutton
Reinforcement Learning and Artificial Intelligence Laboratory, University of

Document Date: 2010-01-22 02:08:08



File Size: 149,40 KB
