registrieren | anmelden | FAQ      [?] 

Tag q-learning [13 articles]

Recent papers classified by the tag q-learning.
  • notes Learning to Act using Real-Time Dynamic Programming
    No. UM-CS-1993-002. (, 1993)
    by Andrew G Barto, Steven J Bradtke, Satinder P Singh
    posted to q-learning by minhn on 2007-08-11 06:48:50 as ***
  • Pseudo-convergent Q-Learning by Competitive Pricebots
    (2000), pp. 463-470.
    by Jeffrey O Kephart, Gerald Tesauro
    posted to q-learning ai by cybrpunk to the group dopsy on 2007-06-06 11:08:00 as **
  • Q-learning
    Machine Learning, Vol. 8, No. 3. (1 May 1992), pp. 279-292.
    by Christopher J Watkins, Peter Dayan
  • Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning
    Artificial Intelligence, Vol. 112, No. 1-2. (1999), pp. 181-211.
    by Richard S Sutton, Doina Precup, Satinder Singh
  • Technical Note: Q-Learning
    Machine Learning, Vol. 8, No. 3. (1 May 1992), pp. 279-292.
    by Christopher JCH Watkins, Peter Dayan
  • A Comparative Study of Parallel Reinforcement Learning Methods with a PC Cluster System
    (2006), pp. 416-419.
    by Masayuki Kushida, Kenichi Takahashi, Hiroaki Ueda, Tetsuhiro Miyahara
  • A new Q-learning algorithm based on the metropolis criterion
    Systems, Man, and Cybernetics, Part B, IEEE Transactions on, Vol. 34, No. 5. (2004), pp. 2140-2143.
    by Maozu Guo, Yang Liu, J Malec
    posted to q-learning reinforcement-learning metropolis 2004 by ddahlem on 2008-09-29 12:54:15 as read
  • Multiagent reinforcement learning using function approximation
    Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, Vol. 30, No. 4. (2000), pp. 485-497.
    by O Abul, F Polat, R Alhajj
  • Potential-based shaping and Q-value initialization are equivalent
    Journal of Artificial Intelligence Research, Vol. 19 (2003), pp. 205-208.
    by Eric Wiewiora
    posted to shaping reward reinforcement-learning q-value q-learning 2003 by ddahlem on 2008-09-30 11:40:03 as read
  • Improving elevator performance using reinforcement learning
    Vol. 8 (1996), pp. 1017-1023.
    by Robert H Crites, Andrew G Barto
  • Reinforcement learning methods for continuous-time Markov decision problems
    Vol. 7 (1995), pp. 393-400.
    by Steven J Bradtke, Michael O Duff
  • The MAXQ Method for Hierarchical Reinforcement Learning
    (1998), pp. 118-126.
    by Thomas G Dietterich
  • A hybrid web recommender system based on Q-learning
    (2008), pp. 1164-1168.
    by Nima Taghipour, Ahmad Kardan
  • Bemerkung: Sie können diese Seite wie folgt zitieren: http://www.citeulike.org/tag/q-learning

    RIS BibTeX
    CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.