
Tag reinforcement-learning [173 articles]

Recent papers classified by the tag reinforcement-learning.
  • Keepaway Soccer: From Machine Learning Testbed to Benchmark
    RoboCup 2005: Robot Soccer World Cup IX (2006), pp. 93-105.
    by Peter Stone, Gregory Kuhlmann, Matthew Taylor, Yaxin Liu
    posted to reinforcement-learning by tsoumakas on 2008-10-11 08:30:33 as **
  • Keepaway Soccer: A Machine Learning Testbed
    RoboCup 2001: Robot Soccer World Cup V (2002), pp. 207-237.
    by Peter Stone, Richard Sutton
    posted to reinforcement-learning by tsoumakas on 2008-10-11 08:32:54 as **
  • Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies
    IEEE Internet Computing, Vol. 11, No. 1. (2007), pp. 22-30.
    by G Tesauro
  • The primate amygdala represents the positive and negative value of visual stimuli during learning
    Nature, Vol. 439, No. 7078. (16 February 2006), pp. 865-870.
    by Joseph J Paton, Marina A Belova, Sara E Morrison, Daniel C Salzman
  • Network formation by reinforcement learning: the long and medium run
    (5 April 2004)
    by Robin Pemantle, Brian Skyrms
  • Reinforcement Learning in Computational Finance
    by Yazann Romahi
    posted to reinforcement-learning by Scis0000002 on 2008-02-01 01:17:01 as **
  • Learning to trade via direct reinforcement
    IEEE-NN, Vol. 12 (July 2001), pp. 875-889.
    by J Moody, M Saffell
    posted to computational-trading reinforcement-learning by Scis0000002 on 2008-01-04 10:29:18 as **
  • Reinforcement learning in non-Markov environments
    (1994)
    by S Whitehead, L Lin
    posted to markovity nonmarkovity reinforcement-learning by Scis0000002 on 2007-12-18 22:38:01 as **
  • Infinite-Horizon Policy-Gradient Estimation
    Journal of Artificial Intelligence Research, Vol. 15 (November 2001), pp. 319-350.
    by Jonathan Baxter
    posted to reinforcement-learning by Scis0000002 on 2007-06-21 15:02:16 as **
  • Constructive Reinforcement Learning
    by Jose H Orallo
    posted to constructive reinforcement-learning by Scis0000002 on 2007-07-06 09:58:29 as **
  • Complex Behavior Specification for Autonomous Systems
    (1992), pp. 170-177.
    by J Malec
  • On Using Guidance in Relational Reinforcement Learning
    by Kurt Driessens, Saso Dzeroski
    posted to reinforcement-learning by Scis0000002 on 2007-11-28 14:02:24 as **
  • Integrating experimentation and guidance in relational reinforcement learning
    (2002)
    posted to reinforcement-learning by Scis0000002 on 2007-11-28 13:55:11 as **
  • Learning to trade via direct reinforcement
    Neural Networks, IEEE Transactions on, Vol. 12, No. 4. (2001), pp. 875-889.
    by J Moody, M Saffell
  • Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
    Artificial Intelligence, Vol. 112, No. 1-2. (1999), pp. 181-211.
    by Richard S Sutton, Doina Precup, Satinder P Singh
    posted to reinforcement-learning by Scis0000002 on 2008-01-20 02:22:54 as ** along with 2 people metropol cijat
  • A Reinforcement Learning Algorithm based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis
    by Abhijit Gosavi
    posted to reinforcement-learning by Scis0000002 on 2007-06-21 15:02:36 as **
  • Quantum Reinforcement Learning
    Advances in Natural Computation (2005), pp. 686-689.
    by Daoyi Dong, Chunlin Chen, Zonghai Chen
  • Reinforcement learning for optimized trade execution
    (2006), pp. 673-680.
    by Yuriy Nevmyvaka, Yi Feng, Michael Kearns
  • A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation
    Autonomic Computing, 2006. ICAC '06. IEEE International Conference on (2006), pp. 65-73.
    by G Tesauro, NK Jong, R Das, MN Bennani
  • Operant conditioning in skinnerbots
    Adaptive Behavior, Vol. 5, No. 3/4. (1997), pp. 219-247.
    posted to conditioning reinforcement-learning by scis0000001 on 2007-04-29 18:23:59 as **
  • Goedel Machines: Self-Referential Universal Problem Solvers Making Provably Optimal Self-Improvements
    (27 Dec 2004)
    by Juergen Schmidhuber
  • Hierarchical reinforcement learning based on subgoal discovery and subpolicy specialization: First experiments with the HASSLE algorithm
    (2003)
  • Universal Reinforcement Learning
    (20 Jul 2007)
    by Vivek F Farias, Ciamac C Moallemi, Tsachy Weissman, Benjamin Van Roy
  • Reinforcement Learning with Linear Function Approximation and LQ control Converges
    (9 Mar 2007)
    by Istvan Szita, Andras Lorincz
    posted to reinforcement-learning by sato-ryu on 2007-03-14 08:19:05 as **
  • Neural reinforcement learning for an obstacle avoidance behavior
    (1996), pp. 6/1-3.
    by C Touzet
    posted to reinforcement-learning by sato-ryu on 2007-06-03 16:13:29 as **
  • Function optimization using connectionist reinforcement learning algorithms
    Connection Science, Vol. 3 (1991), pp. 241-268.
    by R Williams, J Peng
  • A Bayesian Framework for Reinforcement Learning
    (2000), pp. 943-950.
    by Malcolm Strens
  • Reinforcement Learning for Operational Space Control
    Robotics and Automation, 2007 IEEE International Conference on (2007), pp. 2111-2116.
    by J Peters, S Schaal
    posted to reinforcement-learning optimal-control by rockelegancy on 2008-09-29 13:33:12 as *****
  • Using Reward-weighted Regression for Reinforcement Learning of Task Space Control
    Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on (2007), pp. 262-267.
    by J Peters, S Schaal
    posted to reinforcement-learning optimal-control by rockelegancy on 2008-09-29 13:38:00 as read
  • Learning to Control in Operational Space
    The International Journal of Robotics Research, Vol. 27, No. 2. (1 February 2008), pp. 197-212.
    by Jan Peters, Stefan Schaal
  • Learning in Multi-Robot Systems
    (1996), pp. 152-163.
    by Maja J Mataric
    edited by Gerhard Weiß, Sandip Sen
    posted to machine-learning multi-agent-systems reinforcement-learning by pthimon on 2007-08-01 15:43:20 as **
  • Learning to Cooperate via Policy Search
    (2000), pp. 307-314.
    by Leonid Peshkin, Kee E Kim, Nicolas Meuleau, Leslie P Kaelbling
    posted to consensus coordination reinforcement-learning by pthimon on 2007-10-29 18:04:48 as **
  • Geodesic Gaussian kernels for value function approximation
    Autonomous Robots
    by Masashi Sugiyama, Hirotaka Hachiya, Christopher Towell, Sethu Vijayakumar
  • Evolution and Learning in an Intrinsically Motivated Reinforcement Learning Robot
    Advances in Artificial Life (2007), pp. 294-303.
    by Massimiliano Schembri, Marco Mirolli, Gianluca Baldassarre
  • Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
    Nature Neuroscience, Vol. 10, No. 12. (18 November 2007), pp. 1615-1624.
    by Matthew R Roesch, Donna J Calu, Geoffrey Schoenbaum
  • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    Nature Neuroscience, Vol. 8, No. 12. (06 November 2005), pp. 1704-1711.
    by Nathaniel D Daw, Yael Niv, Peter Dayan
  • A computational substrate for incentive salience.
    Trends Neurosci, Vol. 26, No. 8. (August 2003), pp. 423-428.
    by SM McClure, ND Daw, PR Montague
  • Reinforcement learning and decision making in monkeys during a competitive game
    Cognitive Brain Research, Vol. 22, No. 1. (December 2004), pp. 45-58.
    by Daeyeol Lee, Michelle L Conroy, Benjamin P McGreevy, Dominic J Barraclough
    posted to decision games reinforcement-learning by nelmor on 2007-07-30 09:57:04 as ** along with 1 person brian
  • Tripartite Mechanism of Extinction Suggested by Dopamine Neuron Activity and Temporal Difference Model
    J. Neurosci., Vol. 28, No. 39. (24 September 2008), pp. 9619-9631.
    by Wei-Xing Pan, Robert Schmidt, Jeffery R Wickens, Brian I Hyland
    posted to td reinforcement-learning rats extinction dopamine by nelmor on 2008-09-25 11:08:11 as **
  • Midbrain dopamine neurons encode a quantitative reward prediction error signal.
    Neuron, Vol. 47, No. 1. (7 July 2005), pp. 129-141.
    by HM Bayer, PW Glimcher
  • Functional organization of the medial frontal cortex.
    Curr Opin Neurobiol, Vol. 17, No. 2. (April 2007), pp. 220-227.
    by MF Rushworth, MJ Buckley, TE Behrens, ME Walton, DM Bannerman
  • Computational roles for dopamine in behavioural control
    Nature, Vol. 431, No. 7010. (14 October 2004), pp. 760-767.
    by P Read Montague, Steven E Hyman, Jonathan D Cohen
  • Temporal Filtering of Reward Signals in the Dorsal Anterior Cingulate Cortex during a Mixed-Strategy Game
    J. Neurosci., Vol. 27, No. 31. (1 August 2007), pp. 8366-8377.
    by Hyojung Seo, Daeyeol Lee
    posted to decision reinforcement-learning reward risk by nelmor on 2007-08-03 10:47:08 as **
  • Midbrain dopamine neurons encode decisions for future action
    Nature Neuroscience, Vol. 9, No. 8. (23 July 2006), pp. 1057-1063.
    by Genela Morris, Alon Nevet, David Arkadir, Eilon Vaadia, Hagai Bergman
  • Representation of action-specific reward values in the striatum.
    Science, Vol. 310, No. 5752. (25 November 2005), pp. 1337-1340.
    by K Samejima, Y Ueda, K Doya, M Kimura
  • The computational neurobiology of learning and reward
    Current Opinion in Neurobiology, Vol. 16, No. 2. (April 2006), pp. 199-204.
    by Nathaniel D Daw, Kenji Doya
  • Solving the Distal Reward Problem through Linkage of STDP and Dopamine Signaling.
    Cereb Cortex (13 January 2007)
    by Eugene M Izhikevich
  • Glutamatergic activation of anterior cingulate cortex produces an aversive teaching signal
    Nat Neurosci, Vol. 7, No. 4. (April 2004), pp. 398-403.
    by Joshua P Johansen, Howard L Fields
    posted to aversive cingulate reinforcement-learning by nelmor on 2007-06-12 14:24:10 as **
  • Reinforcement learning signals predict future decisions.
    J Neurosci, Vol. 27, No. 2. (10 January 2007), pp. 371-378.
    by MX Cohen, C Ranganath
  • Note: You can cite this page as follows: http://www.citeulike.org/tag/reinforcement-learning
