Algorithms of Reinforcement Learning


The ambition of this page is to be a comprehensive collection of links to papers describing RL algorithms. In order to make this list manageable we should only consider RL algorithms that originated a class of algorithms and have been used/studied by at least one researcher(s) unaffiliated with the original inventor(s) of the algorithm.


Associative Reinforcement Learning Algorithms

Actor-Critic

 

Temporal Differences (TD)

 

Q-learning

Dyna        (shows that learning and planning are related and integral to RL)

 

Real-Time Dynamic Programming

 

SARSA     (onpolicy method for control)

 

LSTD

 

LSPI     (Least Squares Policy Iteration)

 

Parti-Game

 

Prioritized Sweeping

 

E^3

  

Sparse-Sampling    

 

Trajectory Tree

 

Policy Search Algorithms

 

Policy Gradient Algorithms

  

Linear Programming based RL methods

 

Algorithms for Structured MDPs/POMDPs

 

Hierarchical Reinforcement Learning Algorithms