| 
  • If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • Stop wasting time looking for files and revisions. Connect your Gmail, DriveDropbox, and Slack accounts and in less than 2 minutes, Dokkio will automatically organize all your file attachments. Learn more and claim your free account.

View
 

RL is table lookup

Page history last edited by Satinder Singh 11 years, 8 months ago

One hopes this myth is no longer subscribed to by anybody, but just in case...

 


In the early days of RL, much of the research in the field was focused on developing new algorithms and

  1. analyzing them for basic kinds of soundness such as asymptotic convergence and for this assuming lookup tables sufficed, and
  2. empirically testing them for basic correctness and again for this experiments with small problems in which one can use lookup tables sufficed. Thus many viewers of early RL talks got treated to many research papers with RL and lookup tables. This may account for the myth say in the early 1990's. It should have frittered away given that all of the success stories in RL use function approximation. Indeed, the two basic text books on RL (Sutton & Barto's Reinforcement Learning: An Introduction and Bertsekas & Tsitsiklis' Neuro Dynamic Programming) present many results and much discussion of RL and function approximation.

Of course, even today when someone develops a new RL algorithmic idea they often test it first on small lookup table like problems. This may account for the few holdouts on this otherwise untenable myth.

 

Comments (0)

You don't have permission to comment on this page.