filler

The repository contains useful downloadable material related to my research and teaching, including Matlab software, presentations, and demonstration movies. If an item has a "»" button to its right, this button can be clicked to reveal more information; the "«" button then hides this information again (requires Javascript).

Software

  • Approximate RL and DP toolbox, July 2013 release. (13 July 2013, 1.6 MBytes). »
  • Optimistic planning, a selection of algorithms as a stand-alone package. (13 July 2013, 79.3 KBytes). »
  • MARL toolbox ver. 1.3, a Matlab multi-agent reinforcement learning toolbox (4 August 2010, 336.9 KBytes). »
  • MARL toolbox documentation, the documentation files for the MARL toolbox (4 August 2010, 223.1 KBytes). »
  • Approximate RL and DP toolbox, developed in Matlab. (6 June 2010, 967.6 KBytes). »
  • makepdf, a Windows XP batch script to automate the creation of PDF files from DVI (21 November 2008, 2.4 KBytes).

Presentations

  • Optimistic planning for continuous-action deterministic systems, an overview of the SOOP algorithm (1 July 2013, 1.4 MBytes). »
  • Optimistic planning for networked control systems, explaining how the features of planning make it suitable for NCS (25 June 2013, 2.7 MBytes). »
  • Reinforcement learning with function approximation, my talk in the Optimal Adaptive Control workshop at the IEEE Conference on Decision and Control (11 December 2011, 5.5 MBytes). »
  • Optimistic planning for near-optimal control in MDPs, an in-depth description of our optimistic planning algorithm and its analysis (1 December 2011, 1.1 MBytes). »
  • Reinforcement learning lectures, introducing classical and approximate RL (3 March 2010, 2.1 MBytes). »
  • Reinforcement learning in continuous state and action spaces, my defense presentation, with a very gentle introduction to the topic. (13 January 2009, 391.3 KBytes). »
  • Model-based reinforcement learning with fuzzy approximation, an overview of our fuzzy Q-iteration algorithm, with convergence and consistency results. (9 April 2008, 930.5 KBytes). »
  • Reinforcement learning for multi-agent systems, a good overview talk to which I collaborated; this was presented by Prof. Robert Babuska at the CABS colloquium (the link opens a separate download page) (22 June 2006).

Demonstration Movies

  • Learning to swing up an inverted pendulum, using online least-squares policy iteration. (8 January 2009, 51.8 MBytes). »
  • Final swingup solution, after the online LSPI learning experiment was completed. (8 January 2009, 864.9 KBytes).
  • Robot goalkeeper learning to catch the ball, using approximate online RL and experience replay (demo by Sander Adam). (1 October 2008, 13.3 MBytes).