Mathematical Foundations of Reinforcement Learning [1 ECTS - 8hours]

Speaker: Prof. Dante Kalise - University of Nottingham
Tuesday, 9 June 2020, 10:30

This course concerns multi-stage decision processes in the framework of dynamic programming and the Bellman equation, where optimal policies are synthesized based on both immediate and long-term rewards. However, the computational requirements of dynamic programming techniques can become prohibitive when the policy/state space is overwhelmingly large, the so-called "Bellman's curse of dimensionality". In this course we will tackle this difficulty by means of different techniques for computing suboptimal solutions to dynamic programming equations. The lectures will address theoretical, algorithmic, and computational aspects of such techniques.
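
As a rough illustration of the Bellman equation mentioned above, the following sketch runs exact value iteration on a tiny deterministic decision process, using the fixed-point relation V(s) = max_a [ r(s, a) + gamma * V(f(s, a)) ]. It is not taken from the course material: the three-state chain, the names `next_state` and `reward`, and the discount factor 0.9 are all assumptions made for illustration. On a problem this small the exact solution is cheap; the suboptimal techniques covered in the course target state spaces far too large for this kind of enumeration.

```python
def value_iteration(next_state, reward, gamma=0.9, tol=1e-10, max_iter=10_000):
    """Exact value iteration for a small deterministic decision process.

    next_state[s][a] is the state reached from s under action a, and
    reward[s][a] is the immediate reward (both hypothetical toy inputs).
    """
    n_states = len(next_state)
    n_actions = len(next_state[0])
    V = [0.0] * n_states
    for _ in range(max_iter):
        # Bellman update: best immediate reward plus discounted future value.
        V_new = [
            max(reward[s][a] + gamma * V[next_state[s][a]] for a in range(n_actions))
            for s in range(n_states)
        ]
        converged = max(abs(x - y) for x, y in zip(V_new, V)) < tol
        V = V_new
        if converged:
            break
    # Greedy policy with respect to the computed value function.
    policy = [
        max(range(n_actions), key=lambda a: reward[s][a] + gamma * V[next_state[s][a]])
        for s in range(n_states)
    ]
    return V, policy


# Toy chain (assumed): states 0, 1, 2; action 0 moves left, action 1 moves right.
# State 2 is absorbing; entering it from state 1 yields reward 1.
next_state = [[0, 1], [0, 2], [2, 2]]
reward = [[0.0, 0.0], [0.0, 1.0], [0.0, 0.0]]

V, policy = value_iteration(next_state, reward)
print(V, policy)
```

The cost of each sweep grows with the number of states times the number of actions, which is what makes this direct approach impractical once the state space is a fine grid in many dimensions.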

Teacher: Dr. Dante Kalise (email: dante.kalise@nottingham.ac.uk)

Lectures will be recorded and live streamed according to the following schedule:
Tue   9 June    10:30-12:30    [video]
Wed  10 June    10:30-12:30    [video]
Thu  11 June    12:30-14:30    [video]  associated article: [arxiv]
Fri  12 June    10:30-12:30    [video]  the MATLAB code zermelo.m is available below

The handwritten notes are available below.


Students willing to participate are asked to send a registration email to: giacomo.albi@univr.it

Title  Format (Language, Size, Publication date)
Lect 1  pdf (it, 16396 KB, 11/06/20)
Lect 2  pdf (it, 4467 KB, 11/06/20)
Lect 3  pdf (it, 5196 KB, 12/06/20)
Lect 4  pdf (it, 4112 KB, 12/06/20)
zermelo.m  octet-stream (it, 1 KB, 12/06/20)

Contact person: Giacomo Albi

Publication date: 25 May 2020
