Data e ora inizio evento:
Data e ora fine evento:
Sede:
Dipartimento di Matematica, Università di Roma "Tor Vergata"
Aula esterna:
Aula Dal Passo
The talk discusses a framework to analyze certain model-based reinforcement learning algorithm. Roughly speaking, this approach consists in designing a model to deal with situations in which the system dynamics is not known and encodes the available information about the state dynamics that an agent has as a measure on the space of functions. In this framework, a natural question is if whether the optimal policies and the value functions converge, respectively, to an optimal policy and to the value function of the real, underlying optimal control problem as soon as more information on the environment is gathered by the agent. We provide a positive answer in the linear-quadratic case and discuss some results also in the control-affine nonlinear case.
NB:This talk is part of the activity of the MUR Excellence Department Project MATH@TOV CUP E83C23000330006
NB:This talk is part of the activity of the MUR Excellence Department Project MATH@TOV CUP E83C23000330006
Speaker ed affiliazione:
Michele Palladino
Contatti/Organizzatori:
molle@mat.uniroma2.it
Data pubblicazione evento: