[email protected]: Dr Matthieu Geist - Kalman Temporal Differences
Istituto Dalle Molle di studi sull’intelligenza artificiale
Start date: 22 December 2009
End date: 23 December 2009
Generalization is an important problem in reinforcement learning (RL), and value function approximation (VFA) is a way to handle it. A value function approximator should exhibit several features: being sample efficient (data can be expensive, especially in an industrial context), handling nonlinearities (whether from a nonlinear parameterization such as a multilayer perceptron, or from the Bellman optimality equation itself), handling nonstationarities (the system may be nonstationary, but above all generalized policy iteration induces nonstationarities) and providing uncertainty information about estimates (which should prove useful for the dilemma between exploration and exploitation). After a quick survey of VFA, it will be shown that casting value function approximation as a filtering problem allows the introduction of a framework which handles all these problems at the same time.
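To illustrate the filtering view described above, here is a minimal sketch of a Kalman-filter-style temporal-difference update for a linear parameterization. This is an illustrative simplification, not the full Kalman Temporal Differences algorithm of the talk (which also covers nonlinear parameterizations via derivative-free sigma-point approximations): the value is modeled as V(s) = phi(s)^T theta, the reward is treated as a noisy linear observation of the parameters, and the names, noise levels, and random-walk evolution model below are assumptions.

```python
import numpy as np

def kalman_td_update(theta, P, phi_s, phi_s_next, r,
                     gamma=0.95, obs_noise=1.0, proc_noise=1e-3):
    """One Kalman-style TD update for a linear value function V(s) = phi(s)^T theta.

    theta : parameter estimate (mean of the posterior)
    P     : parameter covariance (the uncertainty information)
    Observation model (assumed): r = (phi_s - gamma * phi_s_next)^T theta + noise.
    """
    # Prediction step: a random-walk evolution model injects process noise,
    # which lets the filter track nonstationary (e.g. changing-policy) targets.
    P = P + proc_noise * np.eye(len(theta))
    # Linear observation row derived from the Bellman evaluation equation.
    H = phi_s - gamma * phi_s_next
    # Innovation variance, Kalman gain, and TD error (the innovation).
    S = H @ P @ H + obs_noise
    K = P @ H / S
    td_error = r - H @ theta
    # Correction step: update mean and covariance.
    theta = theta + K * td_error
    P = P - np.outer(K, H @ P)
    return theta, P

# Toy usage: a single transition to a terminal state (phi_s_next = 0) with
# reward 1, so theta should converge to 1 and the uncertainty should shrink.
theta, P = np.zeros(1), 10.0 * np.eye(1)
for _ in range(20):
    theta, P = kalman_td_update(theta, P, np.array([1.0]), np.array([0.0]), 1.0)
```

Because P is carried along with theta, the approximator directly exposes the uncertainty information mentioned above, which is what makes exploration schemes based on confidence intervals possible in this framework.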