Λέσχη Φίλων Στατιστικής - GrStats forum
AUEB Stats Seminars 4/3/2021: Practical Distributionally Robust Markov Decision Processes using Relative Entropy by William Greenall (UCL)

For registration problems and other information, contact: grstats.forum@gmail.com or grstats@stat-athens.aueb.gr

grstats
http://stat-athens.aueb.gr/~grstats/

Tue 2 Mar 2021 - 22:15
AUEB STATISTICS SEMINAR SERIES MARCH 2021


William Greenall (PhD student, UCL. Supervisor: Petros Dellaportas)

Practical Distributionally Robust Markov Decision Processes using Relative Entropy

ABSTRACT

Distributionally Robust Markov Decision Processes offer a toolset for improving the performance of sequential optimisation algorithms in the face of poor or particularly uncertain estimates of a transition model. The literature has focused on Wasserstein distances as a tool for regulating the extent of robustness, but these are not simple to use owing to their lack of closed forms. The Kullback-Leibler (KL) divergence, on the other hand, has been shunned because its use has so far implied limited flexibility. I present a method that renders the KL divergence a useful and practical tool for constructing ambiguity sets, and build both discrete-state and continuous-state space decision processes using the formulation.

Link: meet.google.com/ywg-hzrp-xgf

Event date:
Thursday, March 4, 2021 - 12:30
