BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//University of Liverpool Computer Science Seminar System//v2//EN
BEGIN:VEVENT
DTSTAMP:20260411T111345Z
UID:Seminar-dept-385@lxserverM.csc.liv.ac.uk
ORGANIZER:CN=Lutz Oettershagen:MAILTO:Lutz.Oettershagen@liverpool.ac.uk
DTSTART:20150630T130000
DTEND:20150630T140000
SUMMARY:School Seminar Series
DESCRIPTION:Prof Ann Nowe: Multi-objective reinforcement learning\n\nMany real-world problems involve the optimization of multiple, possibly conflicting objectives. Multi-objective reinforcement learning (MORL) is a generalisation of standard reinforcement learning where the scalar reward signal is extended to multiple feedback signals, in essence, one for each objective.  In this talk, I present an overview of our multi-criteria n-armed bandit approaches as well as a novel temporal difference learning algorithm that integrates the Pareto dominance relation into a reinforcement learning approach. This Pareto Q-learning  algorithm is a multi-policy algorithm that learns a set of Pareto dominating policies. A key element of the algorithm is the fact that the immediate reward vector is estimated separately from the set of expected future discounted reward vectors. This decomposition allows us to update the sets and to exploit the learned policies consistently throughout the state space.\n\nhttps://www.csc.liv.ac.uk/research/seminars/abstract.php?id=385
LOCATION:Ashton Lecture Theatre
END:VEVENT
END:VCALENDAR
