Microsoft Future Leaders in Robotics and AI Seminar Series: Thomas Berrueta

Friday, April 26, 2024
2:30 p.m.
Online Seminar

Towards Transparent & Reliable Embodied Reinforcement Learning Agents

Thomas Berrueta
PhD Candidate
Northwestern University

Abstract

The task-capabilities of robotic agents depend crucially on how their bodies materialize any action or decision made. Despite the clear importance of robot embodiment to decision-making, most algorithmic frameworks neglect or dismiss its impact. An important example of this lies in the widely used assumption that data are independent and identically distributed, which underpins all of machine learning. However, when data are collected sequentially from the experiences of agents this assumption does not generally hold, as is the case in most robotic reinforcement learning. In this talk, I will present a framework capable of overcoming these limitations by leveraging the statistical mechanics of ergodic processes, which I refer to as Maximum Diffusion Reinforcement Learning (MaxDiff RL). By measuring and exploiting statistical correlations, MaxDiff RL accounts for agent embodiment during learning and achieves provably robust state-of-the-art performance. Moreover, I will show that MaxDiff RL also provably enables agents to learn in continuous deployments. Taken together, I will argue that these results highlight future opportunities to formalize the role of robot embodiment during learning and control, paving the way towards more transparent and reliable embodied autonomy.

Bio

Thomas Berrueta is an interdisciplinary researcher and Ph.D. candidate at Northwestern University's Center for Robotics and Biosystems. He is a recipient of Northwestern University's Presidential Fellowship, which is the highest honor conferred to a graduate student by the university. His research explores the consequences of agent embodiment on learning and control across length scales. At the intersection of statistical physics, artificial intelligence, and robotics, his work seeks to make autonomous systems more adaptable, robust, and life-like by exploiting randomness and uncertainty in ways that parallel biology.

Audience: Public

Browse All Events

November 2024

SU	MO	TU	WE	TH	FR	SA
27	28	29	30	31	1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30

Submit an Event

SU	MO	TU	WE	TH	FR	SA
27	28	29	30	31	1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30

SU	MO	TU	WE	TH	FR	SA
27	28	29	30	31	1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30

SU	MO	TU	WE	TH	FR	SA
27	28	29	30	31	1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30