AI Lecture Series
Attending
The format is offered hybrid via Zoom (click) and on-site in the open space (ground floor) of the MC building. If you are new to the campus, check out our Campus Map and Directions.
Speaker
Thiago D. Simão, Safety in Reinforcement Learning, 10.04.2025 15:00 CEST
Thiago D. Simão, Safety in Reinforcement Learning, 10.04.2025 15:00 CEST
Title: Safety in Reinforcement Learning
Abstract: Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the underlying environment. These agents often learn through experience, using a trial-and-error strategy that can lead to practical innovations, but this randomized process might cause undesirable events. Safe RL studies how to make such agents more reliable and how to ensure they behave appropriately. In this talk, we discuss these issues from online settings, where the agent interacts directly with the environment, to offline settings, where the agent only has access to historical data. We present new RL methods that exploit different types of prior knowledge to provide safety and reliability. Exploiting such prior knowledge, we present reliable offline algorithms that can improve the policy using less data and online algorithms that comply with safety constraints while learning. Besides safety and reliability, we also touch on other challenges preventing the deployment of RL in the real world, such as partial observability, generalization, and high-dimensional data.
Vitae: Thiago D. Simão (he/him) is an Assistant Professor in the Department of Mathematics and Computer Science at Eindhoven University of Technology (TU/e). He completed his Ph.D. in the Algorithmics Group at Delft University of Technology under the supervision of Dr. Matthijs Spaan. Following his doctoral studies, he worked as a Postdoctoral Researcher in the Department of Software Science at Radboud University Nijmegen, collaborating with Dr. Nils Jansen. His research focuses on enhancing the reliability of AI techniques to enable safe deployment in real-world applications. Specifically, he is interested in safe reinforcement learning, ensuring AI systems meet performance guarantees while preventing catastrophic failures. Through his work, he contributed to the advancement of trustworthy AI solutions in complex and uncertain environments.
Slides: PDF
Umang Bhatt, Orchestrating AI Agents among Humans, 13.03.2025 15:00 CET
Umang Bhatt, Orchestrating AI Agents among Humans, 13.03.2025 15:00 CET
Title: Orchestrating AI Agents among Humans
Abstract: As AI agents are deployed in real-world settings, determining when to expose users to AI assistance becomes increasingly critical. Effective use of AI agents requires invoking the right agent at the right time. We introduce Modiste, an interactive tool for learning personalized decision support policies that dynamically adjust user access to AI agents. Modiste leverages tools from contextual bandits to optimize when and how AI agents provide support to balance performance, cost, and constraints. We further characterize the theoretical conditions under which orchestration between agents is beneficial. Our empirical studies show how selective access to AI agents, including deliberate disengagement from AI, can improve decision outcomes, reduce unnecessary AI use, and align AI agents with real-world constraints. We conclude with a call for human-centered interactive evaluation of AI agents by assessing the effectiveness of AI agents via multi-turn interactions with experts.
Vitae: Umang Bhatt is an Assistant Professor & Faculty Fellow at the Center for Data Science at New York University and a Senior Research Associate in Safe and Ethical AI at the Alan Turing Institute. He completed his PhD in the Machine Learning Group at the University of Cambridge. His research lies in human-AI collaboration, AI governance, and algorithmic transparency. His work has been supported by a JP Morgan PhD Fellowship and a Mozilla Fellowship. Previously, he was a Research Fellow at the Partnership on AI, a Fellow at Harvard’s Center for Research on Computation and Society, and an Advisor to the Responsible AI Institute. Umang received his MS and BS in Electrical and Computer Engineering from Carnegie Mellon University.
Organizers


