Michael S. Lee

Hi, I am a final-year PhD student in the Robotics Institute at Carnegie Mellon University, working at the intersection of explainable AI and human-robot interaction. I am fortunate to be advised by Reid Simmons and Henny Admoni, and am part of the Human and Robot Partners (HARP) Lab.

As AI and robots increasingly enter human society, it is paramount that their reward functions and subsequent behaviors are transparent (i.e. understandable and predictable) to humans. Thus, I research how robots may intuitively summarize and convey their underlying reward functions (and subsequent policies) to humans using informative demonstrations.

I previously completed a Master's at Carnegie Mellon with Red Whittaker and Nathan Michael studying autonomous radiation source localization. And before that, I graduated from Princeton University ('16) with degrees in Mechanical & Aerospace Engineering and a certificate in Robotics & Intelligent Systems. My undergraduate thesis on modeling uncertainty in stereo visual odometry was advised by Nathan Michael and Jianxiong Xiao.

I plan to transition into industry in 2024, and am currently looking for opportunities related to my research interests (including but not limited to transparency, value/AI alignment, explainable AI, human-robot interaction).

Email  /  CV  /  Google Scholar  /  LinkedIn

profile photo
Research

Toward greater transparency (i.e. understandability and predictability) robot behaviors to humans, my current research explores how a robot may teach its underlying reward function to a human learner using informative demonstrations (i.e. ones that exhibit key tradeoffs in decision making).

Our first key insight is that a demonstration's informativeness to a human is not intrinsic, but is inherently tied to that human's prior beliefs and their current expectations of robot behavior. We thus rely on inverse reinforcement learning and counterfactual reasoning (i.e. the extent to which the robot's demonstration differs from the human's current expectations) to evaluate a candidate demonstration's informativeness to a human at revealing the robot's reward function.

Our second key insight is that informativeness and difficulty of comprehension are often two sides of the same coin for humans, and we leverage ideas from cognitive science (e.g. zone of proximal development / "Goldilocks" principle) to ensure that the selected demonstrations present the right level of challenge. Too small of a difference and the reconciliation in the human's mind is trivial, and too large of a difference and the gap is irreconcilable in one shot; we use scaffolding to incrementally increase in information gain and simultaneously ease humans into learning.

Finally, we are currently exploring how to select a suite of informative tests (which query the human's ability to correctly predict robot behavior unseen scenarios) that reveal and bridge remaining gaps in the human learner's understanding, which can then be further supported through subsequent targeted demonstrations in a closed-loop fashion.

Closed-loop Reasoning about Counterfactuals to Improve Policy Transparency
Michael S. Lee, Henny Admoni, Reid Simmons
ICML Workshop on Counterfactuals in Minds and Machines, 2023
pdf / poster

Design a closed-loop teaching scheme where AI policy is made more transparent to a human via demonstrations, tests, and feedback. Demonstrations are selected to be informative and understandable given a human's counterfactual expectations of the AI's policy.

Leveraging Contextual Counterfactuals Toward Belief Calibration
Qiuyi (Richard) Zhang, Michael S. Lee, Sherol Chen
ICML Workshop on Counterfactuals in Minds and Machines, 2023
pdf / poster

Leverage counterfactual reasoning over possible outcomes and recourses to identify optimal belief strengths (i.e. parameters) for algorithmic decision making.

Making Robot Policies Transparent to Humans Through Demonstrations
Michael S. Lee

My research statement summarizing the work to date and highlighting potential directions for future work.

Reasoning about Counterfactuals to Improve Human Inverse Reinforcement Learning
Michael S. Lee, Henny Admoni, Reid Simmons
International Conference on Intelligent Robots and Systems (IROS), 2022
pdf / code / user study

An informative demonstration is one that differs strongly from the learner’s expectations of what the robot will do given their current understanding of the robot’s decision making.

Building the Foundation of Robot Explanation Generation Using Behavior Trees
Zhao Han, Daniel Giger, Jordan Allspaw, Henny Admoni, Michael S. Lee, Holly Yanco
ACM Transactions on Human-Robot Interaction (THRI), 2022
pdf / code / project page / slides

A behavior tree hierarchically composed of goals, subgoals, steps, and actions supports explanation generation algorithms that convey causal information about robot behavior.

Machine Teaching for Human Inverse Reinforcement Learning
Michael S. Lee, Henny Admoni, Reid Simmons
Frontiers in Robotics and AI, 2021
pdf / code / user study

Augmenting an inverse reinforcement learning model of how humans learn from demonstrations with teaching strategies (e.g. scaffolding, simplicity, pattern discovery, and testing) better accommodates human learners.

Self-Assessing and Communicating Manipulation Proficiency Through Active Uncertainty Characterization
Michael S. Lee,
Pioneers Workshop at ACM/IEEE International Conference on Human-Robot Interaction (HRI), 2019
pdf

This proposal thus investigates how a robot can actively assess both its proficiency at a task and its confidence in that assessment through appropriate measures of uncertainty that can be efficiently and effectively communicated to a human.

I previously developed a novel gamma radiation map representation, source localization algorithm, and frontier-based exploration for efficient radiological characterization of nuclear facilities using a robot equipped with a gamma-ray camera.

Radiation Source Localization using a Gamma-ray Camera
Michael S. Lee
Master's Thesis, 2018
pdf / slides

The proposed frontier-based exploration method biases frontier selection with the observed radiation field gradient to quickly search an environment until a proximal source is detected.

Active Range and Bearing-based Radiation Source Localization
Michael S. Lee, Daniel Shy, Red Whittaker, Nathan Michael
International Conference on Intelligent Robots and Systems (IROS), 2018
pdf / slides

The proposed active source localization algorithm greedily selects new waypoints that maximize the Fisher Information provided by the gamma-ray camera’s range and bearing observations.

3-D Volumetric Gamma-ray Imaging and Source Localization with a Mobile Robot
Michael S. Lee Matthew Hanczor, Jiyang Chu, Zhong He, Nathan Michael, Red Whittaker
Waste Management Conference, 2018
pdf / slides

A ground robot equipped with a Compton gamma camera localizes multiple gamma radiation sources to within an average of 0.26 m or 0.6% of the environment dimensions in two 5×4 m2 and 14×6 m2 laboratory environments.



Design by Jon Barron.