@article{peters2024ral,
title={Contingency Games for Multi-Agent Interaction},
author={Peters, Lasse and Bajcsy, Andrea and Chiu, Chih-Yuan and Fridovich-Keil, David and Laine, Forrest and Ferranti, Laura and Alonso-Mora, Javier},
journal={IEEE Robotics and Automation Letters (RA-L)},
year={2024},
}
Contingency planning, wherein an agent generates a set of possible plans conditioned on the outcome of an uncertain event, is an increasingly popular way for robots to act under uncertainty. In this work we take a game-theoretic perspective on contingency planning, tailored to multi-agent scenarios in which a robot’s actions impact the decisions of other agents and vice versa. The resulting contingency game allows the robot to efficiently interact with other agents by generating strategic motion plans conditioned on multiple possible intents for other actors in the scene. Contingency games are parameterized via a scalar variable which represents a future time when intent uncertainty will be resolved. By estimating this parameter online, we construct a game-theoretic motion planner that adapts to changing beliefs while anticipating future certainty. We show that existing variants of game-theoretic planning under uncertainty are readily obtained as special cases of contingency games. Through a series of simulated autonomous driving scenarios, we demonstrate that contingency games close the gap between certainty-equivalent games that commit to a single hypothesis and non-contingent multi-hypothesis games that do not account for future uncertainty reduction.
Released upon publication of the manuscript:
Backend
Beyond these repositories, our contingency game solver relies on the following packages, already open-sourced:
Contingency games are best understood via an interactive example. Below, we provide an interactive version of the example given in the video above: an autonomous robot encountering a jaywalking pedestrian.
By varying the sliders controlling the main parameters of the contingency game, we can build intuition for how the robot’s behavior changes as a function of the uncertainty in the pedestrian’s intent. In particular, observe that:
The paper includes the following remark:
A low threshold results in more conservative behavior. As informed by a parameter sweep over , we choose for all experiments for best performance.
Below, we show the data underpinning this statement. This data was generated by running the receding-horizon Monte Carlo study for the crosswalk example at 11 different threshold values of , namely, . Note that we compute belief entropy with log-base = 2, so that corresponds to a uniform initial belief.