Submodular Surrogates for Value of Information

Yuxin Chen; Shervin Javdani; Amin Karbasi; J. Bagnell; Siddhartha Srinivasa; Andreas Krause

doi:10.1609/aaai.v29i1.9694

Authors

Yuxin Chen ETH Zurich
Shervin Javdani Carnegie Mellon University
Amin Karbasi Yale University
J. Bagnell Carnegie Mellon University
Siddhartha Srinivasa Carnegie Mellon University
Andreas Krause ETH Zurich

DOI:

https://doi.org/10.1609/aaai.v29i1.9694

Keywords:

Sequential Decision Making, Value of Information, Adaptive Submodularity, Decision Region Determination, Touch-based Localizatoin

Abstract

How should we gather information to make effective decisions? A classical answer to this fundamental problem is given by the decision-theoretic value of information. Unfortunately, optimizing this objective is intractable, and myopic (greedy) approximations are known to perform poorly. In this paper, we introduce DiRECt, an efficient yet near-optimal algorithm for nonmyopically optimizing value of information. Crucially, DiRECt uses a novel surrogate objective that is: (1) aligned with the value of information problem (2) efficient to evaluate and (3) adaptive submodular. This latter property enables us to utilize an efficient greedy optimization while providing strong approximation guarantees. We demonstrate the utility of our approach on four diverse case-studies: touch-based robotic localization, comparison-based preference learning, wild-life conservation management, and preference elicitation in behavioral economics. In the first application, we demonstrate DiRECt in closed-loop on an actual robotic platform.

Submodular Surrogates for Value of Information

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription