Inverse Submodular Maximization with Application to Human-in-the-Loop Multi-Robot Multi-Objective Coverage Control

Guangyao Shi; Gaurav S. Sukhatme

Inverse Submodular Maximization with Application to Human-in-the-Loop Multi-Robot Multi-Objective Coverage Control

Guangyao Shi, Gaurav S. Sukhatme

TL;DR

This work considers a new type of inverse combinatorial optimization, Inverse Submodular Maximization (ISM), for human-in-the-loop multi-robot coordination, and proposes a new formulation, which aims to find a new set of parameters that minimally deviate from the current parameters while causing a greedy algorithm to output actions which are the same as those desired by the human supervisors.

Abstract

We consider a new type of inverse combinatorial optimization, Inverse Submodular Maximization (ISM), for human-in-the-loop multi-robot coordination. Forward combinatorial optimization, defined as the process of solving a combinatorial problem given the reward (cost)-related parameters, is widely used in multi-robot coordination. In the standard pipeline, the reward (cost)-related parameters are designed offline by domain experts first and then these parameters are utilized for coordinating robots online. What if we need to change these parameters by non-expert human supervisors who watch over the robots during tasks to adapt to some new requirements? We are interested in the case where human supervisors can suggest what actions to take, and the robots need to change the internal parameters based on such suggestions. We study such problems from the perspective of inverse combinatorial optimization, i.e., the process of finding parameters given solutions to the problem. Specifically, we propose a new formulation for ISM, in which we aim to find a new set of parameters that minimally deviate from the current parameters and can make the greedy algorithm output actions the same as those suggested by humans. We show that such problems can be formulated as a Mixed Integer Quadratic Program (MIQP). However, MIQP involves exponentially many binary variables, making it intractable for the existing solver when the problem size is large. We propose a new algorithm under the Branch $\&$ Bound paradigm to solve such problems. In numerical simulations, we demonstrate how to use ISM in multi-robot multi-objective coverage control, and we show that the proposed algorithm achieves significant advantages in running time and peak memory usage compared to directly using an existing solver.

Inverse Submodular Maximization with Application to Human-in-the-Loop Multi-Robot Multi-Objective Coverage Control

TL;DR

Abstract

Bound paradigm to solve such problems. In numerical simulations, we demonstrate how to use ISM in multi-robot multi-objective coverage control, and we show that the proposed algorithm achieves significant advantages in running time and peak memory usage compared to directly using an existing solver.

Paper Structure (13 sections, 4 theorems, 17 equations, 5 figures, 2 algorithms)

This paper contains 13 sections, 4 theorems, 17 equations, 5 figures, 2 algorithms.

Introduction
Related Work
Preliminaries
Forward Submodular Maximization
Problem Formulation
Case Study: Multi-Robot Coverage Control
Algorithm for ISM
Experiments
A Qualitative Example
Algorithm Validation
Conclusion
Appendix
Proof of Theorem \ref{['theorem:submodular_objective_horizon']}

Key Result

Theorem 1

The objective defined in Eq. eq:submodular_coverage_objecive is monotone submodular.

Figures (5)

Figure 1: A motivating example of inverse submodular maximization. A team of robots is deployed to detect multiple events, each of which is associated with a priority, and the task is cast as a submodular maximization problem. Black dotted lines are actions derived from task optimization and red dotted lines are actions suggested by humans. The team needs to minimally adjust the submodular objective to account for human suggestions.
Figure 2: One iteration of the proposed BB-ISM algorithm.
Figure 3: A qualitative example to illustrate how ISM can be used in the human-in-the-loop multi-robot multi-objective coverage control. (a) Three event density functions. (b) Robot trajectories without human suggestions. (c) Robot trajectories with human suggestions.
Figure 4: Optimality comparisons with baselines.
Figure 5: Running time and peak memory usage comparisons with baselines.

Theorems & Definitions (8)

Definition 1: Submodularity
Theorem 1
Theorem 2
Remark 1
Lemma 1
proof
Theorem 3: Theorem 1 in sun2017submodularity
proof

Inverse Submodular Maximization with Application to Human-in-the-Loop Multi-Robot Multi-Objective Coverage Control

TL;DR

Abstract

Inverse Submodular Maximization with Application to Human-in-the-Loop Multi-Robot Multi-Objective Coverage Control

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (8)