Table of Contents
Fetching ...

Approximation Algorithms for Action-Reward Query-Commit Matching

Mahsa Derakhshan, Andisheh Ghasemi, Calum MacRury

Abstract

Matching problems under uncertainty arise in applications such as kidney exchange, hiring, and online marketplaces. A decision-maker must sequentially explore potential matches under local exploration constraints, while committing irrevocably to successful matches as they are revealed. The query-commit matching problem captures these challenges by modeling edges that succeed independently with known probabilities and must be accepted upon success, subject to vertex patience (time-out) constraints limiting the number of incident queries. In this work, we introduce the action-reward query-commit matching problem, a strict generalization of query-commit matching in which each query selects an action from a known action space, determining both the success probability and the reward of the queried edge. If an edge is queried using a chosen action and succeeds, it is irrevocably added to the matching, and the corresponding reward is obtained; otherwise, the edge is permanently discarded. We study the design of approximation algorithms for this problem on bipartite graphs. This model captures a broad class of stochastic matching problems, including the sequential pricing problem introduced by Pollner, Roghani, Saberi, and Wajc (EC~2022). On the positive side, Pollner et al. designed a polynomial-time approximation algorithm achieving a ratio of $0.426$ in the one-sided patience setting, which degrades to $0.395$ when both sides have bounded patience. In this work, we design computationally efficient algorithms for the action-reward query-commit in one-sided and two-sided patience settings, achieving approximation ratios of $1-1/e \approx 0.63$ and $\frac{1}{27}\!\left(19-67/e^3\right) \approx 0.58$ respectively. These results improve the state of the art for the sequential pricing problem, surpassing the previous guarantees of $0.426$ and $0.395$.

Approximation Algorithms for Action-Reward Query-Commit Matching

Abstract

Matching problems under uncertainty arise in applications such as kidney exchange, hiring, and online marketplaces. A decision-maker must sequentially explore potential matches under local exploration constraints, while committing irrevocably to successful matches as they are revealed. The query-commit matching problem captures these challenges by modeling edges that succeed independently with known probabilities and must be accepted upon success, subject to vertex patience (time-out) constraints limiting the number of incident queries. In this work, we introduce the action-reward query-commit matching problem, a strict generalization of query-commit matching in which each query selects an action from a known action space, determining both the success probability and the reward of the queried edge. If an edge is queried using a chosen action and succeeds, it is irrevocably added to the matching, and the corresponding reward is obtained; otherwise, the edge is permanently discarded. We study the design of approximation algorithms for this problem on bipartite graphs. This model captures a broad class of stochastic matching problems, including the sequential pricing problem introduced by Pollner, Roghani, Saberi, and Wajc (EC~2022). On the positive side, Pollner et al. designed a polynomial-time approximation algorithm achieving a ratio of in the one-sided patience setting, which degrades to when both sides have bounded patience. In this work, we design computationally efficient algorithms for the action-reward query-commit in one-sided and two-sided patience settings, achieving approximation ratios of and respectively. These results improve the state of the art for the sequential pricing problem, surpassing the previous guarantees of and .
Paper Structure (38 sections, 21 theorems, 117 equations, 3 algorithms)

This paper contains 38 sections, 21 theorems, 117 equations, 3 algorithms.

Key Result

Theorem 1.1

Given a bipartite graph $G=(U,V,E)$ with action set $\mathcal{A}$, let $\beta = 1-1/e$ if $\ell_u \in \{1, \infty\}$ for all $u \in U$, and $\beta = \frac{1}{27} (19 - \frac{67}{e^3})$, otherwise. For any arbitrarily small $\epsilon \in (0, 1)$, there exists a policy with expected reward of Moreover, the policy can be computed in time $f(1/\epsilon)\cdot \text{poly}(|G|, |\mathcal{A}|)$ where $f

Theorems & Definitions (27)

  • Theorem 1.1: Main Theorem
  • Theorem 2.1: Proof in \ref{['pf:thm:LP_relaxation']}
  • Lemma 3.1: Proof in \ref{['pf:lem:edge_variable']}
  • Definition 1
  • Lemma 3.2: Proof in \ref{['pf:lem:fixed_query_vertex']}
  • Theorem 3.3
  • proof : Proof of \ref{['thm:reduction_to_trs']}
  • Theorem 3.4
  • Lemma 3.5: Proof in \ref{['pf:lem:trs_to_prcrs']}
  • Lemma 3.6: Proof in \ref{['pf:lem:p_values']}
  • ...and 17 more