The Last Success Problem with Samples

Toru Yoshinaga; Yasushi Kawase

The Last Success Problem with Samples

Toru Yoshinaga, Yasushi Kawase

TL;DR

The paper tackles the last success problem under unknown distributions by introducing sampling-based variants. It shows a stark separation from the complete-information benchmark: with no samples, no positive win guarantee exists; with a single sample, a deterministic ASLS policy achieves at least $1/4$ when the odds-sum $R$ satisfies $R\ge(\sqrt{3}-1)/2$, while a randomized approach reaches $1/4$ only for $R\ge1/2$. For any fixed $\varepsilon>0$, a constant number of samples per distribution suffices to guarantee $1/e-\varepsilon$, and further, with more samples one can approach the complete-information bound when $R\ge1$, though no finite-sample policy reaches exactly $1/e$. The results establish both tight upper-lower bounds and practical threshold-based policies, advancing understanding of online stopping with limited information.

Abstract

The last success problem is an optimal stopping problem that aims to maximize the probability of stopping on the last success in a sequence of independent $n$ Bernoulli trials. In the classical setting where complete information about the distributions is available, Bruss~\cite{B00} provided an optimal stopping policy that ensures a winning probability of $1/e$. However, assuming complete knowledge of the distributions is unrealistic in many practical applications. This paper investigates a variant of the last success problem where samples from each distribution are available instead of complete knowledge of them. When a single sample from each distribution is allowed, we provide a deterministic policy that guarantees a winning probability of $1/4$. This is best possible by the upper bound provided by Nuti and Vondrák~\cite{NV23}. Furthermore, for any positive constant $ε$, we show that a constant number of samples from each distribution is sufficient to guarantee a winning probability of $1/e-ε$.

The Last Success Problem with Samples

TL;DR

when the odds-sum

satisfies

, while a randomized approach reaches

only for

. For any fixed

, a constant number of samples per distribution suffices to guarantee

, and further, with more samples one can approach the complete-information bound when

, though no finite-sample policy reaches exactly

. The results establish both tight upper-lower bounds and practical threshold-based policies, advancing understanding of online stopping with limited information.

Abstract

The last success problem is an optimal stopping problem that aims to maximize the probability of stopping on the last success in a sequence of independent

Bernoulli trials. In the classical setting where complete information about the distributions is available, Bruss~\cite{B00} provided an optimal stopping policy that ensures a winning probability of

. However, assuming complete knowledge of the distributions is unrealistic in many practical applications. This paper investigates a variant of the last success problem where samples from each distribution are available instead of complete knowledge of them. When a single sample from each distribution is allowed, we provide a deterministic policy that guarantees a winning probability of

. This is best possible by the upper bound provided by Nuti and Vondrák~\cite{NV23}. Furthermore, for any positive constant

, we show that a constant number of samples from each distribution is sufficient to guarantee a winning probability of

Paper Structure (10 sections, 14 theorems, 35 equations, 2 figures)

This paper contains 10 sections, 14 theorems, 35 equations, 2 figures.

Introduction
Related work
Preliminaries
Model
Basic observation
Single sample model
Estimating the sum of the odds
Direct utilization of success positions
Applying a policy by Nuti and Vondrák
Multiple Sample Model

Key Result

Theorem 1

For any positive real $\epsilon$, there does not exist a (randomized) stopping policy for the single sample last success problem that guarantees a winning probability of $1/4 + \epsilon$ even when a success occurs with probability one.

Figures (2)

Figure 1: Classifications of the relative positions of $i_1,i_2,j_1,j_2$. Red lines indicate locations without success.
Figure 2: The winning probability of the ASLS policy (red line) and an upper bound of any policy for the $(1,R)$-last success problem (black dashed line).

Theorems & Definitions (25)

Theorem 1: Nuti and Vondrák NV23
Lemma 1
proof
Proposition 1: Bruss B00
Proposition 2
proof
Theorem 2
Theorem 3
proof
Theorem 4
...and 15 more

The Last Success Problem with Samples

TL;DR

Abstract

The Last Success Problem with Samples

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (25)