Table of Contents
Fetching ...

General Position Subset Selection in Line Arrangements

Adrian Dumitrescu

TL;DR

This work investigates General Position Subset Selection (GPSS), the problem of selecting a largest subset of $n$ plane points with no three collinear, known to be NP-hard and APX-hard. It develops three improved approximation regimes by harnessing probabilistic methods and incidence geometry: (i) a constant-factor approximation for $\alpha$-dense lattice point sets with factor $c(\alpha)=Ω(α^{-2})$, (ii) an $Ω((\log n)^{-1/2})$-approximation for vertices of generic line arrangements, and (iii) an $Ω((\log n)^{-1/2})$-approximation when $\ell(P)=O(\sqrt{n})$ and $\kappa(P)=O(\sqrt{n})$. The core strategies mix grid-based Vandermonde-inspired partitions, and a two-step random sampling with deletion in line arrangements, underpinned by Beck-type incidence bounds and duality. Together, these results show that structured input sets permit substantially stronger GPSS guarantees than the general bound $Ω(n^{-1/2})$, with practical implications for selecting large in-general-position subsets in lattice-like and line-arrangement geometries.

Abstract

Given a set of points in the plane, the \textsc{General Position Subset Selection} problem is that of finding a maximum-size subset of points in general position, i.e., with no three points collinear. The problem is known to be ${\rm NP}$-complete and ${\rm APX}$-hard, and the best approximation ratio known is $Ω\left({\rm OPT}^{-1/2}\right) =Ω(n^{-1/2})$. Here we obtain better approximations in three specials cases: (I) A constant factor approximation for the case where the input set consists of lattice points and is \emph{dense}, which means that the ratio between the maximum and the minimum distance in $P$ is of the order of $Θ(\sqrt{n})$. (II) An $Ω\left((\log{n})^{-1/2}\right)$-approximation for the case where the input set is the set of vertices of a \emph{generic} $n$-line arrangement, i.e., one with $Ω(n^2)$ vertices. The scenario in (I) is a special case of that in (II). (III) An $Ω\left((\log{n})^{-1/2}\right)$-approximation for the case where the input set has at most $O(\sqrt{n})$ points collinear and can be covered by $O(\sqrt{n})$ lines. Our approximations rely on probabilistic methods and results from incidence geometry.

General Position Subset Selection in Line Arrangements

TL;DR

This work investigates General Position Subset Selection (GPSS), the problem of selecting a largest subset of plane points with no three collinear, known to be NP-hard and APX-hard. It develops three improved approximation regimes by harnessing probabilistic methods and incidence geometry: (i) a constant-factor approximation for -dense lattice point sets with factor , (ii) an -approximation for vertices of generic line arrangements, and (iii) an -approximation when and . The core strategies mix grid-based Vandermonde-inspired partitions, and a two-step random sampling with deletion in line arrangements, underpinned by Beck-type incidence bounds and duality. Together, these results show that structured input sets permit substantially stronger GPSS guarantees than the general bound , with practical implications for selecting large in-general-position subsets in lattice-like and line-arrangement geometries.

Abstract

Given a set of points in the plane, the \textsc{General Position Subset Selection} problem is that of finding a maximum-size subset of points in general position, i.e., with no three points collinear. The problem is known to be -complete and -hard, and the best approximation ratio known is . Here we obtain better approximations in three specials cases: (I) A constant factor approximation for the case where the input set consists of lattice points and is \emph{dense}, which means that the ratio between the maximum and the minimum distance in is of the order of . (II) An -approximation for the case where the input set is the set of vertices of a \emph{generic} -line arrangement, i.e., one with vertices. The scenario in (I) is a special case of that in (II). (III) An -approximation for the case where the input set has at most points collinear and can be covered by lines. Our approximations rely on probabilistic methods and results from incidence geometry.

Paper Structure

This paper contains 12 sections, 7 theorems, 14 equations, 3 figures.

Key Result

Lemma 1

PW13. Let $P$ be a set of $n$ points in the plane with at most $\ell$ collinear. Then the number of collinear triples in $P$ is $T = O(n^2 \log{\ell} + \ell^2 n)$.

Figures (3)

  • Figure 1: Left: a $2$-dense set of $n=25$ points in the $8 \times 8$ grid ($m=8$). Center: $V_0$; $|V_0|=m=8$ and $p=11$; points in $V_i$ may lie outside the grid, e.g., $(3,9)$ lies $2$ units above it. Right: The approximation algorithm returns $P \cap V_0$ (or $P \cap V_3$); here $|P \cap V_0|=|P \cap V_3|=4$.
  • Figure 2: Left: a generic line arrangement consisting of three bundles of $n/3$ nearly parallel lines each. Right: a non-generic line arrangement consisting of $3$ parallel lines and $n-3$ concurrent lines.
  • Figure 3: Collinear vertices in a line arrangement --incident to an induced line.

Theorems & Definitions (8)

  • Lemma 1
  • Theorem 2
  • Theorem 3
  • Theorem 4
  • Corollary 5
  • Theorem 6
  • Lemma 7
  • proof