Table of Contents
Fetching ...

Truthful High Dimensional Sparse Linear Regression

Liyang Zhu, Amina Manseur, Meng Ding, Jinyan Liu, Jinhui Xu, Di Wang

TL;DR

This is the first study on designing truthful (and privacy-preserving) mechanisms for high dimensional sparse linear regression and deriving a novel private estimator that has a closed-form expression.

Abstract

We study the problem of fitting the high dimensional sparse linear regression model with sub-Gaussian covariates and responses, where the data are provided by strategic or self-interested agents (individuals) who prioritize their privacy of data disclosure. In contrast to the classical setting, our focus is on designing mechanisms that can effectively incentivize most agents to truthfully report their data while preserving the privacy of individual reports. Simultaneously, we seek an estimator which should be close to the underlying parameter. We attempt to solve the problem by deriving a novel private estimator that has a closed-form expression. Based on the estimator, we propose a mechanism which has the following properties via some appropriate design of the computation and payment scheme: (1) the mechanism is $(o(1), O(n^{-Ω({1})}))$-jointly differentially private, where $n$ is the number of agents; (2) it is an $o(\frac{1}{n})$-approximate Bayes Nash equilibrium for a $(1-o(1))$-fraction of agents to truthfully report their data; (3) the output could achieve an error of $o(1)$ to the underlying parameter; (4) it is individually rational for a $(1-o(1))$ fraction of agents in the mechanism; (5) the payment budget required from the analyst to run the mechanism is $o(1)$. To the best of our knowledge, this is the first study on designing truthful (and privacy-preserving) mechanisms for high dimensional sparse linear regression.

Truthful High Dimensional Sparse Linear Regression

TL;DR

This is the first study on designing truthful (and privacy-preserving) mechanisms for high dimensional sparse linear regression and deriving a novel private estimator that has a closed-form expression.

Abstract

We study the problem of fitting the high dimensional sparse linear regression model with sub-Gaussian covariates and responses, where the data are provided by strategic or self-interested agents (individuals) who prioritize their privacy of data disclosure. In contrast to the classical setting, our focus is on designing mechanisms that can effectively incentivize most agents to truthfully report their data while preserving the privacy of individual reports. Simultaneously, we seek an estimator which should be close to the underlying parameter. We attempt to solve the problem by deriving a novel private estimator that has a closed-form expression. Based on the estimator, we propose a mechanism which has the following properties via some appropriate design of the computation and payment scheme: (1) the mechanism is -jointly differentially private, where is the number of agents; (2) it is an -approximate Bayes Nash equilibrium for a -fraction of agents to truthfully report their data; (3) the output could achieve an error of to the underlying parameter; (4) it is individually rational for a fraction of agents in the mechanism; (5) the payment budget required from the analyst to run the mechanism is . To the best of our knowledge, this is the first study on designing truthful (and privacy-preserving) mechanisms for high dimensional sparse linear regression.

Paper Structure

This paper contains 30 sections, 25 theorems, 95 equations, 1 table, 1 algorithm.

Key Result

Lemma 1

The estimation error of private estimator $\ddot{\Sigma}_{\bar{X} \bar{X}}$ satisfies with probability $1-d^{-\Omega(1)}$ that where the expectation takes over the randomness of the data records and the algorithm.

Theorems & Definitions (57)

  • Lemma 1
  • Remark 1
  • Theorem 2: Privacy
  • Remark 2
  • Theorem 3: Accuracy
  • Remark 3
  • Lemma 4
  • Theorem 5: Truthfulness
  • Theorem 6: Individual rationality
  • Remark 4
  • ...and 47 more