Some Statistical and Data Challenges When Building Early-Stage Digital Experimentation and Measurement Capabilities
C. H. Bryan Liu
TL;DR
This work develops a rigorous framework for building and justifying early-stage digital experimentation and measurement (DEM) capabilities. It introduces a ranking under lower uncertainty model to quantify the value and risk of DEM-driven noise reduction, providing closed-form expressions for the expected gain ${\mathbb{E}}(\mathcal{D})$ and its variance, thereby enabling Sharpe-ratio based business cases. It then offers a comprehensive treatment of statistical testing in digital experiments (NHST, Bayesian, sequential, and non-parametric methods), followed by a taxonomy of digital experiment datasets and an evaluation framework for personalization experiment designs. The thesis is substantiated with empirical verifications, public datasets (including the ASOS Digital Experiments Dataset), and case studies in e-commerce and marketing, culminating in a concrete path to apply these methodologies in industry settings and future research directions. Overall, the work advances both theory and practice for evaluating, designing, and deploying DEM capabilities with emphasis on data efficiency, causality, and scalable decision-making under uncertainty $${\Delta}$$, ${\mathcal{V}}$, ${\mathcal{E}}$, ${\mathcal{W}}$, and ${\mathcal{D}}$ across multiple designs and datasets.
Abstract
Digital experimentation and measurement (DEM) capabilities -- the knowledge and tools necessary to run experiments with digital products, services, or experiences and measure their impact -- are fast becoming part of the standard toolkit of digital/data-driven organisations in guiding business decisions. Many large technology companies report having mature DEM capabilities, and several businesses have been established purely to manage experiments for others. Given the growing evidence that data-driven organisations tend to outperform their non-data-driven counterparts, there has never been a greater need for organisations to build/acquire DEM capabilities to thrive in the current digital era. This thesis presents several novel approaches to statistical and data challenges for organisations building DEM capabilities. We focus on the fundamentals associated with building DEM capabilities, which lead to a richer understanding of the underlying assumptions and thus enable us to develop more appropriate capabilities. We address why one should engage in DEM by quantifying the benefits and risks of acquiring DEM capabilities. This is done using a ranking under lower uncertainty model, enabling one to construct a business case. We also examine what ingredients are necessary to run digital experiments. In addition to clarifying the existing literature around statistical tests, datasets, and methods in experimental design and causal inference, we construct an additional dataset and detailed case studies on applying state-of-the-art methods. Finally, we investigate when a digital experiment design would outperform another, leading to an evaluation framework that compares competing designs' data efficiency.
