Normalizing Speed-accuracy Biases in 2D Pointing Tasks with Better Calculation of Effective Target Widths

Shota Yamanaka; I. Scott MacKenzie

Normalizing Speed-accuracy Biases in 2D Pointing Tasks with Better Calculation of Effective Target Widths

Shota Yamanaka, I. Scott MacKenzie

TL;DR

The paper investigates how to normalize speed–accuracy biases in 2D pointing tasks by comparing univariate ($\upsigma_x$) and bivariate ($\upsigma_{xy}$) standard deviations for the effective width $W_ ext{e}$ within an ISO-style Fitts' law framework. Using three bias instructions and a large crowdsourced dataset (n=342) across multiple amplitudes and widths, the authors show that univariate $\upsigma_x$ yields higher model-fit ($R^2$) in mixed bias conditions, while TT-based effective width models with effective amplitude ($A_e$) offer the strongest throughput stability. A Monte Carlo simulation confirms these findings hold for smaller samples, with bivariate models rarely outperforming univariate ones as $N$ grows. The study argues for adopting ID$_{xTT}$ (and ID$_{xTT}A_e$ for TP stability) and TT task-axis definitions as practical defaults, contributing to standardized, bias-resistant evaluation of pointing performance in HCI. The work clarifies prior inconsistencies with Wobbrock et al. (2011) and provides guidance for reproducible reporting of $W_e$ calculations, task-axis choices, and bias normalization in future 2D Fitts’ law studies.

Abstract

For evaluations of 2D target selection using Fitts' law, ISO 9241-411 recommends using the effective target width (W_e) calculated using the univariate standard deviation of selection coordinates. Related research proposed using a bivariate standard deviation; however, the proposal was only tested using a single speed-accuracy bias condition, thus the assessment was limited. We compared the univariate and bivariate techniques in a 2D Fitts' law experiment using three speed-accuracy biases and 346 crowdworkers. Calculating W_e using the univariate standard deviation yielded higher model correlations across all bias conditions and produced more stable throughput among the biases. The findings were also consistent in cases using randomly sampled subsets of the participant data. We recommend that future research should calculate W_e using the univariate standard deviation for fair performance evaluations. Also, we found trivial effects when using nominal or effective amplitude and using different perspectives of the task axis.

Normalizing Speed-accuracy Biases in 2D Pointing Tasks with Better Calculation of Effective Target Widths

TL;DR

The paper investigates how to normalize speed–accuracy biases in 2D pointing tasks by comparing univariate (

) and bivariate (

) standard deviations for the effective width

within an ISO-style Fitts' law framework. Using three bias instructions and a large crowdsourced dataset (n=342) across multiple amplitudes and widths, the authors show that univariate

yields higher model-fit (

) in mixed bias conditions, while TT-based effective width models with effective amplitude (

) offer the strongest throughput stability. A Monte Carlo simulation confirms these findings hold for smaller samples, with bivariate models rarely outperforming univariate ones as

grows. The study argues for adopting ID

(and ID

for TP stability) and TT task-axis definitions as practical defaults, contributing to standardized, bias-resistant evaluation of pointing performance in HCI. The work clarifies prior inconsistencies with Wobbrock et al. (2011) and provides guidance for reproducible reporting of

calculations, task-axis choices, and bias normalization in future 2D Fitts’ law studies.

Abstract

Paper Structure (42 sections, 7 equations, 18 figures, 4 tables)

This paper contains 42 sections, 7 equations, 18 figures, 4 tables.

Introduction
Related Work
Fitts' Law and the Speed-Accuracy Tradeoff
Influence of Effective Width 1: Improving Model Fit Across Bias Conditions
Influence of Effective Width 2: Increasing Throughput Stability
Univariate vs. Bivariate Standard Deviation for Effective Width
Studies Citing Wobbrock et al.'s CHI 2011 Paper
Limitations of Previous Studies and Our Research Questions
Unexamined Aspect 1: The Bias-normalizing Capability of Univariate vs. Bivariate sigma
Unexamined Aspect 2: Nominal vs. Effective Amplitude
Unexamined Aspect 3: Definitions of the Task Axis
Method
Participants
Apparatus
Procedure
...and 27 more sections

Figures (18)

Figure 1: The effective-width method for 1D targets. Cross marks ("$\times$") denote endpoints. (a) In a 1D Fitts' law task where the $x$-axis represents the task axis, a participant aims for a target of width $W$ located at a distance $A$ using a cursor. (b) When the target is repeatedly selected, the spread of endpoints tends to form a normal distribution, and larger targets generally produce greater endpoint variability. (c) When participants are biased toward performing in a slow and accurate manner, the error rate is low but the movement time increases. Conversely, (d) when participants are biased toward fast and inaccurate movements, the error rate increases while the movement time decreases. By adjusting target width a posteriori, we can compare performance fairly using $\mathit{TP}$ regardless of accuracy variability.
Figure 2: (a) ISO 9241-411 multi-directional pointing with circular targets. When there are nine targets, select them in order from 1 to 10. (b--c) Rotating each trial so that the task axis points to the right when computing endpoint distributions.
Figure 3: Dispersion in the plane is identical, but we have (a) $\upsigma_\mathit{x}=1.83$ and (b) $\upsigma_\mathit{x}=0$ (arbitrary units such as cm or pixels).
Figure 4: Two definitions of the task axis. Depending on whether the start point is the previous target center or the previous successful click position (red cross), TT and CT yield different $\upsigma_\mathit{x}$ values.
Figure 5: Abstract images of the experimental task. (a) Before starting each $\textsf{bias}$ condition, the goal for the seven sequences including practice was described. (b) Participants clicked the targets in the specified order. The given $\textsf{bias}$ was always written at the top of the window. (c) After completing a sequence, the results and a message to take a break if needed were displayed.
...and 13 more figures

Normalizing Speed-accuracy Biases in 2D Pointing Tasks with Better Calculation of Effective Target Widths

TL;DR

Abstract

Normalizing Speed-accuracy Biases in 2D Pointing Tasks with Better Calculation of Effective Target Widths

Authors

TL;DR

Abstract

Table of Contents

Figures (18)