Better Assumptions, Stronger Conclusions: The Case for Ordinal Regression in HCI
Brandon Victor Syiem, Eduardo Velloso
TL;DR
This paper argues that ordinal data analyses in HCI currently suffer from inconsistent practices and over-reliance on metric-based methods. It advocates for ordinal regression via cumulative link (mixed) models (CL(M)Ms) to respect inherent ordering without assuming equal intervals, and illustrates their application with open CHI datasets using R. Through a systematic CHI 2024 literature review and practical CLM/CLMM re-analyses, the authors demonstrate that CL(M)Ms can reveal effects missed by traditional ANOVA and post-hoc tests, supporting more robust, reproducible inferences. The work contributes methodological guidance, accessible tooling, and concrete examples to help HCI researchers adopt ordinal-first modelling for more reliable insights and cross-study comparability.
Abstract
Despite the widespread use of ordinal measures in HCI, such as Likert-items, there is little consensus among HCI researchers on the statistical methods used for analysing such data. Both parametric and non-parametric methods have been extensively used within the discipline, with limited reflection on their assumptions and appropriateness for such analyses. In this paper, we examine recent HCI works that report statistical analyses of ordinal measures. We highlight prevalent methods used, discuss their limitations and spotlight key assumptions and oversights that diminish the insights drawn from these methods. Finally, we champion and detail the use of cumulative link (mixed) models (CLM/CLMM) for analysing ordinal data. Further, we provide practical worked examples of applying CLM/CLMMs using R to published open-sourced datasets. This work contributes towards a better understanding of the statistical methods used to analyse ordinal data in HCI and helps to consolidate practices for future work.
