The Partner Modelling Questionnaire: A validated self-report measure of perceptions toward machines as dialogue partners
Philip R. Doyle, Iona Gessinger, Justin Edwards, Leigh Clark, Odile Dumbleton, Diego Garaialde, Daniel Rough, Anna Bleakley, Holly P. Branigan, Benjamin R. Cowan
TL;DR
This paper introduces the Partner Modelling Questionnaire (PMQ), an 18-item self-report scale that measures three dimensions of users’ perceptions of machines as dialogue partners: communicative competence/dependability, human-likeness in communication, and communicative flexibility. Through three phases—item generation, construct validity via CFA, and convergent/divergent validity plus test-retest reliability—the PMQ demonstrates strong internal consistency, robust factor structure, and good stability over 4–12 weeks. The 18-item version outperforms the original 23-item model in CFA, and the PMQ shows convergent validity with usability and anthropomorphism scales while maintaining discriminant validity. The PMQ offers researchers a direct, scalable tool to quantify partner models, enabling more precise investigations into how speech interface design influences user perceptions and subsequent interaction behaviors. Overall, the PMQ facilitates longitudinal studies and cross-domain comparisons, advancing theory and practice in speech interface design and human–machine dialogue.
Abstract
Recent work has looked to understand user perceptions of speech agent capabilities as dialogue partners (termed partner models), and how this affects user interaction. Yet, currently partner model effects are inferred from language production as no metrics are available to quantify these subjective perceptions more directly. Through three studies, we develop and validate the Partner Modelling Questionnaire (PMQ): an 18-item self-report semantic differential scale designed to reliably measure people's partner models of non-embodied speech interfaces. Through principal component analysis and confirmatory factor analysis, we show that the PMQ scale consists of three factors: communicative competence and dependability, human-likeness in communication, and communicative flexibility. Our studies show that the measure consistently demonstrates good internal reliability, strong test-retest reliability over 12 and 4-week intervals, and predictable convergent/divergent validity. Based on our findings we discuss the multidimensional nature of partner models, whilst identifying key future research avenues that the development of the PMQ facilitates. Notably, this includes the need to identify the activation, sensitivity, and dynamism of partner models in speech interface interaction.
