Table of Contents
Fetching ...

Analyzing and Learning from User Interactions for Search Clarification

Hamed Zamani, Bhaskar Mitra, Everest Chen, Gord Lueck, Fernando Diaz, Paul N. Bennett, Nick Craswell, Susan T. Dumais

TL;DR

The paper tackles uncertainty in search queries by analyzing how users interact with Bing's clarification pane at scale. It reveals systematic patterns in engagement, content templates, and click bias, and shows how certain query properties affect clarifications' effectiveness. To improve clarification quality, the authors introduce the RLC representation-learning framework with Intents Coverage and Answers Consistency encoders, leveraging BERT-based text encoding and attention mechanisms to re-rank candidate clarifications. Empirical results demonstrate that RLC-based re-ranking outperforms strong baselines on both click-driven and human-labeled metrics, with meaningful implications for open-domain clarification and multi-turn search interfaces.

Abstract

Asking clarifying questions in response to search queries has been recognized as a useful technique for revealing the underlying intent of the query. Clarification has applications in retrieval systems with different interfaces, from the traditional web search interfaces to the limited bandwidth interfaces as in speech-only and small screen devices. Generation and evaluation of clarifying questions have been recently studied in the literature. However, user interaction with clarifying questions is relatively unexplored. In this paper, we conduct a comprehensive study by analyzing large-scale user interactions with clarifying questions in a major web search engine. In more detail, we analyze the user engagements received by clarifying questions based on different properties of search queries, clarifying questions, and their candidate answers. We further study click bias in the data, and show that even though reading clarifying questions and candidate answers does not take significant efforts, there still exist some position and presentation biases in the data. We also propose a model for learning representation for clarifying questions based on the user interaction data as implicit feedback. The model is used for re-ranking a number of automatically generated clarifying questions for a given query. Evaluation on both click data and human labeled data demonstrates the high quality of the proposed method.

Analyzing and Learning from User Interactions for Search Clarification

TL;DR

The paper tackles uncertainty in search queries by analyzing how users interact with Bing's clarification pane at scale. It reveals systematic patterns in engagement, content templates, and click bias, and shows how certain query properties affect clarifications' effectiveness. To improve clarification quality, the authors introduce the RLC representation-learning framework with Intents Coverage and Answers Consistency encoders, leveraging BERT-based text encoding and attention mechanisms to re-rank candidate clarifications. Empirical results demonstrate that RLC-based re-ranking outperforms strong baselines on both click-driven and human-labeled metrics, with meaningful implications for open-domain clarification and multi-turn search interfaces.

Abstract

Asking clarifying questions in response to search queries has been recognized as a useful technique for revealing the underlying intent of the query. Clarification has applications in retrieval systems with different interfaces, from the traditional web search interfaces to the limited bandwidth interfaces as in speech-only and small screen devices. Generation and evaluation of clarifying questions have been recently studied in the literature. However, user interaction with clarifying questions is relatively unexplored. In this paper, we conduct a comprehensive study by analyzing large-scale user interactions with clarifying questions in a major web search engine. In more detail, we analyze the user engagements received by clarifying questions based on different properties of search queries, clarifying questions, and their candidate answers. We further study click bias in the data, and show that even though reading clarifying questions and candidate answers does not take significant efforts, there still exist some position and presentation biases in the data. We also propose a model for learning representation for clarifying questions based on the user interaction data as implicit feedback. The model is used for re-ranking a number of automatically generated clarifying questions for a given query. Evaluation on both click data and human labeled data demonstrates the high quality of the proposed method.

Paper Structure

This paper contains 33 sections, 1 equation, 9 figures, 7 tables.

Figures (9)

  • Figure 1: Few examples of clarification in web search.
  • Figure 2: Relative engagement rate (compared to the average engagement rate) per question template for the most frequent templates in the data.
  • Figure 3: A box plot for the relative engagement rate (compared to the average engagement rate) with respect to the entropy in the conditional answer click distribution. This plot is only computed for clarifications with five options.
  • Figure 4: Relative engagement rate (compared to the average engagement rate) per query length.
  • Figure 5: Conditional click rate per position for ambiguous vs. faceted queries for clarifications with five answers.
  • ...and 4 more figures