Orbit: A Framework for Designing and Evaluating Multi-objective Rankers

Chenyang Yang; Tesi Xiao; Michael Shavlovsky; Christian Kästner; Tongshuang Wu

Orbit: A Framework for Designing and Evaluating Multi-objective Rankers

Chenyang Yang, Tesi Xiao, Michael Shavlovsky, Christian Kästner, Tongshuang Wu

TL;DR

Orbit addresses the challenge of designing multi-objective rankers in production by centering objectives in the design and evaluation workflow. It introduces an objective-centric framework and an interactive system that lets stakeholders explore the objective space and assess trade-offs in real time. A user study with twelve industry practitioners shows Orbit improves design-space exploration, leads to more informed decisions, and fosters deeper consideration of trade-offs. The work suggests the approach generalizes to other multi-objective ML problems and helps bridge metric-centric and example-centric mindsets, enabling participatory design and better communication among cross-functional teams.

Abstract

Machine learning in production needs to balance multiple objectives: This is particularly evident in ranking or recommendation models, where conflicting objectives such as user engagement, satisfaction, diversity, and novelty must be considered at the same time. However, designing multi-objective rankers is inherently a dynamic wicked problem -- there is no single optimal solution, and the needs evolve over time. Effective design requires collaboration between cross-functional teams and careful analysis of a wide range of information. In this work, we introduce Orbit, a conceptual framework for Objective-centric Ranker Building and Iteration. The framework places objectives at the center of the design process, to serve as boundary objects for communication and guide practitioners for design and evaluation. We implement Orbit as an interactive system, which enables stakeholders to interact with objective spaces directly and supports real-time exploration and evaluation of design trade-offs. We evaluate Orbit through a user study involving twelve industry practitioners, showing that it supports efficient design space exploration, leads to more informed decision-making, and enhances awareness of the inherent trade-offs of multiple objectives. Orbit (1) opens up new opportunities of an objective-centric design process for any multi-objective ML models, as well as (2) sheds light on future designs that push practitioners to go beyond a narrow metric-centric or example-centric mindset.

Orbit: A Framework for Designing and Evaluating Multi-objective Rankers

TL;DR

Abstract

Orbit: A Framework for Designing and Evaluating Multi-objective Rankers

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)