Interactive Prompt Debugging with Sequence Salience
Ian Tenney, Ryan Mullins, Bin Du, Shree Pandya, Minsuk Kahng, Lucas Dixon
TL;DR
Sequence Salience addresses the difficulty of debugging long, complex prompts for language models by making input salience interpretable and actionable. It is an interactive visual tool that aggregates token-level salience to the word, sentence, or paragraph level and supports rapid iteration: practitioners edit the prompt, regenerate, and rerun salience on the new output. The approach uses gradient-based salience methods (gradnorm and grad-dot-input) and is implemented on the Learning Interpretability Tool (LIT) with backends for Gemma, Llama 2, Mistral, and GPT-2, so the same debugging workflow applies across models and across few-shot, chain-of-thought, and constitution-style prompting. The paper demonstrates the tool through case studies and releases open-source code to encourage adoption of interpretability methods in prompt design. The intended practical benefit is lower cognitive load and faster, more reliable development of long, semi-structured prompts.
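To make the two gradient-based methods named above concrete, here is a minimal sketch of how per-token scores are commonly computed once a model has produced input embeddings and the gradient of the target output with respect to them. It is not the tool's implementation: the function names, the plain-list representation, and the L1 normalization step are assumptions made for illustration, and a real backend would obtain the gradients from the model's autodiff framework.

```python
import math
from typing import List, Sequence

Matrix = Sequence[Sequence[float]]  # one row per token, one column per embedding dim


def grad_l2_norm(grads: Matrix) -> List[float]:
    """Gradient-norm salience: L2 norm of each token's gradient vector."""
    return [math.sqrt(sum(g * g for g in row)) for row in grads]


def grad_dot_input(embs: Matrix, grads: Matrix) -> List[float]:
    """Grad-dot-input salience: dot product of each token's embedding and gradient.

    Unlike the norm, this score is signed.
    """
    return [sum(e * g for e, g in zip(e_row, g_row))
            for e_row, g_row in zip(embs, grads)]


def normalize(scores: Sequence[float]) -> List[float]:
    """Scale scores so their absolute values sum to 1 (an assumed convention)."""
    total = sum(abs(s) for s in scores)
    return [s / total for s in scores] if total else list(scores)


# Toy values for three tokens with 2-dimensional embeddings (hypothetical data).
embs = [[1.0, 1.0], [2.0, 2.0], [-1.0, 5.0]]
grads = [[3.0, 4.0], [0.0, 0.0], [1.0, 0.0]]

print(grad_l2_norm(grads))          # [5.0, 0.0, 1.0]
print(grad_dot_input(embs, grads))  # [7.0, 0.0, -1.0]
```

The gradient norm is always non-negative, while grad-dot-input can be negative, so the two methods can rank the same tokens differently.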
Abstract
We present Sequence Salience, a visual tool for interactive prompt debugging with input salience methods. Sequence Salience builds on widely used salience methods for text classification and single-token prediction, and extends them into a system tailored for debugging complex LLM prompts. Our system is well-suited for long texts, and expands on previous work by 1) providing controllable aggregation of token-level salience to the word, sentence, or paragraph level, making salience over long inputs tractable; and 2) supporting rapid iteration where practitioners can act on salience results, refine prompts, and run salience on the new output. We include case studies showing how Sequence Salience can help practitioners work with several complex prompting strategies, including few-shot, chain-of-thought, and constitutional principles. Sequence Salience is built on the Learning Interpretability Tool, an open-source platform for ML model visualizations, and code, notebooks, and tutorials are available at http://goo.gle/sequence-salience.
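The aggregation step described in the abstract can be illustrated with a short sketch that rolls token-level scores up to word-level groups. This is a simplified illustration, not the tool's code: the helper names are hypothetical, the word-boundary marker assumes a SentencePiece-style tokenizer, and summation is only one possible way to combine scores within a group.

```python
from typing import Callable, Dict, List, Sequence

WORD_START = "\u2581"  # SentencePiece-style word-boundary marker (assumed tokenizer)


def word_group_ids(tokens: Sequence[str], boundary: str = WORD_START) -> List[int]:
    """Assign each token the index of the word it belongs to."""
    ids: List[int] = []
    current = -1
    for tok in tokens:
        # A new word starts at a boundary marker, or at the very first token.
        if tok.startswith(boundary) or current < 0:
            current += 1
        ids.append(current)
    return ids


def aggregate_salience(
    token_scores: Sequence[float],
    group_ids: Sequence[int],
    reduce: Callable[[List[float]], float] = sum,
) -> List[float]:
    """Combine token scores within each group; returns one score per group."""
    groups: Dict[int, List[float]] = {}
    for score, gid in zip(token_scores, group_ids):
        groups.setdefault(gid, []).append(score)
    return [reduce(groups[gid]) for gid in sorted(groups)]


# Hypothetical tokens and scores for illustration only.
tokens = [WORD_START + "Debug", "ging", WORD_START + "prompt", "s", WORD_START + "fast"]
scores = [0.25, 0.25, 0.125, 0.125, 0.25]

ids = word_group_ids(tokens)
print(ids)                                   # [0, 0, 1, 1, 2]
print(aggregate_salience(scores, ids))       # [0.5, 0.25, 0.25]
print(aggregate_salience(scores, ids, max))  # [0.25, 0.125, 0.25]
```

The same `aggregate_salience` function works at the sentence or paragraph level if `group_ids` is derived from sentence or paragraph boundaries instead of word boundaries, which is what keeps salience over long inputs readable.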
