Reconciling common source, specific source, feature based and score based likelihood ratios
Aafko Boonstra, Ronald Meester, Klaas Slooten
TL;DR
This paper formalizes a general Bayesian decision framework showing that the expected cost of decisions, given priors, cannot increase when additional information is incorporated via likelihood ratios. It provides a direct, information-theoretic proof that more information (full data or richer processing) improves decision quality, applicable to both score-based and feature-based LRs and to common-source versus specific-source contexts. Through a DNA kinship example, it demonstrates that LR systems differ only in the information processed, and that scores can be informative when full data are unavailable. The work argues against discarding score-based methods, emphasizes proper conditioning on source parameters, and offers a unified view that reconciles debates about LR systems with practical forensic decision-making. Key contributions include (i) a general inequality ${\mathbb{E}}[c(\pi(E))]\le c(\pi)$ for Bayes costs, (ii) a detailed two-hypotheses/two-actions illustration, (iii) extension to the general case, (iv) an empirical DNA kinship example, and (v) a critical analysis clarifying misunderstandings in the score-based versus common/specific-source debates, all within an information-theoretic framework.
Abstract
We show that the incorporation of any new piece of information allows for improved decision making in the sense that the expected costs of an optimal decision decrease (or, in boundary cases where no or not enough new information is incorporated, stays the same) whenever this is done by the appropriate update of the probabilities of the hypotheses. Versions of this result have been stated before. However, previous proofs rely on auxiliary constructions with proper scoring rules. We, instead, offer a direct and completely general proof by considering elementary properties of likelihood ratios only. We apply our results to make a contribution to the debates about the use of score based/feature based and common/specific source likelihood ratios. In the literature these are often presented as different ``LR-systems''. We argue that the difference between these is simply a matter which information is processed. There is no therefore no such thing as different ``LR-systems'', there are only differences in the processed information. In particular, despite claims to the contrary, scores can very well be used in forensic practice and we illustrate this with an extensive example in DNA kinship context.
