Generalized partitioned local depth
Kenneth S. Berenhaut, John D. Foley, Liangdongsheng Lyu
TL;DR
This work generalizes partitioned local depth (PaLD) by introducing probabilistic locality via $R_{x,y,z}$ and probabilistic support division via $Q_{x,y,z}$, enabling cohesion analysis under uncertainty and conflicting information. It formalizes generalized partitioned local depth $\ell_{S,\boldsymbol{R},\boldsymbol{Q}}(x)$ and cohesion $C_{x,w}$, proving core properties such as dissipation under separation and conservation of cohesion, and recovers the original PaLD model when $R$ and $Q$ are binary. The paper demonstrates practical applications, including combining multiple dissimilarity measures, event-based data, and data uncertainty, with examples from cultural distances, political analysis, and sports data (NBA). Overall, the framework enables robust inference of community structure in non-metric, uncertain data without requiring fixed neighborhoods or distributional assumptions, and opens avenues for uncertainty-aware network analysis and persistent-like studies.
Abstract
In this paper we provide a generalization of the concept of cohesion as introduced recently by Berenhaut, Moore and Melvin [Proceedings of the National Academy of Sciences, 119 (4) (2022)]. The formulation presented builds on the technique of partitioned local depth by distilling two key probabilistic concepts: local relevance and support division. Earlier results are extended within the new context, and examples of applications to revealing communities in data with uncertainty are included. The work sheds light on the foundations of partitioned local depth, and extends the original ideas to enable probabilistic consideration of uncertain, variable and potentially conflicting information.
