Expanding the Katz Index for Link Prediction: A Case Study on a Live Fish Movement Network
Michael-Sam Vidza, Marcin Budka, Wei Koong Chai, Mark Thrush, Mickael Teixeira Alves
TL;DR
The study addresses the limitation of static link prediction in aquaculture disease modelling by extending the Katz Index to incorporate spatial distance and temporal dynamics. It introduces the Weighted Katz Index (WKI) and Edge Weighted Katz Index (EWKI), along with hybrid variants, and formulates the corresponding mathematical expressions, e.g., $KI_{(u,v)} = \sum_{l=1}^{\infty} \beta^l (A^l)_{(u,v)}$ and $\omega_{(u,v)} = e^{-\gamma \times d_{(u,v)}}$, to capture distance effects. EWKI achieves superior predictive performance with precision 0.988, recall 0.712, F1 0.827, and AUPR 0.970, while AUROC reaches 1.000 for EWKI and WKIEWKI, underscoring the value of spatial-temporal integration for disease spread prediction. The results demonstrate that geographic proximity and temporal movement patterns substantially improve link prediction in live-fish networks, offering actionable insights for surveillance and biosecurity planning. Future work will explore richer farm features, graph neural networks, and network rewiring under node removals to further enhance robustness and adaptability.
Abstract
In aquaculture, disease spread models often neglect the dynamic interactions between farms, hindering accuracy. This study enhances the Katz index (KI) to incorporate spatial and temporal patterns of fish movement, improving the prediction of farms susceptible to disease via live fish transfers. We modified the Katz index to create models like the Weighted Katz Index (WKI), Edge Weighted Katz Index (EWKI), and combined models (e.g., KIEWKI). These incorporate spatial distances and temporal movement patterns for a comprehensive aquaculture network connection prediction framework. Model performance was evaluated using precision, recall, F1-scores, AUPR, and AUROC. The EWKI model significantly outperformed the traditional KI and other variations. It achieved high precision (0.988), recall (0.712), F1-score (0.827), and AUPR (0.970). Combined models (KIEWKI, WKIEWKI) approached, but couldn't surpass, EWKI performance. This study highlights the value of extending Katz index models to improve disease spread predictions in aquaculture networks. The EWKI model's performance demonstrates an innovative and flexible approach to tackling spatial challenges within network analysis.
