Learning Developmental Age from 3D Infant Kinetics Using Adaptive Graph Neural Networks
Daniel Holmberg, Manu Airaksinen, Viviana Marchi, Andrea Guzzetta, Anna Kivi, Leena Haataja, Sampsa Vanhatalo, Teemu Roos
TL;DR
Kinetic Age (KA) estimates an infant's corrected age from spontaneous motor patterns, using pose-estimated 3D data and an adaptive graph neural network that models spatio-temporal dependencies. The approach integrates multi-stream features (joint coordinates, bone directions, velocity, acceleration) within a single graph, and uses ST-GCN/AAGCN blocks with per-sample attention for age estimation. Experiments show that 3D data and adaptive graph structures outperform 2D baselines and hand-crafted feature indices, allowing KA and KA-gap to distinguish typical from at-risk infants. The work provides a publicly released 3D infant kinetics dataset and growth-chart style KA predictions, underscoring the potential for early screening while calling for larger, longitudinal validation.
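To make the multi-stream input concrete, the following is a minimal NumPy sketch of how the four streams could be derived from a pose sequence and stacked per joint. It is not the authors' implementation: the function name `multi_stream_features`, the `parents` skeleton encoding, the zero padding of the first frames, and the channel order are all assumptions made for illustration.

```python
import numpy as np

def multi_stream_features(joints, parents):
    """Stack coordinates, bone directions, velocity and acceleration per joint.

    joints:  (T, V, 3) array of 3D joint positions over T frames (hypothetical layout).
    parents: length-V sequence of parent joint indices; the root points to itself.
    Returns a (T, V, 12) array.
    """
    joints = np.asarray(joints, dtype=float)
    parents = np.asarray(parents)

    # Bone vector: each joint minus its parent joint (zero for the root).
    bones = joints - joints[:, parents, :]
    # Unit bone direction; zero-length bones stay zero instead of dividing by 0.
    norms = np.linalg.norm(bones, axis=-1, keepdims=True)
    bone_dirs = np.divide(bones, norms, out=np.zeros_like(bones), where=norms > 0)

    # Velocity: first-order frame difference, zero at the first frame.
    velocity = np.zeros_like(joints)
    velocity[1:] = joints[1:] - joints[:-1]

    # Acceleration: difference of valid velocities, zero at the first two frames.
    acceleration = np.zeros_like(joints)
    acceleration[2:] = velocity[2:] - velocity[1:-1]

    # Concatenate all streams along the channel axis so one graph carries them all.
    return np.concatenate([joints, bone_dirs, velocity, acceleration], axis=-1)
```

Stacking the streams along the channel axis is one simple way to keep them within a single graph, as opposed to training a separate network per stream and fusing the outputs.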
Abstract
Reliable methods for the neurodevelopmental assessment of infants are essential for early detection of problems that may need prompt interventions. Spontaneous motor activity, or 'kinetics', has been shown to provide a powerful surrogate measure of upcoming neurodevelopment. However, its assessment is by and large qualitative and subjective, focusing on visually identified, age-specific gestures. In this work, we introduce Kinetic Age (KA), a novel data-driven metric that quantifies neurodevelopmental maturity by predicting an infant's age based on their movement patterns. KA offers an interpretable and generalizable proxy for motor development. Our method leverages 3D video recordings of infants, processed with pose estimation to extract spatio-temporal series of anatomical landmarks, which are released as a new openly available dataset. These data are modeled using adaptive graph convolutional networks, which can capture the spatio-temporal dependencies in infant movements. We also show that our data-driven approach improves on traditional machine learning baselines based on manually engineered features.
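As a rough illustration of what "adaptive" means for a graph convolution over skeleton joints, the sketch below combines a fixed skeleton adjacency with a learned offset and a per-sample attention matrix before aggregating joint features. It is a simplified NumPy forward pass in the spirit of adaptive GCN layers, not the paper's model: the function `adaptive_graph_conv`, the argument names, the additive combination, and the scaling of the attention scores are all assumptions, and temporal convolution, normalization, and training are omitted.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def adaptive_graph_conv(x, A, B, W_theta, W_phi, W_out):
    """One spatial graph convolution with an adaptive adjacency (illustrative).

    x:       (T, V, C_in) features for T frames and V joints.
    A:       (V, V) fixed, normalized skeleton adjacency.
    B:       (V, V) learned offset shared by all samples (a parameter in training).
    W_theta, W_phi: (C_in, C_e) embeddings used for per-sample attention.
    W_out:   (C_in, C_out) output projection.
    Returns (T, V, C_out).
    """
    n_joints = x.shape[1]
    # Embed every joint, then flatten time and channels into one vector per joint.
    theta = (x @ W_theta).transpose(1, 0, 2).reshape(n_joints, -1)
    phi = (x @ W_phi).transpose(1, 0, 2).reshape(n_joints, -1)
    # Per-sample attention: pairwise joint similarity, normalized over neighbors.
    attention = softmax(theta @ phi.T / theta.shape[1], axis=-1)
    # Adaptive adjacency: fixed skeleton + learned offset + data-dependent term.
    adjacency = A + B + attention
    # Aggregate neighbor features per frame: out[t, v] = sum_u adjacency[v, u] * x[t, u].
    aggregated = np.einsum("vu,tuc->tvc", adjacency, x)
    return aggregated @ W_out
```

Because the attention term is recomputed from each input sequence, two infants with the same skeleton can end up with different effective joint connectivity, which is the property the fixed-adjacency ST-GCN formulation lacks.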
