The Topology of a Family Tree Graph and Its Members' Satisfaction with One Another: A Machine Learning Approach

Teddy Lazebnik; Amit Yaniv-Rosenfeld

The Topology of a Family Tree Graph and Its Members' Satisfaction with One Another: A Machine Learning Approach

Teddy Lazebnik, Amit Yaniv-Rosenfeld

TL;DR

The paper addresses whether the topology of a family tree can predict how members rate each other’s satisfaction. It introduces a two-stage pipeline that encodes topology via a Variational Graph AutoEncoder into a $16$-dimensional vector and then uses a TPOT-driven regression to predict $EFS$ and $NFS$, demonstrating strong predictive performance on $N=486$ families. Key contributions include formalizing topology-based prediction for $EFS$ and $NFS$, showing substantial gains over traditional feature-based baselines, and providing a reproducible framework for topology-centric analysis in family studies. The findings suggest that graph topology alone carries significant information about family satisfaction, with practical implications for targeted interventions and future explainability research.

Abstract

Family members' satisfaction with one another is central to creating healthy and supportive family environments. In this work, we propose and implement a novel computational technique aimed at exploring the possible relationship between the topology of a given family tree graph and its members' satisfaction with one another. Through an extensive empirical evaluation ($N=486$ families), we show that the proposed technique brings about highly accurate results in predicting family members' satisfaction with one another based solely on the family graph's topology. Furthermore, the results indicate that our technique favorably compares to baseline regression models which rely on established features associated with family members' satisfaction with one another in prior literature.

The Topology of a Family Tree Graph and Its Members' Satisfaction with One Another: A Machine Learning Approach

TL;DR

-dimensional vector and then uses a TPOT-driven regression to predict

and

, demonstrating strong predictive performance on

families. Key contributions include formalizing topology-based prediction for

and

, showing substantial gains over traditional feature-based baselines, and providing a reproducible framework for topology-centric analysis in family studies. The findings suggest that graph topology alone carries significant information about family satisfaction, with practical implications for targeted interventions and future explainability research.

Abstract

families), we show that the proposed technique brings about highly accurate results in predicting family members' satisfaction with one another based solely on the family graph's topology. Furthermore, the results indicate that our technique favorably compares to baseline regression models which rely on established features associated with family members' satisfaction with one another in prior literature.

Paper Structure (8 sections, 2 equations, 3 figures, 1 table)

This paper contains 8 sections, 2 equations, 3 figures, 1 table.

Introduction
Related Work
Computational Technique
Empirical Study
Data Collection
Baseline Models
Results
Discussion and Conclusions

Figures (3)

Figure 1: A schematic view of the proposed technique. The family tree graph is first converted by the VGAE model into a representative feature vector. In parallel, the satisfaction matrix is used for computing the family satisfaction measures of interest (e.g., the EFS and NFS measures). The extended feature vectors, now including the target variables, are then used as input to an optimization tool aimed at finding an optional regression model $R$.
Figure 2: A schematic view of the empirical study's workflow.
Figure 3: A comparison of the four models in question: $R_{l},R_{nl},B_{l}$ and $B_{nl}$ using scatter plots. The x-axis represents the actual values, and the y-axis represents the predicted values. Ideally, if the predictions are perfect, the points will lie along a straight solid black line with a slope of 1. Each sub-figure denotes the prediction made by one specific model captioned by its name, Mean Average Error (MEA, the lower - the better), and its determination of coefficients ($R^2$, the higher - the better), respectively. The top row presents the results using the EFS measure and the bottom row represents the results using the NFS measure. The gray dashed line indicates the linear regression fitting on the scatter points.

The Topology of a Family Tree Graph and Its Members' Satisfaction with One Another: A Machine Learning Approach

TL;DR

Abstract

The Topology of a Family Tree Graph and Its Members' Satisfaction with One Another: A Machine Learning Approach

Authors

TL;DR

Abstract

Table of Contents

Figures (3)