Using Graph Neural Networks to Predict Local Culture
Thiago H Silva, Daniel Silver
TL;DR
This paper develops a graph neural network (GNN) framework to predict local cultural dimensions of neighbourhoods by integrating three data streams: area socio-economic information, mobility graphs derived from Yelp review co-location, and group profiles inferred from reviewers’ venue tastes. By modeling cities as graphs where vertices are neighbourhoods and edges encode cross-neighbourhood movements and group interactions, the authors compare eight graph-structured scenarios built on different feature subsets and evaluate their predictive power for 15 cultural dimensions. Key findings show that either area-level socio-economic data or Yelp-derived group profiles achieve the strongest predictive performance, while mobility connectivity alone provides limited value; combining data sources does not consistently improve results. The work demonstrates the potential of GNNs to fuse diverse data sources for urban research and highlights practical implications for cases with scarce census data, while also outlining avenues for richer group profiling and multi-city graph analyses.
Abstract
Urban research has long recognized that neighbourhoods are dynamic and relational. However, lack of data, methodologies, and computer processing power have hampered a formal quantitative examination of neighbourhood relational dynamics. To make progress on this issue, this study proposes a graph neural network (GNN) approach that permits combining and evaluating multiple sources of information about internal characteristics of neighbourhoods, their past characteristics, and flows of groups among them, potentially providing greater expressive power in predictive models. By exploring a public large-scale dataset from Yelp, we show the potential of our approach for considering structural connectedness in predicting neighbourhood attributes, specifically to predict local culture. Results are promising from a substantive and methodologically point of view. Substantively, we find that either local area information (e.g. area demographics) or group profiles (tastes of Yelp reviewers) give the best results in predicting local culture, and they are nearly equivalent in all studied cases. Methodologically, exploring group profiles could be a helpful alternative where finding local information for specific areas is challenging, since they can be extracted automatically from many forms of online data. Thus, our approach could empower researchers and policy-makers to use a range of data sources when other local area information is lacking.
