Read the Room: Inferring Social Context Through Dyadic Interaction Recognition in Cyber-physical-social Infrastructure Systems

Cheyu Lin; John Martins; Katherine A. Flanigan; Ph. D

Read the Room: Inferring Social Context Through Dyadic Interaction Recognition in Cyber-physical-social Infrastructure Systems

Cheyu Lin, John Martins, Katherine A. Flanigan, Ph. D

TL;DR

This paper advances cyber-physical-social infrastructure systems (CPSIS) by focusing on social benefits and privacy-preserving measurement of human interactions within infrastructure. It introduces a dyadic interaction dataset drawn from a five-category taxonomy and benchmarks five skeleton-based recognition models on 12 dyadic interactions, finding ConvLSTM to be the most effective for capturing spatiotemporal social cues. The study reveals that depth-sensor–based skeleton data can enable robust dyadic interaction recognition even under occlusion, suggesting practical pathways for integrating social objectives into CPSIS. The work lays groundwork for mapping social interactions to social benefits, with potential applications in healthcare, smart spaces, and autonomous systems, while highlighting the need for deeper understanding of social meanings and more efficient implementations.

Abstract

Cyber-physical systems (CPS) integrate sensing, computing, and control to improve infrastructure performance, focusing on economic goals like performance and safety. However, they often neglect potential human-centered (or ''social'') benefits. Cyber-physical-social infrastructure systems (CPSIS) aim to address this by aligning CPS with social objectives. This involves defining social benefits, understanding human interactions with each other and infrastructure, developing privacy-preserving measurement methods, modeling these interactions for prediction, linking them to social benefits, and actuating the physical environment to foster positive social outcomes. This paper delves into recognizing dyadic human interactions using real-world data, which is the backbone to measuring social behavior. This lays a foundation to address the need to enhance understanding of the deeper meanings and mutual responses inherent in human interactions. While RGB cameras are informative for interaction recognition, privacy concerns arise. Depth sensors offer a privacy-conscious alternative by analyzing skeletal movements. This study compares five skeleton-based interaction recognition algorithms on a dataset of 12 dyadic interactions. Unlike single-person datasets, these interactions, categorized into communication types like emblems and affect displays, offer insights into the cultural and emotional aspects of human interactions.

Read the Room: Inferring Social Context Through Dyadic Interaction Recognition in Cyber-physical-social Infrastructure Systems

TL;DR

Abstract

Read the Room: Inferring Social Context Through Dyadic Interaction Recognition in Cyber-physical-social Infrastructure Systems

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)