Aligning Human and Machine Attention for Enhanced Supervised Learning
Avihay Chriqui, Inbal Yahav, Dov Teeni, Ahmed Abbasi
TL;DR
HuMAL addresses the challenge of aligning machine attention with human attention to improve supervised learning, especially when labeled data are scarce or imbalanced. It introduces three strategies (HuMAL-AL, HuMAL-AN, HuMAL-AP) that integrate human attention into transformer-based classifiers, and evaluates them on Yelp sentiment and myPersonality tasks. The results show that HuMAL-AL consistently outperforms baselines and other HuMAL variants, with notable gains under data scarcity and across different base models (BERT, GPT-2, XLNet). This work suggests that incorporating task-specific human attention can enhance performance and interpretability in real-world NLP applications, particularly for nuanced or low-resource tasks.
Abstract
Attention, or prioritization of certain information items over others, is a critical element of any learning process, for both humans and machines. Given that humans continue to outperform machines in certain learning tasks, it seems plausible that machine performance could be enriched by aligning machine attention with human attention mechanisms -- yet research on this topic is sparse and has achieved only limited success. This paper proposes a new approach to address this gap, called Human-Machine Attention Learning (HuMAL). This approach involves reliance on data annotated by humans to reflect their self-perceived attention during specific tasks. We evaluate several alternative strategies for integrating such human attention data into machine learning (ML) algorithms, using a sentiment analysis task (review data from Yelp) and a personality-type classification task (data from myPersonality). The best-performing HuMAL strategy significantly enhances the task performance of fine-tuned transformer models (BERT, as well as GPT-2 and XLNET), and the benefit is particularly pronounced under challenging conditions of imbalanced or sparse labeled data. This research contributes to a deeper understanding of strategies for integrating human attention into ML models and highlights the potential of leveraging human cognition to augment ML in real-world applications.
