FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding

Thanh-Dat Truong; Utsav Prabhu; Bhiksha Raj; Jackson Cothren; Khoa Luu

TL;DR

FALCON tackles two challenges in continual semantic segmentation, fairness and unknown-class modeling, by introducing a Fairness Contrastive Clustering Loss and an Attention-based Visual Grammar. The method couples a contrastive clustering objective with a learnable fairness mechanism, while the visual grammar module models the distribution of unknown classes through self-attention, yielding discriminative representations for both known and unknown classes. The approach achieves state-of-the-art results on ADE20K, Pascal VOC, and Cityscapes, with empirical evidence of improved fairness for minor classes and robust control of forgetting across sequential tasks. By connecting the contrastive clustering objective to an upper bound on knowledge distillation, FALCON offers a principled way to preserve previous knowledge while learning new classes in open-set environments.

Abstract

Continual Learning in semantic scene segmentation aims to continually learn new unseen classes in dynamic environments while maintaining previously learned knowledge. Prior studies focused on modeling the catastrophic forgetting and background shift challenges in continual learning. However, fairness, another major challenge that causes unfair predictions and uneven performance across major and minor classes, has not yet been well addressed. In addition, prior methods have yet to model the unknown classes well and therefore produce non-discriminative features among unknown classes. This work presents a novel Fairness Learning via Contrastive Attention Approach to continual learning in semantic scene understanding. In particular, we first introduce a new Fairness Contrastive Clustering loss to address the problems of catastrophic forgetting and fairness. Then, we propose an attention-based visual grammar approach to effectively model the background shift problem and unknown classes, producing better feature representations for different unknown classes. In our experiments, the proposed approach achieves State-of-the-Art (SoTA) performance on several continual learning benchmarks, i.e., ADE20K, Cityscapes, and Pascal VOC. It also promotes the fairness of the continual semantic segmentation model.
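To make the fairness concern concrete, the toy sketch below compares two ways of averaging a cluster-compactness loss over pixels. It is not the paper's Fairness Contrastive Clustering loss, whose formulation is not given in this summary. The function names, the squared-distance "pull" term, and the data are all assumptions chosen for illustration. The sketch only shows the general effect: when a loss is averaged over pixels, a major class can dominate it, whereas averaging per class gives a scattered minor class equal weight.

```python
from collections import defaultdict


def per_class_pull(features, labels, prototypes):
    """Mean squared distance from each class's features to its prototype.

    Hypothetical stand-in for a clustering loss; not the paper's objective.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for feat, label in zip(features, labels):
        proto = prototypes[label]
        sums[label] += sum((a - b) ** 2 for a, b in zip(feat, proto))
        counts[label] += 1
    per_class = {c: sums[c] / counts[c] for c in sums}
    return per_class, dict(counts)


def pixel_averaged(per_class, counts):
    """Average over pixels: each class is weighted by its pixel count."""
    total = sum(counts.values())
    return sum(per_class[c] * counts[c] for c in per_class) / total


def class_balanced(per_class):
    """Average over classes: every class contributes equally."""
    return sum(per_class.values()) / len(per_class)


# Toy data (assumed): class 0 is a tight major class with 9 pixels,
# class 1 is a scattered minor class with a single pixel.
prototypes = {0: [0.0, 0.0], 1: [1.0, 0.0]}
features = [[0.1, 0.0]] * 9 + [[1.0, 1.0]]
labels = [0] * 9 + [1]

per_class, counts = per_class_pull(features, labels, prototypes)
print("per-class loss:", per_class)
print("pixel-averaged:", pixel_averaged(per_class, counts))
print("class-balanced:", class_balanced(per_class))
```

In this toy setting the minor class carries one tenth of the weight under pixel averaging and one half under class-balanced averaging, so its poorly formed cluster is far more visible to the second objective.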

Paper Structure (21 sections, 17 equations, 7 figures, 12 tables, 1 algorithm)

Introduction
Related Work
The Proposed FALCON Approach
Continual Learning via Contrastive Clustering
Fairness Contrastive Clustering Learning
An Efficient Unknown Class Modeling
Continual Learning Procedure
Experiments
Implementations and Evaluation Protocols
Ablation Study
Comparison with Prior SoTA Methods
Conclusions and Limitations
Proof of Propositions 1 and 2
Proof of Proposition 1
Proof of Proposition 2
...and 6 more sections

Figures (7)

Figure 1: Our Fairness Learning via Contrastive Attention to Continual Semantic Segmentation. The Fairness Contrastive Clustering Loss promotes the fairness of the model while the Attention-based Visual Grammar models the unknown classes.
Figure 2: The Data Class Distribution of ADE20K. The major classes occupy more than 75% of the total pixels of the dataset.
Figure 3: The Proposed Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding.
Figure 4: The Enforcement Loss of Contrastive Clustering $\mathcal{L}_{Cont}$ and Fairness Contrastive Clustering $\mathcal{L}^{\alpha}_{Cont}$ on Pascal VOC. Since $\mathcal{L}_{Cont}$ suffers from severe bias, its clusters of minor classes remain scattered. Our $\mathcal{L}^{\alpha}_{Cont}$ produces a more uniform loss among classes, promoting fairness and compactness of clusters.
Figure 5: The Proposed Visual Grammar Model.
...and 2 more figures
