Task Progressive Curriculum Learning for Robust Visual Question Answering

Ahmed Akl; Abdelwahed Khamis; Zhe Wang; Ali Cheraghian; Sara Khalifa; Kewen Wang

TL;DR

This work shows for the first time that robust Visual Question Answering is attainable by simply enhancing the training strategy. It proposes Task Progressive Curriculum Learning (TPCL), which breaks the main VQA problem into smaller, easier tasks based on the question type.

Abstract

Visual Question Answering (VQA) systems are known for their poor performance on out-of-distribution datasets, an issue that previous works addressed through ensemble learning, answer re-ranking, or artificially growing the training set. In this work, we show for the first time that robust Visual Question Answering is attainable by simply enhancing the training strategy. Our proposed approach, Task Progressive Curriculum Learning (TPCL), breaks the main VQA problem into smaller, easier tasks based on the question type. It then progressively trains the model on a carefully crafted sequence of tasks. We further support the method with a novel distribution-based difficulty measurer. Our approach is conceptually simple, model-agnostic, and easy to implement. We demonstrate TPCL's effectiveness through a comprehensive evaluation on standard datasets. Without either data augmentation or an explicit debiasing mechanism, it achieves state-of-the-art results on the VQA-CP v2, VQA-CP v1, and VQA v2 datasets. Extensive experiments demonstrate that TPCL outperforms the most competitive robust VQA approaches by more than 5% and 7% on VQA-CP v2 and VQA-CP v1, respectively. TPCL can also boost the performance of baseline VQA backbones by up to 28.5%.
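
The idea of splitting VQA by question type and training on an ordered sequence of tasks can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the sample fields `question_type` and `answer`, the helper names, the cumulative staging scheme, and above all the difficulty score. The abstract does not specify the distribution-based difficulty measurer, so the sketch substitutes the Shannon entropy of each task's answer distribution as a stand-in, not as the authors' measure.

```python
import math
from collections import Counter, defaultdict


def group_by_question_type(samples):
    """Split samples into tasks keyed by question type (hypothetical field name)."""
    tasks = defaultdict(list)
    for sample in samples:
        tasks[sample["question_type"]].append(sample)
    return dict(tasks)


def answer_entropy(task_samples):
    """Stand-in difficulty score: Shannon entropy (bits) of the answer distribution.

    This is an assumption for illustration, not the measurer used by TPCL.
    """
    counts = Counter(sample["answer"] for sample in task_samples)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def build_curriculum(samples, difficulty=answer_entropy):
    """Order tasks from easiest to hardest and return cumulative training stages.

    Each stage is (task_name, training_pool), where the pool holds the new task
    plus all earlier ones. Cumulative pooling is an assumed design choice.
    """
    tasks = group_by_question_type(samples)
    # Ties are broken by task name so the ordering is deterministic.
    order = sorted(tasks, key=lambda name: (difficulty(tasks[name]), name))
    stages, pool = [], []
    for name in order:
        pool = pool + tasks[name]
        stages.append((name, list(pool)))
    return stages


samples = [
    {"question_type": "what color", "answer": "red"},
    {"question_type": "is the", "answer": "yes"},
    {"question_type": "what color", "answer": "blue"},
    {"question_type": "is the", "answer": "yes"},
]
for task_name, pool in build_curriculum(samples):
    # A real training loop would run one or more epochs on `pool` here.
    print(task_name, len(pool))
```

Because the curriculum only reorders the training data, a sketch like this leaves the model and loss untouched, which is consistent with the abstract's description of the approach as model-agnostic. Passing a different `difficulty` callable swaps in another measurer without changing the rest of the code.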
