FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection
Takeru Inoue, Ryusuke Miyamoto
TL;DR
FastInstShadow (FIS) is a query-based approach to instance shadow detection that learns shadow–object relationships during detection, using an association transformer decoder built from two dual-path transformer decoders. Because it models paired shadows and objects directly, it needs no separate pairing step. Training is aided by strategies such as shadow direction learning and a box-aware mask loss. On the SOBA dataset, the FIS variants achieve state-of-the-art results on both instance and association metrics: the D3 variant gives the best overall accuracy, and the D1 variant runs in real time on moderate-resolution images. The resulting inference pipeline is simpler, faster, and more accurate, which makes shadow-aware editing, generation, and compositing more practical.
Abstract
Instance shadow detection is the task of detecting pairs of shadows and objects. Existing methods first detect shadows and objects independently and then associate them. This paper introduces FastInstShadow, a method that improves detection accuracy through a query-based architecture featuring an association transformer decoder with two dual-path transformer decoders, which assess the relationships between shadows and objects during detection. Experiments on the SOBA dataset show that the proposed method outperforms all existing methods across all criteria. It also makes real-time processing feasible for moderate-resolution images while achieving higher accuracy than SSISv2, the most accurate existing method. Our code is available at https://github.com/wlotkr/FastInstShadow.
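
To make the contrast concrete, the sketch below illustrates the kind of separate association step that a detect-then-associate pipeline needs and that a model predicting pairs directly can skip. It is a hypothetical stand-in, not the pairing procedure of SSISv2 or of any other published method, and it is not part of FastInstShadow: the function names, the box format, and the nearest-centroid heuristic are all assumptions chosen for illustration.

```python
from itertools import product
from math import hypot
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)


def centroid(box: Box) -> Tuple[float, float]:
    """Return the center point of an axis-aligned box."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)


def pair_by_nearest_centroid(
    shadows: List[Box],
    objects: List[Box],
    max_dist: float = float("inf"),
) -> List[Tuple[int, int]]:
    """Greedy one-to-one pairing of independently detected shadows and objects.

    Hypothetical heuristic: repeatedly accept the closest remaining
    (shadow, object) pair by centroid distance. Returns a list of
    (shadow_index, object_index) tuples.
    """
    # Score every candidate pair by the distance between box centers.
    candidates = []
    for (si, s), (oi, o) in product(enumerate(shadows), enumerate(objects)):
        (sx, sy), (ox, oy) = centroid(s), centroid(o)
        candidates.append((hypot(sx - ox, sy - oy), si, oi))
    candidates.sort()

    used_shadows, used_objects, pairs = set(), set(), []
    for dist, si, oi in candidates:
        if dist > max_dist:
            break  # remaining candidates are even farther apart
        if si in used_shadows or oi in used_objects:
            continue  # each detection may belong to at most one pair
        used_shadows.add(si)
        used_objects.add(oi)
        pairs.append((si, oi))
    return pairs


if __name__ == "__main__":
    # Two objects, each with a shadow directly below it (toy coordinates).
    shadows = [(0, 10, 10, 20), (100, 10, 110, 20)]
    objects = [(100, 0, 110, 10), (0, 0, 10, 10)]
    print(pair_by_nearest_centroid(shadows, objects))
```

A post-hoc step like this sees only the final detections, so an error in either detector or in the matching heuristic carries through to the output pairs. The approach described in the abstract instead assesses shadow–object relationships during detection.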
