TADACap: Time-series Adaptive Domain-Aware Captioning
Elizabeth Fons, Rachneet Kaur, Zhen Zeng, Soham Palande, Tucker Balch, Svitlana Vyetrenko, Manuela Veloso
TL;DR
TADACap addresses the problem of domain-aware captioning for time-series images without retraining. It combines a domain-agnostic captioner with a diverse retrieval strategy (TADACap-diverse) that samples from a target-domain database using a Determinantal Point Process and CLIP embeddings, then leverages in-context prompting of an LLM to produce domain-specific captions. The key contributions are a retrieval-based framework that adapts to new domains with minimal annotation, a diverse-sample retrieval method with $k=4$, and four new domain-aware time-series caption datasets; experiments show competitive semantic accuracy compared to retrained baselines and significant annotation-effort reductions. This approach enables practical, scalable domain adaptation for time-series captioning in finance, healthcare, and other domains where access to raw time-series data is limited or where rapid domain-specific captioning is valuable.
Abstract
While image captioning has gained significant attention, the potential of captioning time-series images, prevalent in areas like finance and healthcare, remains largely untapped. Existing time-series captioning methods typically offer generic, domain-agnostic descriptions of time-series shapes and struggle to adapt to new domains without substantial retraining. To address these limitations, we introduce TADACap, a retrieval-based framework to generate domain-aware captions for time-series images, capable of adapting to new domains without retraining. Building on TADACap, we propose a novel retrieval strategy that retrieves diverse image-caption pairs from a target domain database, namely TADACap-diverse. We benchmarked TADACap-diverse against state-of-the-art methods and ablation variants. TADACap-diverse demonstrates comparable semantic accuracy while requiring significantly less annotation effort.
