Table of Contents
Fetching ...

Deep Mamba Multi-modal Learning

Jian Zhu, Xin Zou, Yu Cui, Zhangmin Huang, Chenshu Hu, Bo Lyu

TL;DR

This work applies DMML to the field of multimedia retrieval and proposes an innovative Deep Mamba Multi-modal Hashing (DMMH) method that combines the advantages of algorithm accuracy and inference speed.

Abstract

Inspired by the excellent performance of Mamba networks, we propose a novel Deep Mamba Multi-modal Learning (DMML). It can be used to achieve the fusion of multi-modal features. We apply DMML to the field of multimedia retrieval and propose an innovative Deep Mamba Multi-modal Hashing (DMMH) method. It combines the advantages of algorithm accuracy and inference speed. We validated the effectiveness of DMMH on three public datasets and achieved state-of-the-art results.

Deep Mamba Multi-modal Learning

TL;DR

This work applies DMML to the field of multimedia retrieval and proposes an innovative Deep Mamba Multi-modal Hashing (DMMH) method that combines the advantages of algorithm accuracy and inference speed.

Abstract

Inspired by the excellent performance of Mamba networks, we propose a novel Deep Mamba Multi-modal Learning (DMML). It can be used to achieve the fusion of multi-modal features. We apply DMML to the field of multimedia retrieval and propose an innovative Deep Mamba Multi-modal Hashing (DMMH) method. It combines the advantages of algorithm accuracy and inference speed. We validated the effectiveness of DMMH on three public datasets and achieved state-of-the-art results.

Paper Structure

This paper contains 7 sections, 2 figures, 2 tables.

Figures (2)

  • Figure 1: Deep Mamba Multi-modal Learning
  • Figure 2: Deep Mamba Multi-modal Hashing