MolEdit: Knowledge Editing for Multimodal Molecule Language Models

Zhenyu Lei; Patrick Soga; Yaochen Zhu; Yinhan He; Yushun Dong; Jundong Li

MolEdit: Knowledge Editing for Multimodal Molecule Language Models

Zhenyu Lei, Patrick Soga, Yaochen Zhu, Yinhan He, Yushun Dong, Jundong Li

TL;DR

This work addresses the problem of stale or manipulated knowledge in multimodal Molecule Language Models (MoLMs) by introducing MolEdit, a targeted editing framework designed for multifaceted molecular knowledge. It combines a Multi-Expert Knowledge Adapter (MEKA) with an Expertise-Aware Editing Switcher (EAES) to ensure fine-grained, locality-preserving edits across both molecule-to-caption and caption-to-molecule tasks. To evaluate editing effectiveness, the authors propose MEBench, a benchmark assessing Reliability, Locality, and Generality, and demonstrate that MolEdit outperforms baselines across these dimensions on two MoLM backbones. The study highlights the importance of facet-specific editing and cautious activation of edits to maintain consistency across related molecular knowledge, advancing reliable knowledge management in MoLMs with practical implications for chemistry, biology, and materials science research.

Abstract

Understanding and continuously refining multimodal molecular knowledge is crucial for advancing biomedicine, chemistry, and materials science. Molecule language models (MoLMs) have become powerful tools in these domains, integrating structural representations (e.g., SMILES strings, molecular graphs) with rich contextual descriptions (e.g., physicochemical properties). However, MoLMs can encode and propagate inaccuracies due to outdated web-mined training corpora or malicious manipulation, jeopardizing downstream discovery pipelines. While knowledge editing has been explored for general-domain AI, its application to MoLMs remains uncharted, presenting unique challenges due to the multifaceted and interdependent nature of molecular knowledge. In this paper, we take the first step toward MoLM editing for two critical tasks: molecule-to-caption generation and caption-to-molecule generation. To address molecule-specific challenges, we propose MolEdit, a powerful framework that enables targeted modifications while preserving unrelated molecular knowledge. MolEdit combines a Multi-Expert Knowledge Adapter that routes edits to specialized experts for different molecular facets with an Expertise-Aware Editing Switcher that activates the adapters only when input closely matches the stored edits across all expertise, minimizing interference with unrelated knowledge. To systematically evaluate editing performance, we introduce MEBench, a comprehensive benchmark assessing multiple dimensions, including Reliability (accuracy of the editing), Locality (preservation of irrelevant knowledge), and Generality (robustness to reformed queries). Across extensive experiments on two popular MoLM backbones, MolEdit delivers up to 18.8% higher Reliability and 12.0% better Locality than baselines while maintaining efficiency. The code is available at: https://github.com/LzyFischer/MolEdit.

MolEdit: Knowledge Editing for Multimodal Molecule Language Models

TL;DR

Abstract

MolEdit: Knowledge Editing for Multimodal Molecule Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)