Comgra: A Tool for Analyzing and Debugging Neural Networks

Florian Dietz; Sophie Fellenz; Dietrich Klakow; Marius Kloft

Comgra: A Tool for Analyzing and Debugging Neural Networks

Florian Dietz, Sophie Fellenz, Dietrich Klakow, Marius Kloft

TL;DR

Comgra addresses the challenge of inspecting large, complex neural networks by providing a PyTorch-compatible tool with a GUI that logs internal activations and gradients. The approach combines dynamic, flexible logging with a dependency-graph view to enable rapid, multi-angle inspection of tensors, training dynamics, and interventions, while maintaining a manageable memory footprint. Key contributions include the GUI architecture with selectors, a customizable dependency graph, and workflow-oriented use cases that demonstrate debugging, architecture exploration, and mechanistic interpretability. The work enables faster hypothesis testing and targeted debugging without rerunning extensive training runs, improving practicality for researchers and practitioners alike.

Abstract

Neural Networks are notoriously difficult to inspect. We introduce comgra, an open source python library for use with PyTorch. Comgra extracts data about the internal activations of a model and organizes it in a GUI (graphical user interface). It can show both summary statistics and individual data points, compare early and late stages of training, focus on individual samples of interest, and visualize the flow of the gradient through the network. This makes it possible to inspect the model's behavior from many different angles and save time by rapidly testing different hypotheses without having to rerun it. Comgra has applications for debugging, neural architecture design, and mechanistic interpretability. We publish our library through Python Package Index (PyPI) and provide code, documentation, and tutorials at https://github.com/FlorianDietz/comgra.

Comgra: A Tool for Analyzing and Debugging Neural Networks

TL;DR

Abstract

Comgra: A Tool for Analyzing and Debugging Neural Networks

Authors

TL;DR

Abstract

Table of Contents

Figures (1)