SnapPix: Efficient-Coding–Inspired In-Sensor Compression for Edge Vision

Weikai Lin; Tianrui Ma; Adith Boloor; Yu Feng; Ruofan Xing; Xuan Zhang; Yuhao Zhu

TL;DR

SnapPix tackles the energy bottleneck of edge sensing by compressing pixels inside the sensor via coded exposure (CE). It introduces a task-agnostic CE pattern, learned through decorrelation to minimize redundancy, and a tile-repetitive exposure scheme co-designed with a Vision Transformer (ViT) backbone, plus lightweight hardware augmentations that support CE with negligible area impact. The approach yields energy savings ranging from 1.4x to 15.4x and outperforms task-specific and video-based baselines on action recognition and video reconstruction. The result is energy-efficient, multi-task edge vision with practical hardware support and open-source code.

Abstract

Energy-efficient image acquisition on the edge is crucial for enabling remote sensing applications where the sensor node has weak compute capabilities and must transmit data to a remote server or cloud for processing. To reduce edge energy consumption, this paper proposes a sensor-algorithm co-designed system called SnapPix, which compresses raw pixels in the analog domain inside the sensor. We use coded exposure (CE) as the in-sensor compression strategy because it offers the flexibility to sample, i.e., selectively expose pixels, both spatially and temporally. SnapPix makes three contributions. First, we propose a task-agnostic strategy to learn the sampling/exposure pattern based on the classic theory of efficient coding. Second, we co-design the downstream vision model with the exposure pattern to address the pixel-level non-uniformity unique to CE-compressed images. Finally, we propose lightweight augmentations to the image sensor hardware to support our in-sensor CE compression. Evaluated on action recognition and video reconstruction, SnapPix outperforms state-of-the-art video-based methods at the same speed while reducing energy consumption by up to 15.4x. We have open-sourced the code at: https://github.com/horizon-research/SnapPix.
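
As a rough illustration of what coded exposure does, the sketch below collapses a short stack of frames into a single coded image using a small binary exposure pattern repeated across the sensor. It is a minimal NumPy simulation and not the SnapPix implementation: the function name coded_exposure, the 16-frame window, the 8x8 tile size, and the random (rather than learned) pattern are all assumptions made for the example.

```python
import numpy as np

def coded_exposure(video, tile_pattern):
    """Collapse T frames into one coded image (illustrative only).

    video:        (T, H, W) array of per-frame pixel intensities.
    tile_pattern: (T, h, w) binary array; 1 means the pixel is exposed
                  during that frame. H and W must be multiples of h and w.
    """
    T, H, W = video.shape
    t, h, w = tile_pattern.shape
    if t != T or H % h or W % w:
        raise ValueError("pattern must match T and evenly tile H x W")
    # Repeat the small tile over the whole sensor (tile-repetitive pattern).
    mask = np.tile(tile_pattern, (1, H // h, W // w))
    # Each pixel accumulates light only in the frames where it is exposed.
    return (video * mask).sum(axis=0)

# Hypothetical sizes: 16 frames of 32x32 pixels, 8x8 tile.
rng = np.random.default_rng(0)
video = rng.random((16, 32, 32))
# Random stand-in for the pattern; the paper learns it instead.
pattern = rng.integers(0, 2, size=(16, 8, 8))
coded = coded_exposure(video, pattern)
```

Here 16 frames leave the sensor as one 32x32 image, so the number of pixel values read out and transmitted drops by the number of frames in the window. Which frames each pixel integrates is set entirely by the pattern, which is the quantity the paper proposes to learn.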
