Minimalist Vision with Freeform Pixels
Jeremy Klotz, Shree K. Nayar
TL;DR
The paper introduces minimalist vision with freeform pixels to solve lightweight vision tasks using far fewer measurements than traditional cameras. By modeling the camera as the first layer of a neural network, it learns task-specific freeform pixel shapes and downstream inference weights from data collected with a training camera, then fabricates the masks and deploys a hardware prototype that is self-powered by harvested light. Across tasks such as workspace monitoring, room lighting estimation, and traffic speed, the approach achieves performance on par with much higher-resolution baselines, while enabling privacy protection (limited identifiability) and energy autonomy through compact sensing. The work situates itself within deep optics and differentiable sensor design, and points to future gains via dynamic masking and learned optical mappings to broaden task coverage while maintaining privacy and low-power operation.
Abstract
A minimalist vision system uses the smallest number of pixels needed to solve a vision task. While traditional cameras use a large grid of square pixels, a minimalist camera uses freeform pixels that can take on arbitrary shapes to increase their information content. We show that the hardware of a minimalist camera can be modeled as the first layer of a neural network, where the subsequent layers are used for inference. Training the network for any given task yields the shapes of the camera's freeform pixels, each of which is implemented using a photodetector and an optical mask. We have designed minimalist cameras for monitoring indoor spaces (with 8 pixels), measuring room lighting (with 8 pixels), and estimating traffic flow (with 8 pixels). The performance demonstrated by these systems is on par with a traditional camera with orders of magnitude more pixels. Minimalist vision has two major advantages. First, it naturally tends to preserve the privacy of individuals in the scene since the captured information is inadequate for extracting visual details. Second, since the number of measurements made by a minimalist camera is very small, we show that it can be fully self-powered, i.e., function without an external power supply or a battery.
