Analog Physical Systems Can Exhibit Double Descent

Sam Dillavou; Jason W Rocks; Jacob F Wycoff; Andrea J Liu; Douglas J Durian

Analog Physical Systems Can Exhibit Double Descent

Sam Dillavou, Jason W Rocks, Jacob F Wycoff, Andrea J Liu, Douglas J Durian

TL;DR

This work demonstrates double descent in a decentralized analog network of self-adjusting resistive elements, and shows that analog physical systems, if appropriately trained, can exhibit behaviors underlying the success of digital AI.

Abstract

An important component of the success of large AI models is double descent, in which networks avoid overfitting as they grow relative to the amount of training data, instead improving their performance on unseen data. Here we demonstrate double descent in a decentralized analog network of self-adjusting resistive elements. This system trains itself and performs tasks without a digital processor, offering potential gains in energy efficiency and speed -- but must endure component non-idealities. We find that standard training fails to yield double descent, but a modified protocol that accommodates this inherent imperfection succeeds. Our findings show that analog physical systems, if appropriately trained, can exhibit behaviors underlying the success of digital AI. Further, they suggest that biological systems might similarly benefit from over-parameterization.

Analog Physical Systems Can Exhibit Double Descent

TL;DR

Abstract

Analog Physical Systems Can Exhibit Double Descent

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)