Table of Contents
Fetching ...

Physics-informed appliance signatures generator for energy disaggregation

Ilia Kamyshev, Sahar Moghimian Hoosh, Henni Ouerdane

TL;DR

Energy disaggregation faces overfitting when training data are limited and biased toward a subset of appliances. The authors propose two physics-informed appliance-signature generators for high-sampling-rate ($kHz$) and low-sampling-rate ($Hz$) signals that synthesize unlimited, physically plausible signatures without needing input data. They validate the approach with PCA and KL-divergence, showing the synthetic distributions are significantly closer to real data than prior work (roughly 1.6× improvement for high-rate and 3.9× for low-rate). The method supports scalable, diverse NILM datasets and is released as the Edframe open-source library, enabling broader adoption and improved generalization of disaggregation algorithms.

Abstract

Energy disaggregation is a promising solution to access detailed information on energy consumption in a household, by itemizing its total energy consumption. However, in real-world applications, overfitting remains a challenging problem for data-driven disaggregation methods. First, the available real-world datasets are biased towards the most frequently used appliances. Second, both real and synthetic publicly-available datasets are limited in number of appliances, which may not be sufficient for a disaggregation algorithm to learn complex relations among different types of appliances and their states. To address the lack of appliance data, we propose two physics-informed data generators: one for high sampling rate signals (kHz) and another for low sampling rate signals (Hz). These generators rely on prior knowledge of the physics of appliance energy consumption, and are capable of simulating a virtually unlimited number of different appliances and their corresponding signatures for any time period. Both methods involve defining a mathematical model, selecting centroids corresponding to individual appliances, sampling model parameters around each centroid, and finally substituting the obtained parameters into the mathematical model. Additionally, by using Principal Component Analysis and Kullback-Leibler divergence, we demonstrate that our methods significantly outperform the previous approaches.

Physics-informed appliance signatures generator for energy disaggregation

TL;DR

Energy disaggregation faces overfitting when training data are limited and biased toward a subset of appliances. The authors propose two physics-informed appliance-signature generators for high-sampling-rate () and low-sampling-rate () signals that synthesize unlimited, physically plausible signatures without needing input data. They validate the approach with PCA and KL-divergence, showing the synthetic distributions are significantly closer to real data than prior work (roughly 1.6× improvement for high-rate and 3.9× for low-rate). The method supports scalable, diverse NILM datasets and is released as the Edframe open-source library, enabling broader adoption and improved generalization of disaggregation algorithms.

Abstract

Energy disaggregation is a promising solution to access detailed information on energy consumption in a household, by itemizing its total energy consumption. However, in real-world applications, overfitting remains a challenging problem for data-driven disaggregation methods. First, the available real-world datasets are biased towards the most frequently used appliances. Second, both real and synthetic publicly-available datasets are limited in number of appliances, which may not be sufficient for a disaggregation algorithm to learn complex relations among different types of appliances and their states. To address the lack of appliance data, we propose two physics-informed data generators: one for high sampling rate signals (kHz) and another for low sampling rate signals (Hz). These generators rely on prior knowledge of the physics of appliance energy consumption, and are capable of simulating a virtually unlimited number of different appliances and their corresponding signatures for any time period. Both methods involve defining a mathematical model, selecting centroids corresponding to individual appliances, sampling model parameters around each centroid, and finally substituting the obtained parameters into the mathematical model. Additionally, by using Principal Component Analysis and Kullback-Leibler divergence, we demonstrate that our methods significantly outperform the previous approaches.
Paper Structure (7 sections, 18 equations, 9 figures, 1 table)

This paper contains 7 sections, 18 equations, 9 figures, 1 table.

Figures (9)

  • Figure 1: Single cycle of laptop's high sampling rate signature (left) and its corresponding spectrum amplitudes (right). The amplitude of both graphs is set to 1.
  • Figure 2: Ten cycles of compact fluorescent lamp's high sampling rate signature that show exponential amplitude decay and spectrum fluctuations. The amplitude is set to 1.
  • Figure 3: Time-correlated random variable $r$ (left). Ten cycles of synthetic signature (right) which spectrum is time-correlated with accordance to the graph on the left. The amplitude of both graphs is set to 1.
  • Figure 4: Four real appliances and the most similar to them synthetic appliances. The amplitude of all graphs is set to 1.
  • Figure 5: Primitive cycle of a kettle (left) and its corresponding parametrized model $p=p_1\cdot p_2\cdot p_4$ (right). The amplitude of both graphs is set to 1.
  • ...and 4 more figures