Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Industrial Energy Disaggregation with Digital Twin-generated Dataset and Efficient Data Augmentation

Created by
  • Haebom

Author

Christian Intern o, Andrea Castellani, Sebastian Schmitt, Fabio Stella, Barbara Hammer

Outline

In this paper, we present an open-source synthetic industrial energy decomposition dataset (SIDED) generated using digital twin simulation to address the data shortage problem in the field of industrial non-intrusive load monitoring (NILM) and the complex variability of industrial energy consumption patterns. SIDED captures diverse device behaviors, weather conditions, and load profiles across three types of industrial facilities and three geographical locations. In addition, we propose a computationally efficient device modulation data augmentation (AMDA) technique that intelligently adjusts device power contributions based on the relative impact of devices to improve the generalization performance of NILM models. Experimental results show that NILM models trained with AMDA-augmented data significantly improve the energy consumption decomposition performance of complex industrial devices such as cogeneration systems. In particular, in the out-of-sample scenario, the model using AMDA achieves a normalized decomposition error of 0.093, which outperforms the model trained without data augmentation (0.451) and the model using random data augmentation (0.290). Data distribution analysis confirms that AMDA effectively aligns the training and test data distributions to improve the generalization performance of the model.

Takeaways, Limitations

Takeaways:
We present the possibility of solving the data shortage and privacy issues in the industrial NILM field through a synthetic dataset SIDED based on digital twins.
We propose an effective data augmentation method to improve the generalization performance of the NILM model using the AMDA technique.
It contributed to improving the energy consumption decomposition performance of complex industrial equipment.
By providing open source datasets, we can contribute to the advancement of research in the field of industrial NILM.
Limitations:
Since this is a dataset based on digital twin simulation, there may be differences from the actual industrial environment.
The effectiveness of the AMDA technique may be limited to certain types of industrial facilities and equipment.
There is a need to expand the dataset to include more diverse industrial facilities, equipment, and geographic locations.
Additional validation in actual industrial settings is needed.
👍