Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Generating Heterogeneous Multi-dimensional Data: A Comparative Study

Created by
  • Haebom

Author

Michael Corbeau, Emmanuelle Claeys, Mathieu Serrurier, Pascale Zarat e

Outline

This paper compares and analyzes data generation methods required for simulations that experiment with various scenarios to optimize the allocation of manpower and material resources when firefighters intervene. We review various data generation methods such as Random Sampling, Tabular Variational Autoencoders, GANs, Conditional Tabular GANs, and Diffusion Probabilistic Models, and evaluate the quality of the generated synthetic data by combining existing metrics such as Wasserstein distance and metrics specific to the firefighting field such as response time distribution, spatial-temporal intervention distribution, and accident representation. Considering the unbalanced distribution and non-normal distribution characteristics of firefighting intervention data, we propose a method to resolve the complexity of the data generation process.

Takeaways, Limitations

Takeaways:
We present an effective method for generating synthetic data for firefighting activity simulation.
We propose a comprehensive data quality assessment method that combines existing metrics and domain-specific metrics.
We seek solutions to generate imbalanced and non-normally distributed data.
Limitations:
There may be a lack of validation of the proposed methodology in practical firefighting settings.
Further research is needed on the generalizability of domain-specific metrics used in evaluation.
Generalizability to different types of firefighting activities needs to be verified.
👍