Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

LEMUR Neural Network Dataset: Towards Seamless AutoML

Created by
  • Haebom

Author

Arash Torabi Goodarzi, Roman Kochnev, Waleed Khalid, Hojjat Torabi Goudarzi, Furui Qin, Tolgay Atinc Uzun, Yashkumar Sanjaybhai Dhameliya, Yash Kanubhai Kathiriya, Zofia Antonina Bentyn, Dmitry Ignatov, Radu Timofte

Outline

LEMUR is an open-source dataset and framework that provides a large collection of PyTorch-based neural networks across a variety of tasks (classification, segmentation, detection, natural language processing, etc.). All models follow a unified template, and their configurations and results are stored in a structured database, ensuring consistency and reproducibility. It integrates automatic hyperparameter optimization via Optuna, statistical analysis, and visualization tools, and provides an API for seamless access to performance data. It is designed to be extensible, allowing the addition of new models, datasets, or metrics while maintaining compatibility. By standardizing implementations and unifying evaluations, it aims to accelerate AutoML research, enable fair benchmarking, and reduce barriers to large-scale neural network experimentation.

Takeaways, Limitations

Takeaways:
Accelerating AutoML Research
Fair comparison and benchmarking of neural network models possible.
Reducing the barrier to entry for large-scale neural network experiments.
Improved model reproducibility and consistency
Provides an open source and extensible framework
Limitations:
The number of types and models of tasks currently supported needs to be further expanded in the future.
Limited to PyTorch, so compatibility issues with other deep learning frameworks may arise.
Dataset and model quality control and continuous update are required.
There may be dependencies on specific hardware environments.
👍