Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Breaking the ICE: Exploring promises and challenges of benchmarks for Inference Carbon & Energy estimation for LLMs

Created by
  • Haebom

Author

Samarth Sikand, Rohit Mehra, Priyavanshi Pathania, Nikhil Bamby, Vibhu Saujanya Sharma, Vikrant Kaulgud, Sanjay Podder, Adam P. Burden

Outline

This paper raises concerns about the energy consumption and environmental impact of generative AI, especially large-scale language models (LLMs), and emphasizes the importance of energy consumption estimation in establishing sustainability strategies. We point out the shortcomings of existing energy consumption monitoring and estimation tools (high input data requirements, invasive nature, high error rates, etc.) and aim to address these issues by leveraging the latest LLM benchmark data. In this paper, we present R-ICE, a novel framework for estimating prompted-level inference carbon emissions, and present promising validation results demonstrating the potential of benchmark-based modeling. R-ICE provides a practical and non-invasive method that enables new use cases such as dynamic LLM routing and carbon accounting.

Takeaways, Limitations

Takeaways:
A new framework R-ICE for estimating energy consumption and carbon emissions in LLMs is presented
Proposal of a Benchmark Data Utilization Method to Overcome __T84719__ of Existing Monitoring and Estimation Tools
Presents new use cases including dynamic LLM routing and carbon accounting
Confirming the potential of benchmark-based modeling and raising the need for further research
Limitations:
Lack of detailed implementation and algorithm description of the R-ICE framework.
Further research is needed on generalizability across different LLM and hardware environments.
Need more detailed explanation of the scope and reliability of the verification results?
Additional analysis is needed to compare the accuracy of R-ICE with existing methods.
👍