Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey

Created by
  • Haebom

Author

Jing Liu, Yao Du, Kun Yang, Jiaqi Wu, Yan Wang, Xiping Hu, Zehua Wang, Yang Liu, Peng Sun, Azzedine Boukerche, Victor CM Leung

Outline

This paper comprehensively surveys the intersection of distributed intelligence and model optimization in Edge-Cloud Collaborative Computing (ECCC). ECCC, which integrates edge devices and cloud resources to enable efficient, low-latency processing, has emerged as a key paradigm for addressing the computing demands of modern intelligent applications. This paper provides a structured tutorial on the underlying architecture, enabling technologies, and emerging applications. It systematically analyzes model optimization methods, such as model compression, adaptation, and neural network architecture exploration, along with AI-based resource management strategies that balance performance, energy efficiency, and latency requirements. Furthermore, it explores critical aspects of enhancing privacy and security within ECCC systems and examines real-world deployments across a range of applications, including autonomous driving, healthcare, and industrial automation. Performance analysis and benchmarking techniques are also thoroughly explored to establish evaluation standards for these complex systems. Finally, it presents a roadmap for addressing the ongoing challenges of heterogeneity management, real-time processing, and scalability by highlighting key research directions, including LLM deployment, 6G integration, neuromorphic computing, and quantum computing.

Takeaways, Limitations

Takeaways:
We present a comprehensive overview of the latest trends in distributed intelligence and model optimization in the ECCC system.
We systematically analyze model optimization techniques, AI-based resource management strategies, and privacy and security enhancement measures.
We present real-world deployment cases and performance analysis and benchmarking techniques across various applications.
It suggests future research directions such as LLM deployment, 6G integration, neuromorphic computing, and quantum computing.
Limitations:
Because the paper is in the form of a survey, it focuses on a comprehensive analysis of existing studies rather than new research findings.
Because it covers a broad topic rather than an in-depth analysis of a specific technology or application, the details may be somewhat lacking.
The topics presented as future research directions are still in the early stages of research, and actual implementation and application may take considerable time.
👍