Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Symphony: A Decentralized Multi-Agent Framework for Scalable Collective Intelligence

Created by
  • Haebom

Author

Ji Wang, Kashing Chen, Xinyuan Song, Ke Zhang, Lynn Ai, Eric Yang, Bill Shi

Outline

This paper proposes Symphony, a distributed multi-agent system, to address the high deployment costs, inflexible communication topologies, and limited adaptability of existing centralized large-scale language model (LLM)-based agent frameworks. Symphony enables the coordination of lightweight LLMs on consumer-grade GPUs and introduces three key mechanisms: a distributed ledger for recording features, a beacon selection protocol for dynamic task allocation, and CoT-based weighted-outcome voting. This design creates a low-overhead coordination system that is privacy-preserving, scalable, and fault-tolerant. Experimentally, Symphony outperforms existing baselines on inference benchmarks, demonstrating significant accuracy gains and robust performance across a wide range of model capacities.

Takeaways, Limitations

Takeaways:
We present the efficiency and feasibility of a distributed multi-agent system leveraging lightweight LLM on consumer-grade GPUs.
It provides a new architecture that overcomes the limitations of existing centralized systems.
We present a system design that simultaneously satisfies privacy protection, scalability, and fault tolerance.
It demonstrates its practicality by outperforming existing systems in inference benchmarks.
Limitations:
Further validation of the generalizability of the benchmark presented in this paper is needed.
There is a lack of performance evaluation in various real-world environments.
Maintenance difficulties are expected due to the complexity of the system.
Additional analysis is needed to address potential consensus delays and failures in distributed environments.
👍