Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Granular-Ball-Induced Multiple Kernel K-Means

Created by
  • Haebom

Author

Shuyin Xia, Yifan Wang, Lifeng Shen, Guoyin Wang

Outline

This paper highlights that existing multi-kernel clustering algorithms (e.g., multi-kernel K-means) suffer from computational efficiency and robustness issues in complex data distributions. This is because optimization methods rely on inter-point relationships, making it difficult to accurately capture the inherent structure and diversity of the data set. Furthermore, complex interactions among multiple kernels further exacerbate these issues, impacting the ability to cluster data points in high-dimensional space. In this paper, we improve the multi-kernel clustering framework by leveraging granular-ball computing. The core of granular-ball computing is to adaptively fit the data distribution using balls, ranging from coarse to tolerant, to fit the data distribution. Each ball can enclose data points based on a density consistency measure. This ball-based data description improves computational efficiency and robustness against unknown noise. Specifically, based on the granular-ball representation, we propose the granular-ball kernel (GBK) and its corresponding granular-ball multi-kernel K-means framework (GB-MKKM) for efficient clustering. The proposed GB-MKKM framework using granular ball relations in multi-kernel space demonstrates its efficiency and superior clustering performance in empirical evaluations on various clustering tasks.

Takeaways, Limitations

Takeaways:
A novel multi-kernel clustering framework (GB-MKKM) using granular ball computing is presented.
Improving the computational efficiency and robustness of the existing multi-kernel K-means algorithm.
Demonstrated improved performance and efficiency across a variety of clustering tasks.
Limitations:
Further research is needed to determine the practical applicability and scalability of the proposed GB-MKKM algorithm.
Need to evaluate generalization performance for various data types and sizes
Optimization research is needed to establish criteria for measuring granule ball size and density consistency.
👍