Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Predict, Cluster, Refine: A Joint Embedding Predictive Self-Supervised Framework for Graph Representation Learning

Created by
  • Haebom

Author

Srinitish Srinivasan, Omkumar C.U.

Outline

This paper proposes a novel joint embedding prediction framework (JPEB-GSSL) to address computational inefficiency, contrastive target dependency, and representation collapse in graph self-supervised learning (SSL). We overcome the limitations of existing methods—feature reconstruction, speech sampling, and reliance on complex decoders—and present a non-contrast, view-invariant joint embedding prediction architecture that preserves semantic and structural information without contrastive targets or speech sampling. Furthermore, we introduce a semantic target term that integrates pseudo-labels derived using Gaussian Mixture Models (GMMs) to evaluate the contribution of latent features, thereby enhancing node discriminability. By leveraging single-context and multi-target relationships between subgraphs, we outperform existing state-of-the-art graph SSL methods across various benchmarks. This provides a computationally efficient and collapse-resistant paradigm for combining spatial and semantic graph features.

Takeaways, Limitations

Takeaways:
Effectively solves the computational inefficiency, contrastive target dependency, and representation collapse problems of existing graph SSL methods.
Achieve excellent performance without contrast loss or complex decoders.
Improve node identification and generalization performance through semantic recognition target terms.
We present a novel framework that effectively combines spatial and semantic graph features.
Achieve cutting-edge performance across a variety of benchmarks.
Limitations:
Further analysis of the generalization performance of the proposed method is needed.
Scalability evaluation for various graph structures and sizes is required.
Further research is needed on the accuracy of GMM-based pseudo-label generation.
Application and performance verification for actual applications are required.
👍