Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

REMOTE: A Unified Multimodal Relation Extraction Framework with Multilevel Optimal Transport and Mixture-of-Experts

Created by
  • Haebom

Author

Xinkui Lin, Yongxiu Xu, Minghao Tang, Shilong Zhang, Hongbo Xu, Hao Xu, Yubin Wang

Outline

This paper proposes REMOTE, a novel unified framework for multimodal relation extraction (MRE). REMOTE simultaneously extracts intra- and inter-modal relations between text entities and visual objects by leveraging multilevel optimal transport and a mixture of experts. It overcomes the single relation extraction and computational duplication inherent in existing methods, and dynamically selects optimal interaction features for various relation triplets through a mixture of experts mechanism. Furthermore, it introduces a multilevel optimal transport fusion module, preserving the benefits of multilayer encoding without losing low-level information, generating more expressive representations. We evaluate the effectiveness of REMOTE on a new dataset, UMRE, and achieve state-of-the-art performance on existing MRE datasets. The source code is available on GitHub.

Takeaways, Limitations

Takeaways:
Presenting an integrated framework for simultaneously extracting various types of relationship triplets.
Efficient and expressive relation extraction through multi-layer optimal propagation and expert mixing mechanisms.
Objective performance evaluation and comparison through the construction of the UMRE dataset.
Achieving cutting-edge performance and activating research through open source disclosure
Limitations:
The size and diversity of the UMRE dataset could be improved in the future.
There may be performance differences for certain types of relationships.
The computational complexity of multilayer optimal transmission modules can be high.
👍