Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Jina-reranker-v3: Last but Not Late Interaction for Document Reranking

Created by
  • Haebom

Author

Feng Wang, Yuqing Li, Han Xiao

Outline

Jina-reranker-v3 is a multilingual document reranking model with 0.6B parameters that introduces a novel last-but-not-late interaction. Unlike late interaction models like ColBERT, it performs causal self-attention between the query and documents within the same context window, enabling rich cross-document interactions before extracting contextual embeddings from the last token of each document. This compact architecture achieves state-of-the-art BEIR performance with 61.94 nDCG@10, while being significantly smaller than generative list-wise reranking models.

Takeaways, Limitations

Takeaways:
Achieves cutting-edge performance despite its compact size (BEIR 61.94 nDCG@10)
We present a novel architecture that leverages causal self-attention between queries and documents, rather than relying on late-stage interaction.
Increased efficiency with smaller size than generative list-wise reordering models.
Limitations:
Specific Limitations is not stated in the abstract
👍