Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Towards Explainable Job Title Matching: Leveraging Semantic Textual Relatedness and Knowledge Graphs

Created by
  • Haebom

Author

Vadim Zadykian, Bruno Andrade, Haithem Afli

Outline

This paper studies semantic textual relevance (STR) for job title matching, a critical task in resume recommendation systems. To improve STR beyond simple lexical similarity and capture nuanced relationships, we propose a self-supervised hybrid architecture combining dense sentence embeddings and domain-specific knowledge graphs (KGs). Unlike previous studies, we perform a hierarchical evaluation by segmenting the STR score continuum into low, medium, and high semantic relevance regions, allowing for a detailed analysis of model performance. Evaluating various embedding models with and without KGs, we demonstrate that a fine-tuned SBERT model with KGs significantly improves performance, reducing root mean square error (RMSE) by up to 25% in high-STR regions. This highlights the effectiveness of combining KGs with text embeddings and the importance of analyzing region-specific performance.

Takeaways, Limitations

Takeaways:
We demonstrate that a hybrid architecture based on dense sentence embeddings incorporating knowledge graphs (KGs) is effective in improving semantic text relevance (STR) in job title matching.
A method of evaluating the STR score continuum by stratifying it reveals the model's strengths and weaknesses more clearly.
Significant performance improvements in the high STR domain could improve accuracy and explainability in applications such as human resource (HR) systems.
Limitations:
This study was limited to a specific domain (job title matching), so generalizability may be limited.
The quality and scale of the knowledge graph used may affect model performance, and further research is needed.
Further research is needed on the optimal splitting criteria for hierarchical evaluation methods.
👍