[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

LLM-RecG: A Semantic Bias-Aware Framework for Zero-Shot Sequential Recommendation

Created by
  • Haebom

Author

Yunzhe Li, Junting Wang, Hari Sundaram, Zhining Liu

Outline

In this paper, we propose a novel framework to address the domain semantic bias problem in zero-shot cross-domain sequential recommendation (ZCDSR). Existing ZCDSR models have improved cross-domain knowledge transfer by utilizing large-scale language models (LLMs), but have limitations in accuracy due to semantic bias caused by vocabulary and content differences between domains. In this paper, we address this issue by improving cross-domain alignment at both the item level and the sequential level. At the item level, we introduce a generalization loss function for aligning cross-domain item embeddings to secure similarity between domains while maintaining the unique characteristics of items in each domain. At the sequential level, we develop a method to cluster source domain user sequences and transfer user behavior patterns through attention-based aggregation to dynamically adapt user embeddings when inferring target domains. As a result, we enable effective zero-shot recommendation without target domain interaction.

Takeaways, Limitations

Takeaways:
A new framework is presented to contribute to improving the performance of LLM-based ZCDSR.
Effectively solves domain semantic bias problems at the item and sequential levels
Aligning cross-domain embeddings and maintaining within-domain diversity via generalization loss function
Effective zero-shot recommendation possible through attention-based user behavior pattern transfer
Recommendations for new domains without target domain data possible
Limitations:
Additional experiments and analysis are needed to determine the actual performance and generalization ability of the proposed framework.
Need to verify applicability to various domains and datasets
There is a possibility of overfitting to certain domains.
Potential increase in computational costs
👍