Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Request-Only Optimization for Recommendation Systems

Created by
  • Haebom

Author

Liang Guo, Wei Li, Lucy Liao, Huihui Cheng, Rui Zhang, Yu Shi, Yueming Wang, Yanzun Huang, Keke Zhai, Pengchao Wang, Timothy Shi, Xuan Cao, Shengzhi Wang, Renqin Cai, Zhaojie Gong, Omkar Vichare, Rui Jian, Leon Gao, Shiyan Deng, Xingyu Liu, Xiong Zhang, Fu Li, Wenlei Xie, Bin Wen, Rui Li, Xing Liu, Jiaqi Zhai

Outline

This paper presents Request-Only Optimizations (ROO), a novel learning and modeling paradigm for improving the efficiency and performance of large-scale deep learning recommendation models (DLRM). ROO processes user requests as data units, saving data storage space compared to traditional user impression-based processing. It also eliminates redundant computation and communication for multiple impressions within a request, enabling a scalable neural network architecture to better capture user interest signals. This is achieved by co-designing data (request-only data), infrastructure (request-only data processing pipelines), and model architecture (request-only neural network architectures). It is particularly advantageous for request-only architectures such as generative recommendation models (GR).

Takeaways, Limitations

Takeaways:
Reduce data storage space: Eliminate redundant data by processing user requests on a per-data basis.
Improved learning efficiency: Eliminate redundant computation and communication for multiple exposures within a request.
Improving model performance: Capturing user interest signals more accurately with scalable neural network architectures.
Increased potential for leveraging new architectures such as generative recommendation models.
Limitations:
Lack of specific technical details on the actual implementation and application of the ROO paradigm.
Further validation of generalizability across different types of recommender systems and datasets is needed.
Further research is needed on the design and optimization of request-only architectures.
👍