This is a page that curates AI-related papers published worldwide. All content here is summarized using Google Gemini and operated on a non-profit basis. Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.
Multi-Faceted Large Embedding Tables for Pinterest Ads Ranking
Created by
Haebom
Author
Runze Su, Jiayin Jin, Jiacheng Li, Sihan Wang, Guangtong Bai, Zelun Wang, Li Tang, Yixiong Meng, Huasen Wu, Zhimeng Pan, Kungang Li, Han Sun, Zhifang Liu, Haoyang Li, Siping Ji, Degao Peng, Jinfeng Zhuang, Ling Leng, Prathibha Deshikachar
Outline
This paper presents the challenges encountered in integrating large-scale embedding tables into the Pinterest ad ranking model and their solutions. Initial attempts to train large-scale embedding tables from scratch proved unsuccessful, leading to the introduction of a novel multifaceted pre-training technique that integrates various pre-training algorithms. This approach significantly improved the quality of the embedding tables and significantly improved performance in both Click-Through Rate (CTR) and Conversion Rate (CVR). Furthermore, to overcome GPU memory limitations and enhance scalability, a CPU-GPU hybrid serving infrastructure was designed and deployed in the Pinterest ad system. This resulted in a 1.34% reduction in CPC and a 2.60% increase in CTR. Overall latency remained unchanged.
Takeaways, Limitations
•
Takeaways:
◦
We demonstrate that multifaceted pre-learning techniques can significantly improve the performance of large embedding tables.
◦
We propose that a CPU-GPU hybrid serving infrastructure can address the scalability issues of large embedding tables.
◦
Implemented in practice with Pinterest's advertising system, achieving tangible results, including reduced CPC and increased CTR.
•
Limitations:
◦
Lack of description of the specific algorithmic and implementation details of the proposed multifaceted dictionary learning technique.
◦
Lack of analysis on applicability to other advertising platforms or recommendation systems.
◦
Lack of detailed description of the specific structure and performance of CPU-GPU hybrid serving infrastructure.