Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Multi-Faceted Large Embedding Tables for Pinterest Ads Ranking

Created by
  • Haebom

Author

Runze Su, Jiayin Jin, Jiacheng Li, Sihan Wang, Guangtong Bai, Zelun Wang, Li Tang, Yixiong Meng, Huasen Wu, Zhimeng Pan, Kungang Li, Han Sun, Zhifang Liu, Haoyang Li, Siping Ji, Ling Leng, Prathibha Deshikachar

Outline

Integrating large embedding tables into Pinterest's ad ranking model, we encountered not only common challenges like sparsity and scalability, but also Pinterest-specific challenges. Attempts to train large embedding tables from scratch yielded poor results, so we introduced a novel, multifaceted pre-training approach that integrated various pre-training algorithms. This significantly improved the embedding table, leading to performance gains in both click-through rate (CTR) and conversion rate (CVR). Furthermore, to overcome GPU memory limitations and enhance scalability, we designed and deployed a CPU-GPU hybrid serving infrastructure into the Pinterest ad system. As a result, we achieved a 1.34% reduction in online cost-per-click (CPC) and a 2.60% increase in CTR, with no change in end-to-end latency.

Takeaways, Limitations

Takeaways:
We demonstrate that a multifaceted pre-training approach can significantly improve the performance of large embedding tables.
Demonstrates that CPU-GPU hybrid serving infrastructure can overcome GPU memory limitations and increase scalability.
By applying it to the Pinterest advertising system, we achieved positive results such as reduced CPC and increased CTR.
Limitations:
This paper lacks a detailed explanation of the specific algorithm and details of the proposed multifaceted dictionary learning method.
Because this method is optimized for Pinterest's unique environment, its applicability to other systems requires additional verification.
Lack of detailed technical explanation of building a CPU-GPU hybrid serving infrastructure.
👍