Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

What Data is Really Necessary? A Feasibility Study of Inference Data Minimization for Recommender Systems

Created by
  • Haebom

Author

Jens Leysen, Marco Favier, Bart Goethals

Outline

This paper addresses the problem of applying privacy minimization principles in recommender systems. Applying privacy minimization principles is challenging because recommender systems rely on vast amounts of personal data. This paper conducts a feasibility study on implicit feedback inference data minimization. We present a novel problem definition, analyze various minimization techniques, and investigate key factors influencing their effectiveness. We demonstrate that significant inference data reduction without performance degradation is technically feasible. However, its practicality depends heavily on technical settings (e.g., performance objectives, model selection) and user characteristics (e.g., history size, preference complexity). Therefore, while demonstrating technical feasibility, we conclude that data minimization remains a practical challenge, and its dependence on technical and user context makes it difficult to implement a universal standard for data "necessity."

Takeaways, Limitations

Takeaways: Demonstrates that significant data reduction in recommender systems is technically feasible. This contributes to finding a balance between privacy and system performance.
Limitations: The practicality of data minimization depends heavily on technical settings and user characteristics, making it difficult to establish a universal "necessity" criterion for data. The degree to which data can be reduced without performance degradation varies across systems and users.
👍