Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Automatically Detecting Online Deceptive Patterns

Created by
  • Haebom

Author

Asmit Nayak, Shirley Zhang, Yash Wani, Rishabh Khandelwal, Kassem Fawaz

Outline

This paper presents the AutoBot framework, which automatically identifies and mitigates deceptive patterns in digital interfaces. AutoBot analyzes website screenshots to accurately identify and localize deceptive patterns without requiring HTML code. A two-stage pipeline utilizing specialized vision models and large-scale language models (LLMs) analyzes the visual and textual features of the website to identify deceptive patterns. Furthermore, AutoBot is used to train smaller language models by extracting knowledge from a "teacher" LLM, generating a synthetic dataset. AutoBot is implemented as three sub-applications: a browser extension for users, a Lighthouse audit tool for developers, and a measurement tool for researchers and regulators, supporting a variety of web stakeholders. Evaluation results demonstrate its effectiveness, achieving an F1 score of 0.93 for deceptive pattern detection.

Takeaways, Limitations

Takeaways:
Introducing a new framework (AutoBot) that can effectively identify and mitigate deceptive patterns on websites.
Analysis is possible using only screenshots, without relying on HTML code.
Providing applications for a variety of stakeholders, including users, developers, researchers, and regulators.
Achieved high accuracy (F1-score 0.93).
A method for generating synthetic datasets using LLM is presented.
Limitations:
Only performance evaluation results for a specific type of deceptive pattern are presented; verification of generalization performance for various types of deceptive patterns is needed.
Screenshot-based analysis may have limitations in responding to dynamic web element changes.
Bias and potential for error due to performance dependence of LLM.
Inadequate assessment of adaptability to changes in long-term deception patterns.
👍