[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree

Created by
  • Haebom

Author

Sam Johnson, Viet Pham, Thai Le

Outline

This paper demonstrates that while LLM-based web crawling agents are powerful in their automation, they are vulnerable to indirect prompt injection (IPI) attacks. We demonstrate that an attacker can inject common adversarial triggers into the HTML of a web page to hijack the behavior of the agent parsing the HTML using the accessibility tree, thereby causing unintended or malicious behavior. Using the Greedy Coordinate Gradient (GCG) algorithm and a Browser Gym agent based on Llama-3.1, we demonstrate a system that achieves high success rates across both targeted and generic attacks (including login credential leaks and forced ad clicks) on real websites. The experimental results highlight significant security risks and the need for stronger defenses as LLM-based autonomous web agents become more widely adopted. The system software ( https://github.com/sej2020/manipulating-web-agents) is released under the MIT license, and a publicly accessible demo website ( http://lethaiq.github.io/attack-web-llm-agent) is also provided.

Takeaways, Limitations

Takeaways: Reveals security vulnerabilities in LLM-based web crawling agents and empirically demonstrates the dangers of indirect prompt injection attacks. Suggests the need for strengthening the security of LLM-based autonomous systems. Provides an important foundation for research on attack techniques and defense strategies. Easy reproducibility and verification of research results through open source software and demo websites.
Limitations: The current study may be limited to a specific LLM (Llama-3.1) and a specific type of web crawling agent that utilizes accessibility trees. Further research is needed on various LLM and web crawling strategies. Lack of specific discussion on the proposed defense strategies. May not fully reflect the complexity of real-world attack scenarios.
👍