[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants

Created by
  • Haebom

Author

Ruofan Liu, Yun Lin, Silas Yeo Shuen Yu, Xiwen Teoh, Zhenkai Liang, Jin Song Dong

Outline

This paper addresses the problem that psychologically persuasive phishing emails generated using large-scale language models (LLMs) can evade existing detection systems. To address this issue, we propose PiMRef, a reference-based phishing email detector that leverages knowledge-based invariants. PiMRef extracts the alleged identity of a sender from an email, verifies the legitimacy of a domain using a predefined knowledge base, and detects call-to-action prompts that encourage user engagement. Contradictory claims are flagged as phishing indicators and provided with human-readable explanations. It demonstrates higher precision (8.8% improvement) and efficiency than existing methods, and achieves excellent performance (precision 92.1%, recall 87.9%, and average execution time 0.05 s) in real-world evaluations.

Takeaways, Limitations

Takeaways:
Presenting an effective defense strategy against sophisticated phishing email attacks based on LLM.
Proof of the superiority of reference-based detection methods using knowledge-based invariants.
Demonstrating the practical value of the PiMRef system with high accuracy and efficiency.
Ensure transparency by providing human-readable explanations.
Limitations:
High dependence on the completeness and accuracy of the knowledge base. Inaccurate or incomplete information in the knowledge base can cause a decrease in detection performance.
Potential vulnerability to attacks that utilize new phishing techniques or information not in the knowledge base.
Performance may vary depending on the characteristics of the evaluation dataset. Additional validation in various environments is required.
👍