PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection

Tri Cao; Chengyu Huang; Yuexin Li; Wang Huilin; Amy He; Nay Oo; Bryan Hooi

doi:10.1609/aaai.v39i27.35003

Authors

Tri Cao National University of Singapore
Chengyu Huang National University of Singapore
Yuexin Li National University of Singapore
Wang Huilin National University of Singapore
Amy He Massachusetts Institute of Technology
Nay Oo NCS Cyber Special Ops-R&D
Bryan Hooi National University of Singapore

DOI:

https://doi.org/10.1609/aaai.v39i27.35003

Abstract

Phishing attacks are a major threat to online security, exploiting user vulnerabilities to steal sensitive information. Various methods have been developed to counteract phishing, each with varying levels of accuracy, but they also face notable limitations. In this study, we introduce PhishAgent, a multimodal agent that combines a wide range of tools, integrating both online and offline knowledge bases with Multimodal Large Language Models (MLLMs). This combination leads to broader brand coverage, which enhances brand recognition and recall. Furthermore, we propose a multimodal information retrieval framework designed to extract the relevant top k items from offline knowledge bases, using available information from a webpage, including logos and HTML. Our empirical results, based on three real-world datasets, demonstrate that the proposed framework significantly enhances detection accuracy and reduces both false positives and false negatives, while maintaining model efficiency. Additionally, PhishAgent shows strong resilience against various types of adversarial attacks.

PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information