Online Fraud Detection via Test-Time Retrieval-Based Representation Enrichment

Authors

  • Yiran Qiao Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China Key Lab of AI Safety, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, CAS, Beijing, China
  • Ningtao Wang Independent Researcher
  • Yuncong Gao Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China Key Lab of AI Safety, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, CAS, Beijing, China
  • Yang Yang Independent Researcher
  • Xing Fu Independent Researcher
  • Weiqiang Wang Independent Researcher
  • Xiang Ao Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China Key Lab of AI Safety, Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, CAS, Beijing, China CASMINO Ltd., Suzhou, China

DOI:

https://doi.org/10.1609/aaai.v39i12.33359

Abstract

Anti-fraud machine learning systems are perpetually confronted with the significant challenge of concept drift, driven by the continuous and intense evolution of fraudulent techniques. That is, outdated models trained on historical fraudulent behaviors often fall short in addressing the evolving tactics of malicious users over time. The key issue lies in effectively tackling the rapid and significant evolution of fraudsters' behaviors to detect these emerging and unforeseen anomalies. In this paper, we propose a solution by directly accessing real-time data and introducing a lightweight plug-in approach named TRE (Test-time Retrieval-based Representation Enrichment). Considering the similarity among samples, TRE employs a retriever to efficiently identify the top-K most relevant recent samples and implements an aggregation strategy to provide neighboring embeddings to the predictor. It thus adjusts the trained classifiers during the test time, providing them with the information from the latest unlabeled data. Extensive experiments on three large-scale real-world datasets demonstrate the superiority of TRE. By consistently incorporating information from the nearest neighbors, TRE demonstrates high adaptability and surpasses existing methods in performance.

Downloads

Published

2025-04-11

How to Cite

Qiao, Y., Wang, N., Gao, Y., Yang, Y., Fu, X., Wang, W., & Ao, X. (2025). Online Fraud Detection via Test-Time Retrieval-Based Representation Enrichment. Proceedings of the AAAI Conference on Artificial Intelligence, 39(12), 12470-12478. https://doi.org/10.1609/aaai.v39i12.33359

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II