Barely Supervised Learning for Graph-Based Fraud Detection

Authors

  • Hang Yu Shanghai University
  • Zhengyang Liu Shanghai University
  • Xiangfeng Luo Shanghai University

DOI:

https://doi.org/10.1609/aaai.v38i15.29593

Keywords:

ML: Semi-Supervised Learning, ML: Graph-based Machine Learning, DMKM: Graph Mining, Social Network Analysis & Community

Abstract

In recent years, graph-based fraud detection methods have attracted increasing attention for their ability to handle camouflage in fraud scenarios. However, these methods often rely on a substantial proportion of samples as the training set, even though annotated samples are scarce in real-life scenarios. Consistency regularization, a principle from semi-supervised learning, posits that an unlabeled sample should be classified into the same category as its own perturbations. Inspired by this principle, this study uses unlabeled samples as auxiliary data during model training and designs a novel barely supervised learning method to address the challenge of limited annotated samples in fraud detection. Specifically, to handle camouflage, we apply disentangled representation learning based on edge information to a small subset of annotated nodes. This approach partitions node features into three distinct components representing different connected edges, providing a foundation for the subsequent augmentation of unlabeled samples. For the unlabeled nodes used in auxiliary training, we apply both strong and weak augmentation and design regularization losses to improve detection performance when labeled samples are extremely limited. On five publicly available datasets, the proposed model shows superior detection capability over baseline models.
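The consistency principle described in the abstract can be illustrated with a small sketch. This is not the authors' regularization loss, which the abstract does not specify; it is a generic FixMatch-style consistency term, shown only to make the idea of pairing weak and strong augmentations concrete. The function names, the confidence threshold of 0.9, and the two-class logits are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def consistency_loss(weak_logits, strong_logits, threshold=0.9):
    """FixMatch-style consistency term for a batch of unlabeled nodes.

    weak_logits / strong_logits: per-node class logits produced by the same
    model on a weakly and a strongly augmented view (hypothetical inputs).
    A node contributes only if its weak-view prediction is confident enough;
    its weak-view argmax then serves as the pseudo-label for the strong view.
    Returns (mean loss over the whole batch, number of nodes that contributed).
    """
    if not weak_logits:
        return 0.0, 0
    total, kept = 0.0, 0
    for w, s in zip(weak_logits, strong_logits):
        p_weak = softmax(w)
        confidence = max(p_weak)
        if confidence < threshold:
            continue  # low-confidence pseudo-labels are masked out
        pseudo_label = p_weak.index(confidence)
        p_strong = softmax(s)
        total += -math.log(p_strong[pseudo_label])  # cross-entropy
        kept += 1
    return total / len(weak_logits), kept


# Two unlabeled nodes, two classes (e.g. benign vs. fraud).
weak = [[4.0, 0.0], [0.1, 0.0]]    # first node confident, second is not
strong = [[2.0, 0.0], [0.0, 3.0]]
loss, kept = consistency_loss(weak, strong)
```

In this toy batch only the first node passes the threshold, so the second node's disagreement between views does not affect the loss. In a training loop, a term like this would typically be added to the supervised loss on the few labeled nodes with some weighting factor, which is likewise an assumption here.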

Published

2024-03-24

How to Cite

Yu, H., Liu, Z., & Luo, X. (2024). Barely Supervised Learning for Graph-Based Fraud Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 16548-16557. https://doi.org/10.1609/aaai.v38i15.29593

Section

AAAI Technical Track on Machine Learning VI