MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context

Authors

  • Shuai Lyu The Hong Kong Polytechnic University, Hong Kong SAR, China Laboratory for Artificial Intelligence in Design, Hong Kong SAR, China
  • Rongchen Zhang The Hong Kong Polytechnic University, Hong Kong SAR, China Laboratory for Artificial Intelligence in Design, Hong Kong SAR, China
  • Zeqi Ma The Hong Kong Polytechnic University, Hong Kong SAR, China Laboratory for Artificial Intelligence in Design, Hong Kong SAR, China
  • Fangjian Liao The Hong Kong Polytechnic University, Hong Kong SAR, China Laboratory for Artificial Intelligence in Design, Hong Kong SAR, China
  • Dongmei Mo The Hong Kong Polytechnic University, Hong Kong SAR, China Laboratory for Artificial Intelligence in Design, Hong Kong SAR, China
  • Waikeung Wong The Hong Kong Polytechnic University, Hong Kong SAR, China Laboratory for Artificial Intelligence in Design, Hong Kong SAR, China

DOI:

https://doi.org/10.1609/aaai.v39i6.32634

Abstract

Few-shot defect multi-classification (FSDMC) is an emerging trend in quality control within industrial manufacturing. However, current FSDMC research often lacks generalizability due to its focus on specific datasets. Additionally, defect classification heavily relies on contextual information within images, and existing methods fall short of effectively extracting this information. To address these challenges, we propose a general FSDMC framework called MVREC, which offers two primary advantages: (1) MVREC extracts general features for defect instances by incorporating the pre-trained AlphaCLIP model. (2) It utilizes a region-context framework to enhance defect features by leveraging mask region input and multi-view context augmentation. Furthermore, Few-shot Zip-Adapter(-F) classifiers within the model are introduced to cache the visual features of the support set and perform few-shot classification. We also introduce MVTec-FS, a new FSDMC benchmark based on MVTec AD, which includes 1228 defect images with instance-level mask annotations and 46 defect types. Extensive experiments conducted on MVTec-FS and four additional datasets demonstrate its effectiveness in general defect classification and its ability to incorporate contextual information to improve classification performance.

Downloads

Published

2025-04-11

How to Cite

Lyu, S., Zhang, R., Ma, Z., Liao, F., Mo, D., & Wong, W. (2025). MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context. Proceedings of the AAAI Conference on Artificial Intelligence, 39(6), 5937–5945. https://doi.org/10.1609/aaai.v39i6.32634

Issue

Section

AAAI Technical Track on Computer Vision V