Learning Only When It Matters: Cost-Aware Long-Tailed Classification

Yu-Cheng He; Yao-Xiang Ding; Han-Jia Ye; Zhi-Hua Zhou

doi:10.1609/aaai.v38i11.29133

Authors

Yu-Cheng He National Key Laboratory for Novel Software Technology, Nanjing University, China
Yao-Xiang Ding State Key Laboratory of CAD & CG, Zhejiang University, China
Han-Jia Ye National Key Laboratory for Novel Software Technology, Nanjing University, China School of Artificial Intelligence, Nanjing University, China
Zhi-Hua Zhou National Key Laboratory for Novel Software Technology, Nanjing University, China School of Artificial Intelligence, Nanjing University, China

DOI:

https://doi.org/10.1609/aaai.v38i11.29133

Keywords:

ML: Multi-class/Multi-label Learning & Extreme Classification

Abstract

Most current long-tailed classification approaches assume the cost-agnostic scenario, where the training distribution of classes is long-tailed while the testing distribution of classes is balanced. Meanwhile, the misclassification costs of all instances are the same. On the other hand, in many real-world applications, it is more proper to assume that the training and testing distributions of classes are the same, while the misclassification cost of tail-class instances is varied. In this work, we model such a scenario as cost-aware long-tailed classification, in which the identification of high-cost tail instances and focusing learning on them thereafter is essential. In consequence, we propose the learning strategy of augmenting new instances based on adaptive region partition in the feature space. We conduct theoretical analysis to show that under the assumption that the feature-space distance and the misclassification cost are correlated, the identification of high-cost tail instances can be realized by building region partitions with a low variance of risk within each region. The resulting AugARP approach could significantly outperform baseline approaches on both benchmark datasets and real-world product sales datasets.

Learning Only When It Matters: Cost-Aware Long-Tailed Classification

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription