Binary Split Categorical Feature with Mean Absolute Error Criteria in CART

Authors

  • Peng Yu University of Electronic Science and Technology of China Shopify Télécom Paris
  • Yike Chen University of Electronic Science and Technology of China
  • Chao Xu University of Electronic Science and Technology of China
  • Albert Bifet Télécom Paris The University of Waikato
  • Jesse Read École polytechnique

DOI:

https://doi.org/10.1609/aaai.v40i33.40020

Abstract

In the context of the Classification and Regression Trees (CART) algorithm, the efficient splitting of categorical features using standard criteria like GINI and Entropy is well-established. However, using the Mean Absolute Error (MAE) criterion for categorical features has traditionally relied on various numerical encoding methods. This paper demonstrates that unsupervised numerical encoding methods are not viable for MAE criteria. Furthermore, we present a novel and efficient splitting algorithm that addresses the challenges of handling categorical features with the MAE criterion. Our findings underscore the limitations of existing approaches and offer a promising solution to enhance the handling of categorical data in CART algorithms.

Downloads

Published

2026-03-14

How to Cite

Yu, P., Chen, Y., Xu, C., Bifet, A., & Read, J. (2026). Binary Split Categorical Feature with Mean Absolute Error Criteria in CART. Proceedings of the AAAI Conference on Artificial Intelligence, 40(33), 27961–27968. https://doi.org/10.1609/aaai.v40i33.40020

Issue

Section

AAAI Technical Track on Machine Learning X