EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce

Authors

  • Minhyeong Yu Enhans, Seoul, South Korea
  • Seunghyun Lee Enhans, Seoul, South Korea
  • Wonduk Seo Enhans, Seoul, South Korea Peking University, Beijing, China

DOI:

https://doi.org/10.1609/aaaiss.v9i1.42935

Abstract

Product mapping decides whether two e-commerce listings refer to the same product. We propose EPM-RL, an on-premise framework that distills LLM comparison reasoning into a local model via PEFT and RL, improving balanced matching under privacy and cost constraints. On an internal benchmark, EPM-RL yields the strongest F1 by boosting recall while maintaining competitive precision.

Downloads

Published

2026-06-23

How to Cite

Yu, M., Lee, S., & Seo, W. (2026). EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce. Proceedings of the AAAI Symposium Series, 9(1), 227–227. https://doi.org/10.1609/aaaiss.v9i1.42935

Issue

Section

AI in Business: Intelligent Transformation and Management (Extended Abstracts)