EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce
DOI:
https://doi.org/10.1609/aaaiss.v9i1.42935Abstract
Product mapping decides whether two e-commerce listings refer to the same product. We propose EPM-RL, an on-premise framework that distills LLM comparison reasoning into a local model via PEFT and RL, improving balanced matching under privacy and cost constraints. On an internal benchmark, EPM-RL yields the strongest F1 by boosting recall while maintaining competitive precision.Downloads
Published
2026-06-23
How to Cite
Yu, M., Lee, S., & Seo, W. (2026). EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce. Proceedings of the AAAI Symposium Series, 9(1), 227–227. https://doi.org/10.1609/aaaiss.v9i1.42935
Issue
Section
AI in Business: Intelligent Transformation and Management (Extended Abstracts)