EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce

Minhyeong Yu; Seunghyun Lee; Wonduk Seo

doi:10.1609/aaaiss.v9i1.42935

EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce

Authors

Minhyeong Yu Enhans, Seoul, South Korea
Seunghyun Lee Enhans, Seoul, South Korea
Wonduk Seo Enhans, Seoul, South Korea Peking University, Beijing, China

DOI:

https://doi.org/10.1609/aaaiss.v9i1.42935

Abstract

Product mapping decides whether two e-commerce listings refer to the same product. We propose EPM-RL, an on-premise framework that distills LLM comparison reasoning into a local model via PEFT and RL, improving balanced matching under privacy and cost constraints. On an internal benchmark, EPM-RL yields the strongest F1 by boosting recall while maintaining competitive precision.

2026 AAAI Summer Symposium Proceedings Cover

Downloads

Published

2026-06-23

How to Cite

Yu, M., Lee, S., & Seo, W. (2026). EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce. Proceedings of the AAAI Symposium Series, 9(1), 227–227. https://doi.org/10.1609/aaaiss.v9i1.42935

Download Citation

Issue

Vol. 9 No. 1: Proceedings of the 2026 AAAI Summer Symposium Series

Section

AI in Business: Intelligent Transformation and Management (Extended Abstracts)

EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information