BAR — A Reinforcement Learning Agent for Bounding-Box Automated Refinement

Morgane Ayle; Jimmy Tekli; Julia El-Zini; Boulos El-Asmar; Mariette Awad

doi:10.1609/aaai.v34i03.5639

Authors

Morgane Ayle American University of Beirut
Jimmy Tekli BMW Group
Julia El-Zini American University of Beirut
Boulos El-Asmar BMW Group
Mariette Awad American University of Beirut

DOI:

https://doi.org/10.1609/aaai.v34i03.5639

Abstract

Research has shown that deep neural networks are able to help and assist human workers throughout the industrial sector via different computer vision applications. However, such data-driven learning approaches require a very large number of labeled training images in order to generalize well and achieve high accuracies that meet industry standards. Gathering and labeling large amounts of images is both expensive and time consuming, specifically for industrial use-cases. In this work, we introduce BAR (Bounding-box Automated Refinement), a reinforcement learning agent that learns to correct inaccurate bounding-boxes that are weakly generated by certain detection methods, or wrongly annotated by a human, using either an offline training method with Deep Reinforcement Learning (BAR-DRL), or an online one using Contextual Bandits (BAR-CB). Our agent limits the human intervention to correcting or verifying a subset of bounding-boxes instead of re-drawing new ones. Results on a car industry-related dataset and on the PASCAL VOC dataset show a consistent increase of up to 0.28 in the Intersection-over-Union of bounding-boxes with their desired ground-truths, while saving 30%-82% of human intervention time in either correcting or re-drawing inaccurate proposals.

BAR — A Reinforcement Learning Agent for Bounding-Box Automated Refinement

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription