Enhanced Optical Character Recognition by Optical Sensor Combined with BERT and Cosine Similarity Scoring (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v38i21.30483Keywords:
OCR, BERT, Cosine Similarity, NLP, CVAbstract
Optical character recognition(OCR) is the technology to identify text characters embedded within images. Conventional OCR models exhibit performance degradation when performing with noisy images. To solve this problem, we propose a novel model, which combines computer vision using optical sensor with natural language processing by bidirectional encoder representations from transformers(BERT) and cosine similarity scoring. The proposed model uses a confidence rate to determine whether to utilize optical sensor alone or BERT/cosine similarity scoring combined with the optical sensor. Experimental results show that the proposed model outperforms approximately 4.34 times better than the conventional OCR.Downloads
Published
2024-03-24
How to Cite
Moon, W., Nengroo, S., Kim, T., Lee, J., Son, S., & Har, D. (2024). Enhanced Optical Character Recognition by Optical Sensor Combined with BERT and Cosine Similarity Scoring (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23585-23586. https://doi.org/10.1609/aaai.v38i21.30483
Issue
Section
AAAI Student Abstract and Poster Program