Generative Pipeline for Data Augmentation of Unconstrained Document Images with Structural and Textural Degradation (Student Abstract)

Arnab Poddar; Abhishek Kumar Sah; Soumyadeep Dey; Pratik Jawanpuria; Jayanta Mukhopadhyay; Prabir Kumar Biswas

doi:10.1609/aaai.v37i13.27009

Generative Pipeline for Data Augmentation of Unconstrained Document Images with Structural and Textural Degradation (Student Abstract)

Authors

Arnab Poddar Indian Institute of Technology Kharagpur
Abhishek Kumar Sah Indian Institute of Technology Kharagpur
Soumyadeep Dey Microsoft R & D India, Hyderabad
Pratik Jawanpuria Microsoft R & D India, Hyderabad
Jayanta Mukhopadhyay Indian Institute of Technology Kharagpur
Prabir Kumar Biswas Indian Institute of Technology Kharagpur

DOI:

https://doi.org/10.1609/aaai.v37i13.27009

Keywords:

Computer Vision, Data Augmentation, Document Images, Handwritten Manuscripts, Generative Adversarial Networks, Optical Character Recognition, Text Image

Abstract

Computer vision applications for document image understanding (DIU) such as optical character recognition, word spotting, enhancement etc. suffer from structural deformations like strike-outs and unconstrained strokes, to name a few. They also suffer from texture degradation due to blurring, aging, or blotting-spots etc. The DIU applications with deep networks are limited to constrained environment and lack diverse data with text-level and pixel-level annotation simultaneously. In this work, we propose a generative framework to produce realistic synthetic handwritten document images with simultaneous annotation of text and corresponding pixel-level spatial foreground information. The proposed approach generates realistic backgrounds with artificial handwritten texts which supplements data-augmentation in multiple unconstrained DIU systems. The proposed framework is an early work to facilitate DIU system-evaluation in both image quality and recognition performance at a go.

Downloads

Published

2024-07-15

How to Cite

Poddar, A., Sah, A. K., Dey, S., Jawanpuria, P., Mukhopadhyay, J., & Biswas, P. K. (2024). Generative Pipeline for Data Augmentation of Unconstrained Document Images with Structural and Textural Degradation (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16298-16299. https://doi.org/10.1609/aaai.v37i13.27009

Download Citation

Issue

Vol. 37 No. 13: AAAI-23 Special Programs, IAAI-23, EAAI-23, Student Papers and Demonstrations

Section

AAAI Student Abstract and Poster Program

Generative Pipeline for Data Augmentation of Unconstrained Document Images with Structural and Textural Degradation (Student Abstract)

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription