Text Recognition of Low-resolution Document Images

Chuck Jacobs; Patrice Simard; Paul Viola; James Rinker

Text Recognition of Low-resolution Document Images

Chuck Jacobs ,
Patrice Simard ,
Paul Viola ,
James Rinker

August 2005

Published by IEEE Computer Society

Publication

Download BibTex

Cheap and versatile cameras make it possible to easily and quickly capture a wide variety of documents. However, low resolution cameras present a challenge to OCR because it is virtually impossible to do character segmentation independently from recognition. In this paper we solve these problems simultaneously by applying methods borrowed from cursive handwriting recognition. To achieve maximum robustness, we use a machine learning approach based on a convolutional neural network. When our system is combined with a language model using dynamic programming, the overall performance is in the vicinity of 80-95% word accuracy on pages captured with a 1024×768 webcam and 10-point text.

Copyright © 2005 IEEE. Reprinted from IEEE Computer Society. This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.