The invention described herein provides a method and apparatus for document processing that efficiently separates and interrelates individual modalities, such as text, handwriting, and images. In particular, the invention begins by recognizing text characters and words, then separates text paragraphs from images while maintaining the relationships between them, so that the original page can be reconstructed. Text separation and extraction are based on a hierarchical framing process: the process frames each single character after recognizing it, continues with the recognition and framing of each word, and ends with the framing of all text lines. The method and apparatus can process different types of documents, including typed, handwritten, skewed, and mixed documents, but not half-tone ones.
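
The hierarchical framing idea (character frames, then word frames, then line frames) can be illustrated with a minimal sketch. This is not the patented procedure: it assumes character recognition has already produced an axis-aligned bounding box per character, that the page has been deskewed, and that a simple horizontal-gap rule decides word membership. The names `Frame`, `enclose`, `merge`, `frame_page`, and the `word_gap` threshold are all hypothetical, introduced only for this illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Frame:
    """Axis-aligned box: (x0, y0) is top-left, (x1, y1) is bottom-right."""
    x0: int
    y0: int
    x1: int
    y1: int


def enclose(frames: List[Frame]) -> Frame:
    """Return the smallest frame that contains every frame in the list."""
    return Frame(
        min(f.x0 for f in frames),
        min(f.y0 for f in frames),
        max(f.x1 for f in frames),
        max(f.y1 for f in frames),
    )


def merge(frames: List[Frame], max_gap: Optional[int]) -> List[Frame]:
    """Greedily merge frames left to right.

    A frame joins an existing group when the two overlap vertically and the
    horizontal gap between them is at most max_gap (None means any gap).
    Assumption: the text is not skewed, so vertical overlap identifies a row.
    """
    groups: List[Frame] = []
    for f in sorted(frames, key=lambda fr: fr.x0):
        for i, g in enumerate(groups):
            same_row = f.y0 < g.y1 and g.y0 < f.y1
            near = max_gap is None or f.x0 - g.x1 <= max_gap
            if same_row and near:
                groups[i] = enclose([g, f])
                break
        else:
            # No compatible group was found, so this frame starts a new one.
            groups.append(f)
    # Report frames in reading order: top to bottom, then left to right.
    return sorted(groups, key=lambda g: (g.y0, g.x0))


def frame_page(chars: List[Frame], word_gap: int = 3) -> Tuple[List[Frame], List[Frame]]:
    """Build word frames from character frames, then line frames from words."""
    words = merge(chars, word_gap)   # level 2: characters -> words
    lines = merge(words, None)       # level 3: words -> text lines
    return words, lines


if __name__ == "__main__":
    # Hypothetical character boxes: two words on one line, one word below.
    chars = [
        Frame(0, 0, 5, 10), Frame(6, 0, 11, 10),
        Frame(20, 0, 25, 10), Frame(26, 0, 31, 10),
        Frame(0, 20, 5, 30), Frame(6, 20, 11, 30),
    ]
    words, lines = frame_page(chars)
    print(words)
    print(lines)
```

Because each level stores only the enclosing frame of the level below, the positions of the text lines relative to the rest of the page are preserved, which is what a later reconstruction of the original layout would rely on. Skewed or handwritten input would need a more tolerant grouping rule than the vertical-overlap test assumed here.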

 
