POST-OCR IMAGE SEGMENTATION INTO SPATIALLY SEPARATED TEXT ZONES

United States of America Patent

APP PUB NO 20070041642A1
SERIAL NO 11465505

Abstract

This invention describes a post-recognition procedure to group text recognized by an Optical Character Reader (OCR) from a document image into zones. Once the recognized text and the corresponding word bounding boxes for each word of the text are received, the procedure described dilates (expands) these word bounding boxes by a factor and records those which cross. Two word bounding boxes will cross upon dilation if the corresponding words are very close to each other on the original document. The text is then grouped into zones using the rule that two words will belong to the same zone if their word bounding boxes cross upon dilation. The text zones thus identified are sorted and returned.
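
The grouping procedure in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the `Word` type, the function names, the default dilation factor of 1.5, the choice to scale each box about its centre, the treatment of "same zone" as a transitive relation (connected components), and the top-to-bottom, left-to-right sort order are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """A recognized word and its bounding box (hypothetical type)."""
    text: str
    left: float
    top: float
    right: float
    bottom: float


def dilate(word, factor):
    """Scale a word's box about its centre by `factor` (assumed interpretation)."""
    cx = (word.left + word.right) / 2
    cy = (word.top + word.bottom) / 2
    half_w = (word.right - word.left) / 2 * factor
    half_h = (word.bottom - word.top) / 2 * factor
    return (cx - half_w, cy - half_h, cx + half_w, cy + half_h)


def boxes_cross(a, b):
    """True if two (left, top, right, bottom) boxes overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def group_into_zones(words, factor=1.5):
    """Group words whose dilated boxes cross, then sort the zones."""
    boxes = [dilate(w, factor) for w in words]
    parent = list(range(len(words)))  # union-find forest, one node per word

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Record every pair of dilated boxes that cross and merge their groups.
    for i in range(len(words)):
        for j in range(i + 1, len(words)):
            if boxes_cross(boxes[i], boxes[j]):
                parent[find(i)] = find(j)

    zones = {}
    for i, word in enumerate(words):
        zones.setdefault(find(i), []).append(word)

    # Assumed ordering: words and zones sorted top-to-bottom, then left-to-right.
    ordered = [sorted(z, key=lambda w: (w.top, w.left)) for z in zones.values()]
    ordered.sort(key=lambda z: (min(w.top for w in z), min(w.left for w in z)))
    return ordered


sample = [
    Word("Footer", 0, 200, 50, 210),
    Word("world", 45, 0, 85, 10),
    Word("Hello", 0, 0, 40, 10),
]
zone_texts = [[w.text for w in zone] for zone in group_into_zones(sample)]
```

With the sample above, "Hello" and "world" sit 5 units apart, so their boxes cross once dilated by 1.5 and they land in one zone, while "Footer" stays in its own zone. The pairwise comparison is quadratic in the number of words; a spatial index would be the usual refinement for large pages.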

Patent Owner(s)

Patent Owner             Address
MMV FINANCE CANADA INC   95 WELLINGTON STREET WEST 22ND FLOOR TORONTO M5J 2N7

Inventor(s)

Inventor Name              Address              # of filed Patents   Total Citations
ROMANOFF, Harris Gabriel   Narberth, PA         8                    205
SINGH, Sarabjit            Uttaranchal, IN      28                   226
SPERO, Leslie              Merion Station, PA   4                    414