Optical Character Recognition

This Help topic refers to the following editions:

þ Enterprise þProfessional þ Personal þ Small Business

 

 

OCR, or optical character recognition, converts image data to text to edit, cut and paste, reformat, print, and send. The OCR process evaluates images, such as scanned text, a fax, or a graphic, and recognizes which shapes constitute characters and which characters form words.

 

To OCR an Image Document:

You can save converted text as Microsoft Word for Windows/RTF (Rich Text Format), WordPerfect, HTML, or unformatted text.

You can convert an entire document with OCR, retaining the page layout and picture placement. Alternatively, you can select an area on a page or a group of pages for OCR.

Users can specify automatic training of the OCR process , or monitor it interactively, correcting questionable characters and saving them in a training file to improve word recognition in subsequent OCR sessions. You can also create dictionaries of words, proper names, or acronyms to use during OCR to improve recognition accuracy.

 

Tip:

OCR should not be used as the basic indexing methodology. To best use the OCR feature use it only when required for either indexing or to convert text for editing and inclusion in other documents.

PDF documents can only be OCR'd in Adobe Acrobat. The OCR text from a PDF document is proprietary and not full text searchable.