Translate
Francais Español Deutsch
 
 
   
Developers Product Chooser PDF Choosing Wizard Feature Comparison Chart Image Product Chooser Image Comparison Chart

Do it all with PDF-XChange PRO

A product suite that includes all of the above PDF products and allows you to do pretty much whatever you want with PDFs

PRO PDF

PDF-XChange Viewer

PDF-XChange Lite

PDF-XChange Standard

PDF-Tools

Imaging Products


Raster-XChange


Tiff-XChange

SDKs for PDFs: SDKs for Imaging: Clarion® SDKs:

PDF Related

Imaging Products

Clarion® Developers now have the choice of using our generic SDKs, or downloading our Clarion® specific SDKs which include extra Clarion® classes, templates and examples for fast and easy integration into their Clarion® based applications.
Knowledgebase

Knowledgebase

How do I OCR a document?

Knowledge Base Article: KB351 - Created On: Dec 15, 2011 09:01 AM - Last Modified: May 2, 2012 06:23 AM

 
Challenge

 I have an image based document I would like to convert to a text searchable document. 

 

Note, this will create documents searchable and text selectable(not including font and format). It will NOT make it a fully editable text based documents. 

Resolution

 First open the PDF document you would like to OCR. Go to the Document menu and choose the OCR icon or OCR icon : OCR Pages... You may also press Ctrl+Shift+C.

 

 

Both these actions will open the OCR Pages dialog.

 

The OCR Pages dialog is split into three sections Page Range, Recognition, and Output.

 

Page Range - 

 

All - This option will select all pages in the document.

Current Page - This option will select the current page.

Pages - Allows you to choose a selection and/or range of pages.  

 

Recognition -

Primary Language - This option allows you to specify the language used in the OCR process. If the language you need is not listed in the drop down menu you can press the More Languages link to navigate to the Languages download page.

Accuracy - This option allows you to set the resolution of the OCR scan. The lower the setting, the smaller the file size. As well as a shorter processing time and diminished Accuracy.

 

Output -

Preserve Original Content & Add Text Layer - This option will not change the content of the target document but creates another layer within the document and overlays the text over the image of the text.

 Convert Page Content to Image only - Add Text As a Layer - This option will take a document that has both images as well as text and convert them both into a consolidated  image. This option also enables the Images Quality option which dictates the end resolution of the consolidated image. When this process is performed on an image only document, it doesn't make changes to the image other than to adjust the resulting dpi.

WARNING!! This option is destructive. It will recreate your document as a new image and in the process cast away the original document. Once saved, this is IRREVERSIBLE, so please use a copy of your original document when using this option.

 

Finally - Press Ok and your document will be processed.

** By default the PDF-XChange Viewer build 200 or later will install base OCR language support for English, French, German and Spanish. Additional Language Packs are available here: http://www.tracker-software.com/pdf-xchange-viewer-ocr

 

Vote

Was this article helpful?

More Like This

   KB#279: I have a problem with copying text from a PDF document.

   KB#148: How do I copy and paste text boxes and sticky notes?

   KB#203: How do I highlight things on a document that contains only images?

   KB#238: How do I reduce the process time of adding a background image to every page in a large document?

   KB#278: How do I paste images into a PDF document?