Here I have tested out the OCR capabilities on a colour 300dpi scan of German text that includes both roman fonts and fraktur (blackletter) fonts.
I compared Adobe Acrobat
7.0 OCR performance with three different accuracy settings in PDF-XChange Editor
In summary, Acrobat is always much faster, but Editor is more accurate if either "Medium" or "High" accuracy is chosen:
- Acrobat. GERMAN/EXACT/600DPI: 7 seconds, very poor accuracy (Note: this has no specific fraktur recognition capability.)
- Editor, LOW ACCURACY: 60 seconds, poor accuracy
- Editor, MEDIUM ACCURACY: a 24 seconds, good accuracy
- Editor, HIGH ACCURACY: 30 seconds, good accuracy
As shown, actual accuracy of the results is practically equivalent for the first two and the last two.
Please note that times are for just one single page.
I consider 20–30 seconds to be rather slow for just one page. However, the increased accuracy of the results makes it worthwhile.
60 seconds for a single page is completely impractical, especially when the results are poor.
Due to copyright issues I am not going to post the entire document, but attached hereto are an overview of the page analysed, an enlarged view of the sample text, and various OCR results.
. As suggested also elsewhere
, the newer versions of Adobe Acrobat
can be expected to be much better than the old version (7.0) tested here!