We are using your OCR component in our ASP.NET application. Everything works correctly for us; however, we are concerned about the low performance of the OCR_MakeSearchable method. We compared your product to the Quick Scan Pro (QSP) solution, and QSP was definitely faster.
OCRing the file below (it is converted to PDF before OCRing) using the OCR_MakeSearchable() method takes about 3 minutes, which is quite long. For comparison, QSP processed the same document in about 20 seconds.
Is there any way to make this method faster? Are you planning to improve performance in the next release?
Our PXO_Options are:
OCR.PXO_Options options = new OCR.PXO_Options();
options.blacklist = String.Empty;
options.whitelist = String.Empty;
options.DataPath = OcrUtility.GetLanguagesDirectory();
options.ImageFlags = (uint)OCR.OCR_ImageProcessingFlags.OCR_Image_SuppressOutput;
options.lang = OCR.PXO_Language.PXO_English;
options.raster_dpi = 300;
options.RegionMode = OCR.OCR_RegionMode.OCR_Auto;
options.reserved = 0;
PS. When are you going to release a new version of the ocrtools? We are looking forward to two new features. The first is full orientation detection while OCRing. The second is the new functionality that places only the text layer into the original PDF file. Currently, we work around this by using the OCR_Image_SuppressOutput setting and the PlaceContents() method from xcpro40.dll. Unfortunately, this workaround prevents us from using the rotation mode.
Thanks in advance and best regards,
Igor