PDF/A and MakeSearchable

PDF-X OCR SDK is a New product from us and intended to compliment our existing PDF and Imaging Tools to provide the Developer with an expanding set of professional tools for Optical Character Recognition tasks

Moderators: TrackerSupp-Daniel, Tracker Support, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Tracker Supp-Stefan

Post Reply
Dorwol
User
Posts: 275
Joined: Mon Aug 04, 2008 5:04 pm

PDF/A and MakeSearchable

Post by Dorwol »

After I scan some pages with "SXC_EnableSource", "PXC_NewDocumentEx" and so on I have a valid "PDF/A" Scan-Result.

OK, but how can I make this document now searchable and still valid as "PDF/A" !?!?

After "RunOCR..." a new PDF will generated and this document is not a "PDF/A".

So please help!
User avatar
John - Tracker Supp
Site Admin
Posts: 5219
Joined: Tue Jun 29, 2004 10:34 am
Location: United Kingdom
Contact:

Re: PDF/A and MakeSearchable

Post by John - Tracker Supp »

Hi,

can you please post a 'before and after' sample PDF - zipped, and can you advise if you are using our PDF SDK engine or some other ?

thanks
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com
Dorwol
User
Posts: 275
Joined: Mon Aug 04, 2008 5:04 pm

Re: PDF/A and MakeSearchable

Post by Dorwol »

John - Tracker Supp wrote: can you please post a 'before and after' sample PDF
There exist not realy a "before PDF", because I scan a paper document before I want to make it searchable.
For thist I use "PXC_NewDocumentEx". This will make a great PDF/A.
But now I must make it "Searchable" by start "RunOCR...." <- And this will steal the "PDF/A" :|
John - Tracker Supp wrote: and can you advise if you are using our PDF SDK engine or some other ?
I use only the "PDF-XChange PRO SDK" (version 5).
Last edited by Dorwol on Thu Apr 24, 2014 3:14 pm, edited 1 time in total.
User avatar
John - Tracker Supp
Site Admin
Posts: 5219
Joined: Tue Jun 29, 2004 10:34 am
Location: United Kingdom
Contact:

Re: PDF/A and MakeSearchable

Post by John - Tracker Supp »

Hi,

understood - the OCR engine is actually using an Existing PDF despite the fact you have created it using the Scanner library as it is saved and then re-opened to OCR and apply the text searchable layer - for now there is I am afraid no work around but I have spoken to the development team and we can add a 'small tweak' in the next but one build to our XCPRO40.dll library which is responsible for adding the functionality to all area's - we cannot add before as we have already completed and submitted the next release to our test team before release early in May.

I cannot give you a specific time I am afraid - but this will be approx. 8-10 weeks or so after the next release.
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com
Dorwol
User
Posts: 275
Joined: Mon Aug 04, 2008 5:04 pm

Re: PDF/A and MakeSearchable

Post by Dorwol »

Could be just because we planned the new release for the end of May.
OK, If necessary, we would have this feature replacement delivery as an update.
So please do your best for eventualy earlier 'small tweak'! :wink:
THX!
User avatar
John - Tracker Supp
Site Admin
Posts: 5219
Joined: Tue Jun 29, 2004 10:34 am
Location: United Kingdom
Contact:

Re: PDF/A and MakeSearchable

Post by John - Tracker Supp »

We will of course do our best :)
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com
User avatar
Paul - Tracker Supp
Site Admin
Posts: 6832
Joined: Wed Mar 25, 2009 10:37 pm
Location: Chemainus, Canada
Contact:

Re: PDF/A and MakeSearchable

Post by Paul - Tracker Supp »

There is a ticket (internal only) for this: RT#2466: PDF/A and MakeSearchable
Best regards

Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com
Dorwol
User
Posts: 275
Joined: Mon Aug 04, 2008 5:04 pm

Re: PDF/A and MakeSearchable

Post by Dorwol »

Meanwhile build 309 is out. But I see not a difference. Or do I anything wrong?
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17824
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: PDF/A and MakeSearchable

Post by Tracker Supp-Stefan »

Hello Dorwol,

I can see that ticket #2466 has been assigned to a developer but is not yet marked as resolved.
Guess the guys could not implement this for build 309.

Regards,
Stefan
Dorwol
User
Posts: 275
Joined: Mon Aug 04, 2008 5:04 pm

Re: PDF/A and MakeSearchable

Post by Dorwol »

OK, but what now? I need this function for my application very urgent and John says in April approx. 8-10 weeks.

Meanwhile my application is ready to release but waiting for this litte but very necessary feature. Because this is n not a "nice to have feature". Thus you will agree with me. :?
Dorwol
User
Posts: 275
Joined: Mon Aug 04, 2008 5:04 pm

Re: PDF/A and MakeSearchable

Post by Dorwol »

Is there a way in combination with your viewer SDK or with the Printer Driver or any other way to save a searchable PDF as PDF/A?
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17824
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: PDF/A and MakeSearchable

Post by Tracker Supp-Stefan »

Hello Dorwol,

Still no further news in ticket RT#2466, but to answer your latest question - yes using the PDF Tools SDK and the Drivers you should be able to create PDF/A files and they should remain searchable as well.

Regards,
Stefan
Dorwol
User
Posts: 275
Joined: Mon Aug 04, 2008 5:04 pm

Re: PDF/A and MakeSearchable

Post by Dorwol »

yes using the PDF Tools SDK and the Drivers
I can't realy believe that.

If I try to print any already seachable PDF (after scan) , the result is a image only pdf (without searchable text-layer!). So the possiblility to seach is erased.

What do I wrong please?
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17824
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: PDF/A and MakeSearchable

Post by Tracker Supp-Stefan »

Hello Dorwol,

Using the PDF Tools SDK you create the PDF/A File from images scanner.
Then OCR it - to add the text layer. This breaks the PDF/A compatibility.
Now using this non PDF/A file as source - copy the whole page content into a new PDF/A file using the PDF Tools SDK again.

Regards,
Stefan
Post Reply