Can PDF-XChange Editor directly call many online OCRs, such as Baidu OCR, to replace the pre-installed OCR in the software? Because some SHX English and Chinese font PDFs generated by AutoCAD, the recognition effect of using the preset OCR is terrible, and the ABBYY FineReader PDF is also very poor, but the effect of using Baidu OCR is very good, and the accuracy is almost 95%!
PDF Documentation for Experiments
https://drive.google.com/file/d/1525PS8Sth97vWBed4TWhx9ieTUA9dtD-/view?usp=share_link
Can you directly call many online OCRs, such as Baidu OCR.
Moderators: TrackerSupp-Daniel, Tracker Support, Paul - Tracker Supp, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan
-
- User
- Posts: 5
- Joined: Thu Jan 19, 2023 11:45 am
-
- Site Admin
- Posts: 17960
- Joined: Mon Jan 12, 2009 8:07 am
- Location: London
Re: Can you directly call many online OCRs, such as Baidu OCR.
Hello softschool,
No - I am afraid that because OCR is quite a resource heavy process - it has to be performed on your machine.
If you need to use an online tool - you can use the Editor to export images of the original PDF pages (with a customizable resolution), and then pass those to the online OCR.
Have you tried the different settings options in the Editor's OCR window - are all of them producing bad results?
Do you have a sample file you could share?
Kind regards,
Stefan
No - I am afraid that because OCR is quite a resource heavy process - it has to be performed on your machine.
If you need to use an online tool - you can use the Editor to export images of the original PDF pages (with a customizable resolution), and then pass those to the online OCR.
Have you tried the different settings options in the Editor's OCR window - are all of them producing bad results?
Do you have a sample file you could share?
Kind regards,
Stefan
-
- User
- Posts: 5
- Joined: Thu Jan 19, 2023 11:45 am
Re: Can you directly call many online OCRs, such as Baidu OCR.
https://drive.google.com/file/d/1525PS8Sth97vWBed4TWhx9ieTUA9dtD-/view?usp=share_link
You can try this PDF document, neither PDF-XChange nor ABBYY FineReader can recognize it well, I know Baidu OCR can recognize it well, but it is not an application or a web page, it is just a free or paid cloud service!
This is the Chinese introduction page
Baidu OCR general text recognition (high precision version with location)
https://ai.baidu.com/ai-doc/OCR/tk3h7y2aq
You can try this PDF document, neither PDF-XChange nor ABBYY FineReader can recognize it well, I know Baidu OCR can recognize it well, but it is not an application or a web page, it is just a free or paid cloud service!
This is the Chinese introduction page
Baidu OCR general text recognition (high precision version with location)
https://ai.baidu.com/ai-doc/OCR/tk3h7y2aq
-
- Site Admin
- Posts: 17960
- Joined: Mon Jan 12, 2009 8:07 am
- Location: London
Re: Can you directly call many online OCRs, such as Baidu OCR.
Hello softschool,
As you said - it is a cloud tool - so there is a server somewhere that has enough processing power to OCR the content you upload to this service. We do not have plans to include another OCR engine in our products for now, and I am sorry if the ones available are not getting correct recognition of your files!
Kind regards,
Stefan
As you said - it is a cloud tool - so there is a server somewhere that has enough processing power to OCR the content you upload to this service. We do not have plans to include another OCR engine in our products for now, and I am sorry if the ones available are not getting correct recognition of your files!
Kind regards,
Stefan
-
- User
- Posts: 5
- Joined: Thu Jan 19, 2023 11:45 am
Re: Can you directly call many online OCRs, such as Baidu OCR.
https://drive.google.com/file/d/1525PS8Sth97vWBed4TWhx9ieTUA9dtD-/view?usp=share_link
This is a PDF document generated by AutoCAD, and because it uses SHX fonts, even ABBYY cannot recognize it. Can you contact ABBYY to add recognition of this font to improve the recognition of your OCR engine?
Because there are so many PDF documents in this AutoCAD format, our translation industry often has to face this kind of PDF documents. It is very difficult to translate and typesetting. The most troublesome thing is the work of converting OCR into real text. I hope you can provide The solution for this!
This is a PDF document generated by AutoCAD, and because it uses SHX fonts, even ABBYY cannot recognize it. Can you contact ABBYY to add recognition of this font to improve the recognition of your OCR engine?
Because there are so many PDF documents in this AutoCAD format, our translation industry often has to face this kind of PDF documents. It is very difficult to translate and typesetting. The most troublesome thing is the work of converting OCR into real text. I hope you can provide The solution for this!
-
- Site Admin
- Posts: 6903
- Joined: Wed Mar 25, 2009 10:37 pm
- Location: Chemainus, Canada
Re: Can you directly call many online OCRs, such as Baidu OCR.
Hi softschool,
that is indeed a heavy OCR job! I will pass this on to the team to see what they think we or ABBYY can do.
warm regards
that is indeed a heavy OCR job! I will pass this on to the team to see what they think we or ABBYY can do.
warm regards
Best regards
Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com
Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com