'Find whole words' feature doesn't work properly in version 6.0.322.7

Forum for the PDF-XChange Editor - Free and Licensed Versions

Moderators: TrackerSupp-Daniel, Tracker Support, Paul - Tracker Supp, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan

Post Reply
doquan0
User
Posts: 10
Joined: Mon Jan 11, 2016 3:48 pm

'Find whole words' feature doesn't work properly in version 6.0.322.7

Post by doquan0 »

Hi developer & suppor team

I'm using PDF-Xchange Editor version 6.0.322.7 on Windows 8.1 x64

I frequently use the feature 'find whole words' but it doesn't seem work properly in this case

This is a pdf file (included in attachments) whose content's text contains : "She's tall.". PDF-Xchange Editor returns 0 entries when I want to search for the whole word 'tall'
Cannot find whole word 'tall' 1.jpg
However, after I deselect the option 'find whole words', PDF-Xchange Editor shows me a result which is a little strange in the text: "She'stall"
Cannot find whole word 'tall' 2.jpg
I don't understand why the space between the words "She's tall" disappears in PDF-Xchange Editor's search results. Could you please fix this issue for later version?
Attachments
Cannot find whole word 'tall'.pdf
(418.54 KiB) Downloaded 65 times
Last edited by doquan0 on Thu Nov 16, 2017 11:20 pm, edited 3 times in total.
User avatar
David.P
User
Posts: 1510
Joined: Thu Feb 28, 2008 8:16 pm

Re: 'Find whole words' feature doesn't work properly in version 6.0.322.7

Post by David.P »

Hi,

it seems that your PDF has been badly OCR'ed from an image file.

That's the entire text from your file:
2 Write. Complete this dialogue.
Nga: ...isthat?
Lan: That's Nam.
Nga: No....is the girl talking to Miss Lien?
Lan: Her name's Hoa. She's a new student.
Nga: ...classis she in?
Lan: She'sinour class — class 7A.
Nga: ...doesshe live?
Lan: She lives on Tran Hung Dao
Street with her aunt and uncle.
Nga: ...doher parents live?
Lan: They live in Hue.
Nga: She'stall. ... old is she?
Lan: She's13.

*3 Ask your partner questions and complete this form.

Name:
Age:
Grade:
School:
Home address:
4\...,
N

4 Listen. Then practice with a partner.
Nam: Where do you live, Hoa?
Hoa: I live at 12 Tran Hung Dao Street.
Nam: How far is it from your house to school?
Hoa: It's not far — about one kilometer.

16
You see that there are a lot of OCR errors in the text, including the "She'stall" error.

So the problem is with your file, not with PDF-XChange Editor.

Regards and hth,
David.P
David.P
PDF-XChange Pro
doquan0
User
Posts: 10
Joined: Mon Jan 11, 2016 3:48 pm

Re: 'Find whole words' feature doesn't work properly in version 6.0.322.7

Post by doquan0 »

David.P wrote:Hi,

it seems that your PDF has been badly OCR'ed from an image file.

That's the entire text from your file:

You see that there are a lot of OCR errors in the text, including the "She'stall" error.

So the problem is with your file, not with PDF-XChange Editor.

Regards and hth,
David.P
Hi David,
How could you get the entire text in my PDF file? (I guess you have selected all and copied - pasted)

The below OCR text is what I have selected all and copied - pasted by using Adobe Acrobat (Reader) 2017.012.20098. This text seems to be fine, not containing errors like the text copied from PDF-XChange Editor. Also, PDF-XChange Editor shows the correct text in the panel "Content" (as shown in the pictures of my first post). So, I think the OCR text is good and there may be a problem in PDF-XChange Editor.

Could you please fix this issue?
Thanks a lot
2 Write. Complete this dialogue.
Nga: ... is that?
Lan: That's Nam.
Nga: No. ... is the girl talking to Miss Lien?
Lan: Her name's Hoa. She's a new student.
Nga: ... class is she in?
Lan: She's in our class — class 7A.
Nga: ... does she live?
Lan: She lives on Tran Hung Dao
Street with her aunt and uncle.
Nga: ... do her parents live?
Lan: They live in Hue.
Nga: She's tall. ... old is she?
Lan: She's 13.
*3 Ask your partner questions and complete this form.
Name:
Age:
Grade:
School:
Home address:
4\...,
N
4 Listen. Then practice with a partner.
Nam: Where do you live, Hoa?
Hoa: I live at 12 Tran Hung Dao Street.
Nam: How far is it from your house to school?
Hoa: It's not far — about one kilometer.
16
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17820
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: 'Find whole words' feature doesn't work properly in version 6.0.322.7

Post by Tracker Supp-Stefan »

Hi all,

Thanks for the comments. Actually the particular text is recognized correctly by the OCR engine - and the space is there in the contents pane.
However - there is another space (between Nga: and She's tall) that is very wide, and overlaps over the space before tall. This is causing us some problems when searching and extracting this text. I've made a ticket in our internal system for this:
#4097: Editor 322.7: Issue with text search in a file with wide characters overlaps (OCR text)
And we will work on fixing it.

The next image shows the problematic space in question:
IMG_16112017_174056_0.png
Cheers,
Stefan
User avatar
David.P
User
Posts: 1510
Joined: Thu Feb 28, 2008 8:16 pm

Re: 'Find whole words' feature doesn't work properly in version 6.0.322.7

Post by David.P »

Thank you Stefan. Fixing this could considerably improve working with copied text from OCR'ed documents!

Regards
David
:)
David.P
PDF-XChange Pro
Post Reply