Tom's Guide | Tom's Hardware | Tom's Games
![]() |
![]() |
![]() |
Here is a PDF that opens correctly and the text is shown well, but when you copy the text from it and paste it in another text editor (wordpad, notepad, ms word)the characters appear in squares (since the glyphs are not found)
I searched for the embedded font and installed it but even using that specific font the text still appears in squares.
And the embedded font has an extra Identity-H suffix and in the font properties of the pdf it shows that it is encoded in Identity-H.
Now the question is this:
Can I in any way either (1)extract this embedded subset font and use it in a text editor to make the text appear well or (2)can I in any way encode the font that I found to Identity-H or (3)find a way to convert the resulted text to the regular UTF-8 format? [without having to program a separate application to do it ;-)]

What pdf program are you using? Adobe Reader has the option to Save As a Text file.
"So won’t you give this man his wings
What a shame
To have to beg you to see
We’re not all the same
What a shame" - Shinedown

I have tried to save as text but the text is not preserved (probably because of the special font and encoding)
As I said before, I can copy the text to another document but it results in square characters!

That's because pdf files are not text files. They're "closer" to graphic files.
Plain text files will not show any special formatting, fonts, etc. That's the nature of text files.
"So won’t you give this man his wings
What a shame
To have to beg you to see
We’re not all the same
What a shame" - Shinedown

Oh,(hitting my head against the wall [just kidding])
I know what are pdf, rtf,txt ...
for example this is a partial result of pasting and saving it in rtf format and viewing the source:
viewkind4\uc1\pard\ltrpar\b\f0\fs33\u-9280?\u-9092?\u-9280?\u-9029?\
it looks like to be Unicode but it isn't, maybe because of some special encoding.

Did you Google? http://www.google.com/search?q=extr...
"So won’t you give this man his wings
What a shame
To have to beg you to see
We’re not all the same
What a shame" - Shinedown

Yeah, I have done the exact search and visited the exact two first and the last results in that page, and they are just confusing and that is why I am here (I just tried and downloaded some tools, converted the pdf to postscript .ps, ... but what!!? It is all confusing and no straight-forward application for it, and don't think about fontforge! [maybe think about it if I had a linux os installed but no way I could run it under windows, arrrrgh)
So I want a kind person to answer the three questions in my first post. :)

![]() |
Problems with WORD
|
Viewing metadata Office 0...
|

This post is quite old and has been locked from receiving new replies. Please create a new posting instead.
| Ads by Google |