Computing.Net > Forums > Office Software > PDF subset font extract?

Computer Problems? Computing.Net has over 1,000,000 posts about all things technology related! Over 90% answered within 24 hours! Click here to start participating now! Also, be sure to check out the New User Guide.

PDF subset font extract?

Reply to Message Icon

Name: Рилэйшн (by prorelation)
Date: November 21, 2008 at 02:29:27 Pacific
OS: XP SP3
CPU/Ram: P4 2.4/512
Comment:

Here is a PDF that opens correctly and the text is shown well, but when you copy the text from it and paste it in another text editor (wordpad, notepad, ms word)the characters appear in squares (since the glyphs are not found)

I searched for the embedded font and installed it but even using that specific font the text still appears in squares.

And the embedded font has an extra Identity-H suffix and in the font properties of the pdf it shows that it is encoded in Identity-H.

Now the question is this:
Can I in any way either (1)extract this embedded subset font and use it in a text editor to make the text appear well or (2)can I in any way encode the font that I found to Identity-H or (3)find a way to convert the resulted text to the regular UTF-8 format? [without having to program a separate application to do it ;-)]



Sponsored Link
Ads by Google

Response Number 1
Name: Jennifer SUMN
Date: November 21, 2008 at 05:01:20 Pacific
Reply:

What pdf program are you using? Adobe Reader has the option to Save As a Text file.

"So won’t you give this man his wings
What a shame
To have to beg you to see
We’re not all the same
What a shame" - Shinedown


0

Response Number 2
Name: Рилэйшн (by prorelation)
Date: November 21, 2008 at 07:26:10 Pacific
Reply:

I have tried to save as text but the text is not preserved (probably because of the special font and encoding)
As I said before, I can copy the text to another document but it results in square characters!


0

Response Number 3
Name: Jennifer SUMN
Date: November 21, 2008 at 11:28:09 Pacific
Reply:

That's because pdf files are not text files. They're "closer" to graphic files.

Plain text files will not show any special formatting, fonts, etc. That's the nature of text files.

"So won’t you give this man his wings
What a shame
To have to beg you to see
We’re not all the same
What a shame" - Shinedown


0

Response Number 4
Name: Рилэйшн (by prorelation)
Date: November 21, 2008 at 11:57:11 Pacific
Reply:

Oh,(hitting my head against the wall [just kidding])
I know what are pdf, rtf,txt ...
for example this is a partial result of pasting and saving it in rtf format and viewing the source:
viewkind4\uc1\pard\ltrpar\b\f0\fs33\u-9280?\u-9092?\u-9280?\u-9029?\
it looks like to be Unicode but it isn't, maybe because of some special encoding.


0

Response Number 5
Name: Jennifer SUMN
Date: November 21, 2008 at 12:14:09 Pacific
Reply:

Did you Google? http://www.google.com/search?q=extr...

"So won’t you give this man his wings
What a shame
To have to beg you to see
We’re not all the same
What a shame" - Shinedown


0

Related Posts

See More



Response Number 6
Name: Рилэйшн (by prorelation)
Date: November 21, 2008 at 12:24:10 Pacific
Reply:

Yeah, I have done the exact search and visited the exact two first and the last results in that page, and they are just confusing and that is why I am here (I just tried and downloaded some tools, converted the pdf to postscript .ps, ... but what!!? It is all confusing and no straight-forward application for it, and don't think about fontforge! [maybe think about it if I had a linux os installed but no way I could run it under windows, arrrrgh)
So I want a kind person to answer the three questions in my first post. :)


0

Sponsored Link
Ads by Google
Reply to Message Icon

Problems with WORD Viewing metadata Office 0...



Post Locked

This post is quite old and has been locked from receiving new replies. Please create a new posting instead.


Go to Office Software Forum Home


Sponsored links

Ads by Google


Results for: PDF subset font extract?

How to change font in PDF www.computing.net/answers/office/how-to-change-font-in-pdf/9208.html

How do i reduce/compress a PDF ? www.computing.net/answers/office/how-do-i-reducecompress-a-pdf-/5980.html

Outlook 2003 - Sent Items lost www.computing.net/answers/office/outlook-2003-sent-items-lost/4710.html