Peter,

 

The COS output is horrible formatted,  so I read only first line ;-)

It uses FontFile3 stream.

FontFile3 stream contains font either in Compact Font Format ( CFF) or
OpenType Format (OTF) 

which are not supported by java.

The font name is "FKAJPF+AdvOT3b30f6db.B" which means that it is subset font
of font named "AdvOT3b30f6db.B".

I don't know exactly how PdfBox handles CFF/OpenType fonts, probably they
just search for surrogate font (by name) or some kind default font (since I
never saw such horrible font name in system fonts).

 

I don't know if this is really useful for you. 

I also have no idea why font name/style are not set.

It may be nevertheless valid font.

 

BTW The only way to make java understand CFF/OTF fonts is to convert them to
Type1 fonts.

I doubt that there are any free java program which could do it.

 

/ (I managed to write parser for CFF fonts, but still have to dig into Type1
font format, however my to do list is really long and Type1 format in not on
first place ;-))

 

Best Regards

 

Andrey

 

 

 

 

Von: [email protected]
[mailto:[email protected]] Im Auftrag von Peter Murray-Rust
Gesendet: Montag, 7. Mai 2012 13:15
An: Andrey Kuznetsov
Betreff: Re: Extracting vector graphics from PDF

 

Andrey,
I have added 
System.err.println(fontDictionary.toString());
to the constructor.

I attach my output; (I assume that it is too large to post to the list).

In case it helps it is the first 2 pages of:
http://www.biomedcentral.com/content/pdf/1471-2148-11-371.pd
<http://www.biomedcentral.com/content/pdf/1471-2148-11-371.pdf> f (which has
no copyright restrictions)




-- 
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069

Reply via email to