Hello Hans,

PDF text extraction also works for us. But we need the correct text
inside the SWF, and that text is not correct. Just check the output of
swfstrings page_00010.swf: there are characters missing.
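
A minimal sketch of how that check could be automated, assuming
swfstrings (part of SWFTools) is on the PATH. The helper names are made
up for illustration, and whether the item number from my original report
sits on this particular page is also an assumption:

```python
import subprocess


def missing_strings(extracted_text, expected):
    """Return the expected strings that do not occur in the extracted text."""
    return [item for item in expected if item not in extracted_text]


def swfstrings_output(swf_path):
    """Run swfstrings on one SWF file and return what it prints as text."""
    result = subprocess.run(
        ["swfstrings", swf_path], capture_output=True, check=True
    )
    # Decode leniently so damaged characters do not abort the check.
    return result.stdout.decode("utf-8", errors="replace")
```

Usage would be something like
missing_strings(swfstrings_output("page_00010.swf"), ["09500214"]);
a non-empty result means the text did not survive the conversion.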

Greetings
Mike


Kind regards
scireum GmbH

Michael Haufler
Managing Director

-------------------------------------------------------------------------------------------------------
scireum GmbH, Alfred-Klingele-Straße 6, 73630 Remshalden

Our service, happy to help:
Sales: (07151) 20637 10
Support: (07151) 20637 20 - E-Mail: supp...@scireum.net

Phone: (07151) 20637-11 -  E-Mail: m...@scireum.de
Fax: (07151) 20637-19 -  Internet: http://www.scireum.de

Managing Directors: Michael Haufler, Andreas Haufler
Stuttgart District Court (Amtsgericht Stuttgart), HRB 732171


2014/1/14 Hans J Nuecke <hnue...@vservu.de>

> Hello Michael,
> I'm not sure I fully understood your issue, but probably your "-vvv" creates
> the problem.
> To have triple verbosity I think the correct use is "-v -v -v" instead of
> "-vvv".
>
> I converted the page and created a search index file (the pure text saved
> in a .txt file; we use that for search), and both look OK; see attached.
> The text was extracted with pdftotext (part of xpdf), with the setting
> "-enc UTF-8".
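
For reference, the extraction step described above could be scripted
roughly like this. It is only a sketch: the file names and the helper
names are hypothetical, and restricting the run to one page with
pdftotext's -f/-l options is an addition that the message does not
mention.

```python
import subprocess


def pdftotext_command(pdf_path, txt_path, page=None):
    """Build the pdftotext argument list with UTF-8 output encoding."""
    cmd = ["pdftotext", "-enc", "UTF-8"]
    if page is not None:
        # -f / -l select the first and last page to convert.
        cmd += ["-f", str(page), "-l", str(page)]
    cmd += [pdf_path, txt_path]
    return cmd


def extract_text(pdf_path, txt_path, page=None):
    """Run pdftotext; raises CalledProcessError if the tool fails."""
    subprocess.run(pdftotext_command(pdf_path, txt_path, page), check=True)
```

Note that this only yields the search index text; it says nothing about
the text stored inside the SWF itself.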
>
> Hopefully this helps ;-)
> Regards
> Hans
>
> On 14.01.2014 16:40, Michael Haufler (scireum GmbH) wrote:
>
> Hello All,
>
>  we have a strange issue with pdf2swf 0.9.2
> with this doc: http://m.scireum.de/cid-fonts.pdf
> The PDF contains item numbers like this one: 09500214
>  From the source PDF I can copy + paste them perfectly.
>  However, when I convert the PDF to SWF, the text information is partially
> lost.
>
>  I already inspected pdf2swf with the -vvv option and in the log
> everything seems right.
>
>  But when I check the text with swfstrings I get some Unicode crap
> instead of the original item number.
>
>  Here is the part of the log for the second item on the page with the
> text:
>  "LSPG       1 09500214       105263    36"
>
>  VERBOSE Updating font to OBIGPM+Corbel-Italic-60-0
> VERBOSE Updating font to XJVNLU+Corbel-62-0
> TRACE   beginString(L) render=0
> DEBUG   drawChar(102.047200,205.512000,c='L' (76), u=76 <1> 'L') CID=0 
> render=0 glyphid=20 font=0xc3d7d0 size=0.007471
> TRACE   Placing shape ID 32
> TRACE   Drawing char 20 in font 5 at 1,0 in color 000000ff
> TRACE   endString() render=0 textstroke=(nil)
> TRACE   beginString(SPG) render=0
> DEBUG   drawChar(106.025200,205.512000,c='S' (83), u=83 <1> 'S') CID=0 
> render=0 glyphid=24 font=0xc3d7d0 size=0.007471
> TRACE   Drawing char 24 in font 5 at 94,0 in color 000000ff
> DEBUG   drawChar(110.248000,205.512000,c='P' (80), u=80 <1> 'P') CID=0 
> render=0 glyphid=22 font=0xc3d7d0 size=0.007471
> TRACE   Drawing char 22 in font 5 at 194,0 in color 000000ff
> DEBUG   drawChar(114.608500,205.512000,c='G' (71), u=71 <1> 'G') CID=0 
> render=0 glyphid=16 font=0xc3d7d0 size=0.007471
> TRACE   Drawing char 16 in font 5 at 296,0 in color 000000ff
> TRACE   endString() render=0 textstroke=(nil)
> VERBOSE Updating font to ERHDFG+Corbel-58-0
> TRACE   beginString(.w) render=0
> DEBUG   drawChar(426.070600,205.512000,c='w' (887), u=49 <1> '1') CID=1 
> render=0 glyphid=34 font=0xc21470 size=0.007471
> TRACE   Drawing char 5 in font 4 at 7625,0 in color 000000ff
> TRACE   endString() render=0 textstroke=(nil)
> TRACE   beginString(.v...{.v.v.x.w.z) render=0
> DEBUG   drawChar(442.105000,205.512000,c='v' (886), u=48 <1> '0') CID=1 
> render=0 glyphid=33 font=0xc21470 size=0.007471
> TRACE   Drawing char 40 in font 4 at 8002,0 in color 000000ff
> DEBUG   drawChar(446.029450,205.512000,c=' ' (895), u=57 <1> '9') CID=1 
> render=0 glyphid=42 font=0xc21470 size=0.007471
> TRACE   Drawing char 13 in font 4 at 8094,0 in color 000000ff
> DEBUG   drawChar(449.953900,205.512000,c='{' (891), u=53 <1> '5') CID=1 
> render=0 glyphid=38 font=0xc21470 size=0.007471
> TRACE   Drawing char 9 in font 4 at 8187,0 in color 000000ff
> DEBUG   drawChar(453.878350,205.512000,c='v' (886), u=48 <1> '0') CID=1 
> render=0 glyphid=33 font=0xc21470 size=0.007471
> TRACE   Drawing char 40 in font 4 at 8279,0 in color 000000ff
> DEBUG   drawChar(457.802800,205.512000,c='v' (886), u=48 <1> '0') CID=1 
> render=0 glyphid=33 font=0xc21470 size=0.007471
> TRACE   Drawing char 40 in font 4 at 8371,0 in color 000000ff
> DEBUG   drawChar(461.727250,205.512000,c='x' (888), u=50 <1> '2') CID=1 
> render=0 glyphid=35 font=0xc21470 size=0.007471
> TRACE   Drawing char 41 in font 4 at 8464,0 in color 000000ff
> DEBUG   drawChar(465.651700,205.512000,c='w' (887), u=49 <1> '1') CID=1 
> render=0 glyphid=34 font=0xc21470 size=0.007471
> TRACE   Drawing char 5 in font 4 at 8556,0 in color 000000ff
> DEBUG   drawChar(469.576150,205.512000,c='z' (890), u=52 <1> '4') CID=1 
> render=0 glyphid=37 font=0xc21470 size=0.007471
> TRACE   Drawing char 42 in font 4 at 8648,0 in color 000000ff
> TRACE   endString() render=0 textstroke=(nil)
> TRACE   beginString(.w.v.{.x.|.y) render=0
> DEBUG   drawChar(518.306650,205.512000,c='w' (887), u=49 <1> '1') CID=1 
> render=0 glyphid=34 font=0xc21470 size=0.007471
> TRACE   Drawing char 5 in font 4 at 9795,0 in color 000000ff
> DEBUG   drawChar(522.231100,205.512000,c='v' (886), u=48 <1> '0') CID=1 
> render=0 glyphid=33 font=0xc21470 size=0.007471
> TRACE   Drawing char 40 in font 4 at 9887,0 in color 000000ff
> DEBUG   drawChar(526.155550,205.512000,c='{' (891), u=53 <1> '5') CID=1 
> render=0 glyphid=38 font=0xc21470 size=0.007471
> TRACE   Drawing char 9 in font 4 at 9980,0 in color 000000ff
> DEBUG   drawChar(530.080000,205.512000,c='x' (888), u=50 <1> '2') CID=1 
> render=0 glyphid=35 font=0xc21470 size=0.007471
> TRACE   Drawing char 41 in font 4 at 10072,0 in color 000000ff
> DEBUG   drawChar(534.004450,205.512000,c='|' (892), u=54 <1> '6') CID=1 
> render=0 glyphid=39 font=0xc21470 size=0.007471
> TRACE   Drawing char 10 in font 4 at 10164,0 in color 000000ff
> DEBUG   drawChar(537.928900,205.512000,c='y' (889), u=51 <1> '3') CID=1 
> render=0 glyphid=36 font=0xc21470 size=0.007471
> TRACE   Drawing char 7 in font 4 at 10257,0 in color 000000ff
> TRACE   endString() render=0 textstroke=(nil)
> TRACE   beginString(.y.|) render=0
> DEBUG   drawChar(560.167450,205.512000,c='y' (889), u=51 <1> '3') CID=1 
> render=0 glyphid=36 font=0xc21470 size=0.007471
> TRACE   Drawing char 7 in font 4 at 10780,0 in color 000000ff
> DEBUG   drawChar(564.091900,205.512000,c='|' (892), u=54 <1> '6') CID=1 
> render=0 glyphid=39 font=0xc21470 size=0.007471
> TRACE   Drawing char 10 in font 4 at 10872,0 in color 000000ff
> TRACE   endString() render=0 textstroke=(nil)
> TRACE   endTextObject() render=0 textstroke=(nil) clipstroke=(nil)
> TRACE   saveState 0xc69e70
> DEBUG   updateLineDash, 0 dashes
> VERBOSE Updating font to ERHDFG+Corbel-58-0
>
>
>
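
The u= values in the excerpt above can be decoded mechanically to see
what text pdf2swf believes it is drawing. A rough sketch, assuming the
log keeps the beginString / drawChar / endString layout shown here (the
function name and the sample constant are made up for illustration):

```python
import re

# Captures the raw character code in parentheses and the Unicode value after "u=".
DRAW_CHAR = re.compile(r"drawChar\(.*?\((\d+)\), u=(\d+)")

SAMPLE = """\
TRACE   beginString(.y.|) render=0
DEBUG   drawChar(560.167450,205.512000,c='y' (889), u=51 <1> '3') CID=1
render=0 glyphid=36 font=0xc21470 size=0.007471
TRACE   Drawing char 7 in font 4 at 10780,0 in color 000000ff
DEBUG   drawChar(564.091900,205.512000,c='|' (892), u=54 <1> '6') CID=1
render=0 glyphid=39 font=0xc21470 size=0.007471
TRACE   Drawing char 10 in font 4 at 10872,0 in color 000000ff
TRACE   endString() render=0 textstroke=(nil)
"""


def strings_from_log(log_text):
    """Rebuild each beginString/endString run from the logged u= values."""
    strings = []
    current = None
    for line in log_text.splitlines():
        if "beginString(" in line:
            current = []
        elif "endString(" in line:
            if current is not None:
                strings.append("".join(current))
            current = None
        else:
            match = DRAW_CHAR.search(line)
            if match and current is not None:
                # group(2) is the Unicode code point from "u=".
                current.append(chr(int(match.group(2))))
    return strings


print(strings_from_log(SAMPLE))
```

For the CID font the raw character codes (886 to 895) differ from the
Unicode values (48 to 57), so it may be worth comparing both against
what swfstrings prints.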
>  This is how I converted the file:
>
> pdf2swf -T 9 -G -f -vvv catalog.pdf -o 1.swf
>
> Here is the complete log file as zipped Textfile:
> http://m.scireum.de/cid-font-log.txt.zip
>
> To speed things up a little, we offer 500 USD, paid via PayPal, to the first
> person who can solve our issue.
>
> The issue is resolved if we can convert the PDF to SWF with the correct
> text in the SWF.
>
> For further information just contact me: m...@scireum.de
>
> Greetings
>
> Michael Haufler
>
>
>
> ---------------
> SWFTools-common is a self-managed list. To subscribe/unsubscribe, or amend an 
> existing subscription, please kindly point your favourite web browser 
> at:<http://lists.nongnu.org/mailman/listinfo/swftools-common> 