Hello Hans, PDF text extraction also works for us. But we need the correct text inside the SWF, and that text is not correct. Just check the output of swfstrings page_00010.swf: there are characters missing.
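A check like this could be scripted roughly as follows. This is only a sketch: it assumes the swfstrings binary from SWFTools is on the PATH and that its output can be read as UTF-8. The helper names (extract_swf_text, missing_strings) are made up for illustration.

```python
import subprocess


def extract_swf_text(swf_path):
    """Run swfstrings on a SWF file and return whatever it prints.

    Assumption: swfstrings is installed and on PATH, and its output
    is (mostly) UTF-8; undecodable bytes are replaced, not fatal.
    """
    result = subprocess.run(
        ["swfstrings", swf_path],
        capture_output=True,
        encoding="utf-8",
        errors="replace",
        check=True,
    )
    return result.stdout


def missing_strings(swf_text, expected):
    """Return the expected strings that do not occur in the extracted text."""
    return [s for s in expected if s not in swf_text]


# Hypothetical usage for the page discussed in this thread:
#   text = extract_swf_text("page_00010.swf")
#   print(missing_strings(text, ["09500214", "105263"]))
```

An empty list would mean every expected item number was found in the SWF text.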
Greetings
Mike

Kind regards

scireum GmbH
Michael Haufler
Managing Director

-------------------------------------------------------
scireum GmbH, Alfred-Klingele-Straße 6, 73630 Remshalden
Our service, happy to help:
Sales: (07151) 20637 10
Support: (07151) 20637 20 - E-Mail: supp...@scireum.net
Tel: (07151) 20637-11 - E-Mail: m...@scireum.de
Fax: (07151) 20637-19 - Internet: http://www.scireum.de
Managing Directors: Michael Haufler, Andreas Haufler
Stuttgart District Court, HRB 732171

2014/1/14 Hans J Nuecke <hnue...@vservu.de>

> Hello Michael,
>
> not sure I fully understood your issue, but probably your "-vvv" creates
> the problem. To have triple verbosity I think the correct use is
> "-v -v -v" instead of "-vvv".
>
> I converted the page and created a search index file (the pure text saved
> in a .txt file; we use that for search), and both look OK; see attached.
> The text was extracted with pdftotext (part of xpdf), with the setting
> "-enc UTF-8".
>
> Hopefully this helps ;-)
> Regards
> Hans
>
> On 14.01.2014 at 16:40, Michael Haufler (scireum GmbH) wrote:
> > Hello All,
> >
> > we have a strange issue with pdf2swf 0.9.2 with this doc:
> > http://m.scireum.de/cid-fonts.pdf
> >
> > The PDF contains item numbers like this one: 09500214
> > From the source PDF I can copy and paste them perfectly.
> > However, when I convert the PDF to SWF, the text information is
> > partially lost.
> >
> > I already inspected pdf2swf with the -vvv option, and in the log
> > everything seems right.
> >
> > But when I check the text with swfstrings I get some Unicode crap
> > instead of the original item number.
> >
> > Here is the part of the log for the second item on the page, with the
> > text "LSPG 1 09500214 105263 36":
> >
> > VERBOSE Updating font to OBIGPM+Corbel-Italic-60-0
> > VERBOSE Updating font to XJVNLU+Corbel-62-0
> > TRACE beginString(L) render=0
> > DEBUG drawChar(102.047200,205.512000,c='L' (76), u=76 <1> 'L') CID=0 render=0 glyphid=20 font=0xc3d7d0 size=0.007471
> > TRACE Placing shape ID 32
> > TRACE Drawing char 20 in font 5 at 1,0 in color 000000ff
> > TRACE endString() render=0 textstroke=(nil)
> > TRACE beginString(SPG) render=0
> > DEBUG drawChar(106.025200,205.512000,c='S' (83), u=83 <1> 'S') CID=0 render=0 glyphid=24 font=0xc3d7d0 size=0.007471
> > TRACE Drawing char 24 in font 5 at 94,0 in color 000000ff
> > DEBUG drawChar(110.248000,205.512000,c='P' (80), u=80 <1> 'P') CID=0 render=0 glyphid=22 font=0xc3d7d0 size=0.007471
> > TRACE Drawing char 22 in font 5 at 194,0 in color 000000ff
> > DEBUG drawChar(114.608500,205.512000,c='G' (71), u=71 <1> 'G') CID=0 render=0 glyphid=16 font=0xc3d7d0 size=0.007471
> > TRACE Drawing char 16 in font 5 at 296,0 in color 000000ff
> > TRACE endString() render=0 textstroke=(nil)
> > VERBOSE Updating font to ERHDFG+Corbel-58-0
> > TRACE beginString(.w) render=0
> > DEBUG drawChar(426.070600,205.512000,c='w' (887), u=49 <1> '1') CID=1 render=0 glyphid=34 font=0xc21470 size=0.007471
> > TRACE Drawing char 5 in font 4 at 7625,0 in color 000000ff
> > TRACE endString() render=0 textstroke=(nil)
> > TRACE beginString(.v...{.v.v.x.w.z) render=0
> > DEBUG drawChar(442.105000,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
> > TRACE Drawing char 40 in font 4 at 8002,0 in color 000000ff
> > DEBUG drawChar(446.029450,205.512000,c=' ' (895), u=57 <1> '9') CID=1 render=0 glyphid=42 font=0xc21470 size=0.007471
> > TRACE Drawing char 13 in font 4 at 8094,0 in color 000000ff
> > DEBUG drawChar(449.953900,205.512000,c='{' (891), u=53 <1> '5') CID=1 render=0 glyphid=38 font=0xc21470 size=0.007471
> > TRACE Drawing char 9 in font 4 at 8187,0 in color 000000ff
> > DEBUG drawChar(453.878350,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
> > TRACE Drawing char 40 in font 4 at 8279,0 in color 000000ff
> > DEBUG drawChar(457.802800,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
> > TRACE Drawing char 40 in font 4 at 8371,0 in color 000000ff
> > DEBUG drawChar(461.727250,205.512000,c='x' (888), u=50 <1> '2') CID=1 render=0 glyphid=35 font=0xc21470 size=0.007471
> > TRACE Drawing char 41 in font 4 at 8464,0 in color 000000ff
> > DEBUG drawChar(465.651700,205.512000,c='w' (887), u=49 <1> '1') CID=1 render=0 glyphid=34 font=0xc21470 size=0.007471
> > TRACE Drawing char 5 in font 4 at 8556,0 in color 000000ff
> > DEBUG drawChar(469.576150,205.512000,c='z' (890), u=52 <1> '4') CID=1 render=0 glyphid=37 font=0xc21470 size=0.007471
> > TRACE Drawing char 42 in font 4 at 8648,0 in color 000000ff
> > TRACE endString() render=0 textstroke=(nil)
> > TRACE beginString(.w.v.{.x.|.y) render=0
> > DEBUG drawChar(518.306650,205.512000,c='w' (887), u=49 <1> '1') CID=1 render=0 glyphid=34 font=0xc21470 size=0.007471
> > TRACE Drawing char 5 in font 4 at 9795,0 in color 000000ff
> > DEBUG drawChar(522.231100,205.512000,c='v' (886), u=48 <1> '0') CID=1 render=0 glyphid=33 font=0xc21470 size=0.007471
> > TRACE Drawing char 40 in font 4 at 9887,0 in color 000000ff
> > DEBUG drawChar(526.155550,205.512000,c='{' (891), u=53 <1> '5') CID=1 render=0 glyphid=38 font=0xc21470 size=0.007471
> > TRACE Drawing char 9 in font 4 at 9980,0 in color 000000ff
> > DEBUG drawChar(530.080000,205.512000,c='x' (888), u=50 <1> '2') CID=1 render=0 glyphid=35 font=0xc21470 size=0.007471
> > TRACE Drawing char 41 in font 4 at 10072,0 in color 000000ff
> > DEBUG drawChar(534.004450,205.512000,c='|' (892), u=54 <1> '6') CID=1 render=0 glyphid=39 font=0xc21470 size=0.007471
> > TRACE Drawing char 10 in font 4 at 10164,0 in color 000000ff
> > DEBUG drawChar(537.928900,205.512000,c='y' (889), u=51 <1> '3') CID=1 render=0 glyphid=36 font=0xc21470 size=0.007471
> > TRACE Drawing char 7 in font 4 at 10257,0 in color 000000ff
> > TRACE endString() render=0 textstroke=(nil)
> > TRACE beginString(.y.|) render=0
> > DEBUG drawChar(560.167450,205.512000,c='y' (889), u=51 <1> '3') CID=1 render=0 glyphid=36 font=0xc21470 size=0.007471
> > TRACE Drawing char 7 in font 4 at 10780,0 in color 000000ff
> > DEBUG drawChar(564.091900,205.512000,c='|' (892), u=54 <1> '6') CID=1 render=0 glyphid=39 font=0xc21470 size=0.007471
> > TRACE Drawing char 10 in font 4 at 10872,0 in color 000000ff
> > TRACE endString() render=0 textstroke=(nil)
> > TRACE endTextObject() render=0 textstroke=(nil) clipstroke=(nil)
> > TRACE saveState 0xc69e70
> > DEBUG updateLineDash, 0 dashes
> > VERBOSE Updating font to ERHDFG+Corbel-58-0
> >
> > This is how I converted the file:
> >
> > pdf2swf -T 9 -G -f -vvv catalog.pdf -o 1.swf
> >
> > Here is the complete log file as a zipped text file:
> > http://m.scireum.de/cid-font-log.txt.zip
> >
> > To speed things up a little, we offer 500 USD, paid via PayPal, to the
> > first person who can solve our issue.
> >
> > The issue is resolved if we can convert the PDF to SWF with the correct
> > text in the SWF.
> >
> > For further information just contact me: m...@scireum.de
> >
> > Greetings
> >
> > Michael Haufler
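One thing that can be read off the quoted pdf2swf log: for the CID=0 font the char code in parentheses equals the Unicode value (76 and u=76 for 'L'), while for every CID=1 glyph in this excerpt the char code is exactly 838 higher than the Unicode value (887 vs. u=49, 886 vs. u=48, and so on). Whether that offset is what ends up in the swfstrings output is only a guess. A small sketch to pull those pairs out of a log file, assuming the drawChar line format shown above; the function name and the regular expression are illustrative and not part of SWFTools:

```python
import re

# Matches the drawChar lines as they appear in the quoted log, e.g.
#   drawChar(426.070600,205.512000,c='w' (887), u=49 <1> '1')
# Group 1 is the char code, group 2 the Unicode value.
DRAWCHAR = re.compile(
    r"drawChar\([^,]+,[^,]+,c='.*?' \((\d+)\), u=(\d+) <\d+> '.*?'\)"
)


def char_code_mismatches(log_text):
    """Return (char_code, unicode_value, unicode_char) for every drawChar
    entry whose char code differs from its Unicode value."""
    mismatches = []
    for match in DRAWCHAR.finditer(log_text):
        code, uni = int(match.group(1)), int(match.group(2))
        if code != uni:
            mismatches.append((code, uni, chr(uni)))
    return mismatches


def offsets(log_text):
    """Return the set of (char_code - unicode_value) differences found."""
    return {code - uni for code, uni, _ in char_code_mismatches(log_text)}
```

Run against the complete log, this would show whether the constant offset holds for the whole document or only for the excerpt quoted here.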
--------------- SWFTools-common is a self-managed list. To subscribe/unsubscribe, or amend an existing subscription, please kindly point your favourite web browser at:<http://lists.nongnu.org/mailman/listinfo/swftools-common>