Hi John,

new tools package is available at sourceforge just for you :). The problem  
was the rotation of the page which should be fixed by now.

You are welcome,
jozef

> hi,
>
> I've been looking for a tool to convert pdf > txt so a miner bot can
> track data of interest.
>
> The text data in the pdf files is formatted by columns, which is
> important for parsing the text.  I'm dealing with about 1000 pdf files
> so I'd really like to avoid converting them by hand.
>
> The script tools I've tried so far all mangle the formatting.  Pdfedit
> is the only tool I've found so far that respects whitespace, and it does
> what I want, except version 0.4.5 is clipping lines, and I can't get
> pdf_to_text to compile under 0.4.1.
>
> A specimen pdf file I'm having a problem with is here:
> http://tinyurl.com/29a4xhw
>
> if I use v 0.4.1 and save as text it works fine
>
> if I use pdf_to_text (v 0.4.5) the text is clipped around column 117
>    and saving as text from the 0.4.5 gui gives the same clipped results
>
> attached is a part of the diff file comparing the output from 0.4.1 and
> 0.4.5
>
> I went to 0.4.5 specifically to use the stand alone tool pdf_to_text.
>
> I'm wondering if you have any insight/fix/workaround for this.
>
>
>       thanks,
>               JE
>
>
>


-- 
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/

------------------------------------------------------------------------------
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm
_______________________________________________
Pdfedit-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/pdfedit-support

Reply via email to