Hi Jozef

That would certainly help a lot. And I would be happy to compile it 
myself and give it a try.

Thanks in advance.

Thomas


On Fri, 4 Sep 2009, Jozef Misutka wrote:

> hi,
>
> i changed the algorithm of pdftotext a bit, but it is far from what i
> would like it to be. nevertheless, i can provide you with the source code
> of a tool that uses our pdfedit library's text extraction function, but
> you would have to compile it on your own. would that help?
>
> /jozo
>
> ----------------------------------------
>> Date: Fri, 4 Sep 2009 11:52:46 +0200
>> From: [email protected]
>> To: [email protected]
>> Subject: Re: [Pdfedit-support] Save file as text from the command line
>>
>> On Fri, 4 Sep 2009, Alister Hood wrote:
>>
>>> Sorry if someone else replied and I missed it.
>>> I don't know how to do this with pdfedit, but you could alternatively
>>> try the pdftotext tool from xpdf, or pdftohtml if that is more suitable
>>> for your purpose.
>>>
>>> Alister
>>
>> I am currently using pdftotext in my script. However, it doesn't work
>> well: it drops a lot of spaces between words, which makes the output
>> almost unusable. This may be a problem with the PDF input, but I have
>> no influence on that. For this reason I tried pdfedit and found that
>> it's much better: the output is perfect.
>>
>> From the man page I can see that there is a command line mode. I found the
>> script savealltext.qs on the wiki, but I can't figure out how to use it
>> from the command line. I guess it must be easy, but I have had no
>> success so far. Unfortunately I could not find any examples of how to use
>> pdfedit in command line mode.
>>
>> Thomas
>>
>>
>>> -----Original Message-----
>>> From: Thomas Spahni [mailto:[email protected]]
>>> Sent: Thursday, 3 September 2009 12:21 a.m.
>>> To: [email protected]
>>> Subject: [Pdfedit-support] Save file as text from the command line
>>>
>>> Hello
>>>
>>> I'm a new subscriber on this list; greetings to everyone.
>>>
>>> I have a bash script which at some point should translate a PDF file to
>>> plain text. Let's say we have foobar.pdf and want to convert it to
>>> foobar.txt. I can do this from the GUI but I'm unable to figure out what
>>> the command should be to do the same from the command line.
>>>
>>> Yes, I read the docs, manpage, wiki, archives, but still no luck. Your
>>> help would be very much appreciated.
>>>
>>> Details: PDFedit 0.4.2 from the SuSE-11.1 packman repo.
>>>
>>> Best regards,
>>> Tom
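
The conversion step from the original question (foobar.pdf to foobar.txt inside a bash script) could be sketched roughly as below. This uses pdftotext from xpdf, as suggested in the thread, and not pdfedit's command line mode, for which no working invocation appears in this thread. The function names txt_name_for and pdf_to_text are made up for illustration. The -layout option asks pdftotext to keep the physical page layout; whether that helps with the dropped spaces reported above is untested and will depend on the input PDF.

```shell
#!/usr/bin/env bash
# Sketch only: converts a PDF to plain text with pdftotext (xpdf).
# The function names are illustrative and not part of any tool.

# Derive the output name by swapping a trailing .pdf for .txt
# (a name without .pdf simply gets .txt appended).
txt_name_for() {
    local pdf="$1"
    printf '%s.txt\n' "${pdf%.pdf}"
}

# Convert one PDF; returns non-zero if pdftotext is missing or fails.
pdf_to_text() {
    local pdf="$1"
    local txt
    txt="$(txt_name_for "$pdf")"
    command -v pdftotext >/dev/null 2>&1 || {
        echo "pdftotext not found" >&2
        return 127
    }
    # -layout: try to maintain the original physical layout of the text
    pdftotext -layout "$pdf" "$txt"
}

# Example call (commented out so that sourcing this file has no side effects):
# pdf_to_text foobar.pdf
```

Keeping the file name logic in its own small function makes it easy to swap pdftotext for another extractor later without touching the rest of the script.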

_______________________________________________
Pdfedit-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/pdfedit-support
