Hi Jozef That would certainly help a lot. And I would be happy to compile it myself and give it a try.
Thanks in advance. Thomas On Fri, 4 Sep 2009, Jozef Misutka wrote: > hi, > > i changed the algorithm of pdftotext a bit but it is far from what i > would like it to be. nevertheless, i can provide you with source code of > you tool using our pdfedit library extract text function but you would > have to compile it by your own. will it help? > > /jozo > > ---------------------------------------- >> Date: Fri, 4 Sep 2009 11:52:46 +0200 >> From: [email protected] >> To: [email protected] >> Subject: Re: [Pdfedit-support] Save file as text from the command line >> >> On Fri, 4 Sep 2009, Alister Hood wrote: >> >>> Sorry if someone else replied and I missed it. >>> I don't know how to do this with pdfedit, but you could alternatively >>> try the pdftotext tool from xpdf, or pdftohtml if that is more suitable >>> for your purpose. >>> >>> Alister >> >> I am currently using pdftotext in my script. However, it doesn't work >> well. That means, it drops a lot of spaces between words which makes the >> output almost unuseable. This may be a problem with the PDF-input, but >> I have no influence on this. For this reason I tried to use pdfedit and >> found, that it's much better: the output is perfect. >> >>> From the man page I can see that there is a command line mode. I found the >> script savealltext.qs on the wiki. But I can't figure out how to use this >> from the command line. I still guess it must be easy, but I have no >> success so far. Unfortunately I could not find any examples of how to use >> pdfedit in command line mode. >> >> Thomas >> >> >>> -----Original Message----- >>> From: Thomas Spahni [mailto:[email protected]] >>> Sent: Thursday, 3 September 2009 12:21 a.m. >>> To: [email protected] >>> Subject: [Pdfedit-support] Save file as text from the command line >>> >>> Hello >>> >>> I'm a new subscriber on this list; greetings to everyone. >>> >>> I have a bash script which at some point should translate a PDF file to >>> plain text. Let's say we have foobar.pdf and want to convert it to >>> foobar.txt. I can do this from the GUI but I'm unable to figure out what >>> >>> the command should be to do the same from the command line. >>> >>> Yes, I read the docs, manpage, wiki, archives, but still no luck. Yor >>> help >>> would be very much appreciated. >>> >>> Details: PDFedit 0.4.2 from the SuSE-11.1 packman repo. >>> >>> Best ragards, >>> Tom ------------------------------------------------------------------------------ Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july _______________________________________________ Pdfedit-support mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/pdfedit-support
