Re: [iText-questions] PDF to Text

2007-11-28 Thread wasegraves
Open a PDF with your favorite text editor ( I use Vim with the Pdftk plugin), and examine the PDF to see the patterns in which the target text appears. Write a script to extract the text using a regex. You should be able to extract all of the text from a PDF in this manner, but don't be surpris

Re: [iText-questions] PDF to Text

2007-11-28 Thread Francisco Garrido Ortiz
ons about iText here Subject: Re: [iText-questions] PDF to Text Francisco Garrido Ortiz wrote: > Can anybody show me code for extract txt of pdf? > Thank You Very Much! *sigh* In chapter 18 of my book, I explain why iText doesn't do text extraction. Please use PdfBox instead of iText

Re: [iText-questions] PDF to Text

2007-11-28 Thread Bruno Lowagie
Francisco Garrido Ortiz wrote: Can anybody show me code for extract txt of pdf? Thank You Very Much! *sigh* In chapter 18 of my book, I explain why iText doesn't do text extraction. Please use PdfBox instead of iText for text extraction. Note that it's very hard to get acceptable results for

[iText-questions] PDF to Text

2007-11-28 Thread Francisco Garrido Ortiz
Hello to everybody. Can anybody show me code for extract txt of pdf? Thank You Very Much! - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linu

Re: [iText-questions] PDF to Text conversion

2007-01-31 Thread Bruno Lowagie
mister bean wrote: > In a word, no. iText doesn't generate .doc files nor does it offer text > extraction from PDF files. For info on the latter point, see the iText book, > p. 576. By the way: in the next release newPage won't throw a DocumentException. I also cleaned up the DocListener interface

Re: [iText-questions] PDF to Text conversion

2007-01-31 Thread mister bean
In a word, no. iText doesn't generate .doc files nor does it offer text extraction from PDF files. For info on the latter point, see the iText book, p. 576. ---Andrew Binstock Chen, Lingjiao wrote: > > Hi, I was wondering if it was possible to use IText to convert a PDF > into Text (.doc). Tha

[iText-questions] PDF to Text conversion

2007-01-31 Thread Chen, Lingjiao
Hi, I was wondering if it was possible to use IText to convert a PDF into Text (.doc). Thanks in advanced. - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated t

Re: [iText-questions] PDF To Text

2006-09-11 Thread bruno
alex leong wrote: >hi, >May I know is itext able to convert any pdf file to >pure text file ? > No. With Pdfbox you can convert some PDF files to pure text. With an OCR tool you can convert most PDF files to pure text. But converting ANY PDF file is impossible. In any case, it is a bad idea to con

[iText-questions] PDF To Text

2006-09-11 Thread alex leong
hi, May I know is itext able to convert any pdf file to pure text file ? Thanks __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com --

Re: [iText-questions] PDF to Text

2005-12-15 Thread bruno
Rajat Chhawchharia wrote: please let me know the API which supports this functionality. Search Google for OCR and PDF. br, Bruno --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download