Open a PDF with your favorite text editor ( I use Vim with the Pdftk plugin),
and examine the PDF to see the patterns in which the target text appears. Write
a script to extract the text using a regex.
You should be able to extract all of the text from a PDF in this manner, but
don't be surpris
ons about iText here
Subject: Re: [iText-questions] PDF to Text
Francisco Garrido Ortiz wrote:
> Can anybody show me code for extract txt of pdf?
> Thank You Very Much!
*sigh*
In chapter 18 of my book, I explain why iText doesn't do text
extraction. Please use PdfBox instead of iText
Francisco Garrido Ortiz wrote:
Can anybody show me code for extract txt of pdf?
Thank You Very Much!
*sigh*
In chapter 18 of my book, I explain why iText doesn't do text
extraction. Please use PdfBox instead of iText for text extraction. Note
that it's very hard to get acceptable results for
Hello to everybody.
Can anybody show me code for extract txt of pdf?
Thank You Very Much!
-
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell. From the desktop to the data center, Linu
mister bean wrote:
> In a word, no. iText doesn't generate .doc files nor does it offer text
> extraction from PDF files. For info on the latter point, see the iText book,
> p. 576.
By the way: in the next release newPage won't throw
a DocumentException. I also cleaned up the DocListener
interface
In a word, no. iText doesn't generate .doc files nor does it offer text
extraction from PDF files. For info on the latter point, see the iText book,
p. 576.
---Andrew Binstock
Chen, Lingjiao wrote:
>
> Hi, I was wondering if it was possible to use IText to convert a PDF
> into Text (.doc). Tha
Hi, I was wondering if it was possible to use IText to convert a PDF
into Text (.doc). Thanks in advanced.
-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated t
alex leong wrote:
>hi,
>May I know is itext able to convert any pdf file to
>pure text file ?
>
No.
With Pdfbox you can convert some PDF files to pure text.
With an OCR tool you can convert most PDF files to pure text.
But converting ANY PDF file is impossible.
In any case, it is a bad idea to con
hi,
May I know is itext able to convert any pdf file to
pure text file ?
Thanks
__
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com
--
Rajat Chhawchharia wrote:
please let me know the API which supports this functionality.
Search Google for OCR and PDF.
br,
Bruno
---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download
10 matches
Mail list logo