[EMAIL PROTECTED] wrote:
Hello all,
I need some advice based on your experience.
Which PDF parser (written in Java) would you recommend?
I have been experimenting with PDFBox-0.6.7a and am not very satisfied
with it.
On certain PDFs (not well formatted, but still readable with Acrobat) it
runs into dead
Hello,
1. Is it possible to use Lucene to search PDF contents?
2. Can it search PDF files with Chinese content?
Eric
--- Eric Chow [EMAIL PROTECTED] wrote:
Hello,
1. Is it possible to use Lucene to search PDF
contents?
Yes, you need to use an external tool to extract
the text from the PDF file and then pass it to Lucene
for indexing. If you search this list you will
find a lot of mails related to