At 03:03 PM 4/3/2003 -0800, Matt Benson wrote:
Does anyone (Leonard) know of a package that will do
this, or should I implement parsing text from one of
JPedal or PdfBox?

There are LOTS of PDF indexing engines out there - commercial, open source, your choice of languages, etc. Do a search on FreshMeat...


OR you could indeed use JPedal or PdfBox to do it yourself, but that's just the extraction. Indexing is the harder part to get right, especially if you plan to offer linguistic support (stemming, Unicode, etc.) and efficient storage of the tables.
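
To give a rough idea of what the indexing side involves once the text is out of the PDF, here is a minimal sketch of an in-memory inverted index. Everything in it is an assumption for illustration: the class name TinyIndex and its methods are hypothetical and not part of JPedal, PdfBox, or iText, the "stemmer" is a crude stand-in that only strips a trailing "s", and the text is assumed to have been extracted already. A real engine would need proper stemming and a compact on-disk format for the tables.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical toy index for illustration; not part of any PDF library.
class TinyIndex {
    // term -> sorted set of ids of the documents that contain the term
    private final Map<String, Set<Integer>> postings = new TreeMap<>();

    // Crude placeholder for a real stemmer: lowercase, then strip one trailing "s".
    static String normalize(String token) {
        String t = token.toLowerCase(Locale.ROOT);
        if (t.length() > 3 && t.endsWith("s")) {
            t = t.substring(0, t.length() - 1);
        }
        return t;
    }

    // Split on anything that is not a Unicode letter or digit.
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String raw : text.split("[^\\p{L}\\p{N}]+")) {
            if (!raw.isEmpty()) {
                terms.add(normalize(raw));
            }
        }
        return terms;
    }

    // Record every normalized term of an already-extracted document.
    void add(int docId, String text) {
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
        }
    }

    // Return the ids of documents containing the (normalized) word.
    Set<Integer> search(String word) {
        return postings.getOrDefault(normalize(word), new TreeSet<>());
    }
}
```

Calling add(1, extractedText) for each document and then search("tables") would return the ids of documents containing "table" or "tables". Keeping the postings in memory as sorted sets is the simplest possible storage choice; it is exactly the part that gets hard at scale.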


Leonard
---------------------------------------------------------------------------
Leonard Rosenthol <mailto:[EMAIL PROTECTED]>
Chief Technical Officer <http://www.pdfsages.com>
PDF Sages, Inc. 215-629-3700 (voice)



_______________________________________________
iText-questions mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/itext-questions
