--- Eric Chow <[EMAIL PROTECTED]> wrote: 
> Hello,
> 
> 1. Is it possibleto use Lucene to search PDF
> contents ?
Yes, you need to use some external tools to extract
the text from the PDF file and then pass it to lucene
for indexing. If you do a search of this list you will
get lot of mails related to that.
 
> 2. Can it search Chinese contents PDF files ???
I have used a tool called xpdf (in linux) and it works
with both chinese traditional and chinese simplified.
It gives language support packages for many of the
languages. Please take a look at the URL below.
http://www.foolabs.com/xpdf/download.html

Now the tool only helps in extracting the text.
Whether you can search chinese text or not depends on
the analyzer you use in Lucene. Try CJKAnalyzer for
CJK text search.

Thanks,
  George


        
        
                
___________________________________________________________ALL-NEW Yahoo! Messenger - 
all new features - even more fun!  http://uk.messenger.yahoo.com

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to