Hi,Srinivasa,
Use StandardAnaylzer for indexing and parsing query for Indian Lang. docs.
It will work. Right now we r searching on Hindi,Marathi
but without specific stemmers and filters. We r plannig to develop
Marathi Morphological Analyzer.
Thanks,
Satish.
On Sun, 22 Aug 2004, srinivasa r
Any one have built lucene for Devnagari UNICODE search? PLZ help me wht
kind of changes i have to do in lucene.
Also if any one have built StandardTokenizer,Analyzer,Stemmer,Indexer
,queryParser for Hindi & Marathi Plz let me know.
Thanks,
Satish.
-
Hello,
Does Lucene support UNICODE search and indexing of UNICODE
data(especially..Devnagari unicode data)?
Does it make any difference between utf-8 & utf-16 unicode docs? Bcoz
java strings supports utf-16.
Bcoz i tried indexing(using indexFiles & indexHTML from lucene Demo)
devnagari uni da