Re: Spliting of words

Paul Libbrecht Tue, 13 Sep 2005 03:10:17 -0700

Madhu,

Analyzer is the magic word here.

Lucene's StandardAnalyzer has a whole grammar to split words intotokens. There are many more analyzers, most of which are languagespecific (e.g. based the Snowball or Porter-stemmers, see contribs orjavadoc of core).


For which language do wish to use that ?

paul


Le 13 sept. 05, à 11:45, Madhu Satyanarayana Panitini a écrit :

Hai all

I want know the split pattern of text before indexing in Lucene, its
splits where ever there is space in between the words Or is there any
pattern in splitting the words of text document. In which program I can
find the code on the splitting of the word.

Madhu

Madhu Satyanarayana. Panitini
PASS GCA Solution Centre Pvt Ltd.
601 Aditya Trade Centre, Ameerpet,
Hyderabad, India.
www.pass-consulting.com



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Spliting of words

Reply via email to