Take a look at Bayesian text classification, which might be more efficient for your needs; a web search will turn up plenty of introductions. There are several other text classification methods, and depending on your needs, you can dig into them.
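For the plain keyword-matching approach described in the question quoted below, ordinary Java collections are probably enough: with about 10 sentences and a few sets of 30 keywords, the work is a handful of hash lookups per sentence. Here is a minimal sketch of that baseline. It is not Bayesian classification and does not use Lucene; the class name, method names, and example categories are all hypothetical, and it assumes the keywords are stored in lowercase.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical helper (not a Lucene class): scores a sentence against keyword sets.
final class KeywordClassifier {

    // Counts how many tokens of the sentence appear in the keyword set.
    // A keyword that occurs twice in the sentence is counted twice.
    // Assumes the keywords are already lowercase.
    static int score(String sentence, Set<String> keywords) {
        int hits = 0;
        // Lowercase, then split on anything that is not a letter or digit.
        for (String token : sentence.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (keywords.contains(token)) {
                hits++;
            }
        }
        return hits;
    }

    // Returns the category with the most hits, or null if no keyword matched.
    // On a tie, the category that was inserted first wins.
    static String classify(String sentence, Map<String, Set<String>> categories) {
        String best = null;
        int bestScore = 0;
        for (Map.Entry<String, Set<String>> entry : categories.entrySet()) {
            int s = score(sentence, entry.getValue());
            if (s > bestScore) {
                bestScore = s;
                best = entry.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Example categories, made up for illustration.
        Map<String, Set<String>> categories = new LinkedHashMap<>();
        categories.put("sports", new HashSet<>(Arrays.asList("goal", "match", "team")));
        categories.put("finance", new HashSet<>(Arrays.asList("stock", "market", "bank")));

        String sentence = "The team scored a late goal to win the match.";
        System.out.println(classify(sentence, categories)); // prints "sports"
    }
}
```

This only matches exact single-word keywords. If you need stemming, stop words, or multi-word phrases, that is where a Lucene analyzer or a trained classifier would start to pay off.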
On Mon, Nov 9, 2009 at 10:33 AM, lucenenew <mitesh.jes...@yahoo.com> wrote:
>
> i want to classify sentences stored as strings to a bunch of keywords related
> to a certain category.
>
> so i will have 10 strings which will be a sentence long. and i will want to
> compare each string to a set of 30 keywords stored somewhere, and then
> compare with another set of 30 keywords, so on.
>
> i want to rank each string based on the number of times it matches a set of
> keywords. so basically i want to categorize each sentence.
>
> is this possible with lucene, or would any other approach be more efficient.
>
> will this process take long? in terms of speed of program.
>
> and what tools would i need?
>
> any help would be great.
>
> thanks.
> --
> View this message in context:
> http://old.nabble.com/Lucene---Text-Classification.-tp26268359p26268359.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org