Lucene - Text Classification.

2009-11-09 Thread lucenenew

I want to classify sentences, stored as strings, against sets of keywords,
where each set belongs to a certain category.

I will have 10 strings, each one sentence long. I want to compare each
string to a set of 30 keywords stored somewhere, then to another set of 30
keywords, and so on.

I want to rank each string by the number of times it matches each set of
keywords, so basically I want to categorize each sentence.
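
To make the task concrete, here is a minimal sketch of the matching in plain
Java, with no Lucene involved. It is only an illustration under assumptions:
the class name KeywordClassifier, the two category names, and the short
keyword sets are all made up (the real sets would hold 30 keywords each), and
keywords are assumed to be stored in lowercase.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, not part of Lucene: scores a sentence against keyword sets.
class KeywordClassifier {

    // Count how many tokens of the sentence appear in the keyword set.
    // A keyword that occurs twice in the sentence is counted twice.
    static int score(String sentence, Set<String> keywords) {
        int hits = 0;
        // Lowercase, then split on any run of characters that are not letters or digits.
        for (String token : sentence.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (keywords.contains(token)) {
                hits++;
            }
        }
        return hits;
    }

    // Return the category with the highest score, or null if no keyword matches.
    // On a tie, the category that was inserted first wins.
    static String classify(String sentence, Map<String, Set<String>> categories) {
        String best = null;
        int bestScore = 0;
        for (Map.Entry<String, Set<String>> entry : categories.entrySet()) {
            int s = score(sentence, entry.getValue());
            if (s > bestScore) {
                best = entry.getKey();
                bestScore = s;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Made-up categories and keywords, for illustration only.
        Map<String, Set<String>> categories = new LinkedHashMap<>();
        categories.put("sports", Set.of("goal", "match", "team"));
        categories.put("finance", Set.of("bank", "loan", "interest"));

        String[] sentences = {
            "The team scored a late goal in the match.",
            "The bank raised the interest rate on my loan."
        };
        for (String sentence : sentences) {
            System.out.println(classify(sentence, categories) + " <- " + sentence);
        }
    }
}
```

Each sentence is tokenized once per category and every token is a single
hash lookup, so 10 sentences against a few sets of 30 keywords is a very
small workload. This sketch does exact word matching only (no stemming, no
phrases).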

Is this possible with Lucene, or would another approach be more efficient?

Will this process take long, in terms of program run time?

And what tools would I need?

Any help would be great.

Thanks.
-- 
View this message in context: 
http://old.nabble.com/Lucene---Text-Classification.-tp26267794p26267794.html
Sent from the Lucene - Java Developer mailing list archive at Nabble.com.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



Re: Lucene - Text Classification.

2009-11-09 Thread Erick Erickson
Please re-post this question on the Lucene users' list; this list is
intended for development discussions.

Best
Erick
