Re: Search clustering question

2005-11-24 Thread Dawid Weiss
> clusters documents and even labels them, and it takes a few seconds (up to two seconds) to cluster 100 search results snippets.
Thanks for the info, Lorenzo. I'm just a little surprised at the timings you gave -- that's on a slow machine, right? On a PIII, 3 GHz the timings are between 250ms

Re: Search clustering question

2005-11-24 Thread Lorenzo Viscanti
Clustering is an intensive task. Carrot2 is an excellent framework that clusters documents and even labels them, and it takes a few seconds (up to two seconds) to cluster 100 search results snippets. If you are going to cluster entire documents you'll have to wait longer.
Lorenzo
On 11/23/05, Supr

Re: Search clustering question

2005-11-23 Thread Dawid Weiss
> Does anyone have examples of using Carrot2? I've been looking into it lately and am not finding good documentation.
Carrot2 is a programming framework. Its demos are meant to showcase the technology, but you need to delve into the code to actually build something of your own. The manual t

RE: Search clustering question

2005-11-23 Thread Runde, Kevin
Search clustering question
Have you looked into using Carrot2 (it is on sourceforge...)
-----Original Message-----
From: Supreet Sethi <[EMAIL PROTECTED]>
To: Java lucene list
Sent: Wed, 23 Nov 2005 17:34:22 +0530
Subject: Search clustering question
Hi, For final finish up on work for my

Re: Search clustering question

2005-11-23 Thread msftblows
Have you looked into using Carrot2 (it is on sourceforge...)
-----Original Message-----
From: Supreet Sethi <[EMAIL PROTECTED]>
To: Java lucene list
Sent: Wed, 23 Nov 2005 17:34:22 +0530
Subject: Search clustering question
Hi, For final finish up on work for my project. We intend

Search clustering question

2005-11-23 Thread Supreet Sethi
Hi, To finish up the work on my project, we intend to do search clustering. I have already read that there is no clear-cut way of doing that in Lucene. I am wondering if anyone has tackled this problem with time as a constraint. With turnaround time of 3 sec clustering 5000 searc