> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
> Sent: April 11, 2004 1:03 PM
> To: Lucene Users List
> Subject: Re: clustering results
>
> I got all excited reading the subject line "clustering
> results" but this isn
April 2004 19:03
An: Lucene Users List
Betreff: Re: clustering results
I got all excited reading the subject line "clustering results" but this isn't
really clustering is it? This is more sorting. Does anyone know of any work
within Lucene (or another indexer) to do actual sub
Carrot (2):
http://www.cs.put.poznan.pl/dweiss/carrot/xml/index.xml?lang=en
Otis
--- [EMAIL PROTECTED] wrote:
> I got all excited reading the subject line "clustering results" but
> this isn't
> really clustering is it? This is more sorting. Does anyone know of
>
I got all excited reading the subject line "clustering results" but this isn't
really clustering is it? This is more sorting. Does anyone know of any work
within Lucene (or another indexer) to do actual subject clustering (i.e. like
Vivisimo @ http://vivisimo.com/ o
that'd be great.
Any other ideas?
- Original Message -
From: "Erik Hatcher" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Saturday, April 10, 2004 12:35 AM
Subject: Re: clustering results
> On Apr 9, 2004, at 8:16 PM, Michael
On Apr 10, 2004, at 9:47 AM, Venu Durgam wrote:
I am not sure how sort can filter out results.
sort will just sort the results right ?
Right no filtering using Sort.
lets say if i had below results
http://www.b.com/1.html
http://www.a.com/1.html
http://www.b.com/2.html
http://www.a.com/2.html
Erik,
Thanks for the poiner.
I am not sure how sort can filter out results.
sort will just sort the results right ?
lets say if i had below results
http://www.b.com/1.html
http://www.a.com/1.html
http://www.b.com/2.html
http://www.a.com/2.html
if you sort by domain name, results might be
http:
On Apr 9, 2004, at 8:16 PM, Michael A. Schoen wrote:
I have an index of urls, and need to display the top 10 results for a
given query, but want to display only 1 result per domain. It seems
that using either Hits or a HitCollector, I'll need to access the doc,
grab the domain field (I'll have i
I have an index of urls, and need to display the top 10 results for a given query, but
want to display only 1 result per domain. It seems that using either Hits or a
HitCollector, I'll need to access the doc, grab the domain field (I'll have it parse
ahead of time) and only take/display document