Hi,
Sorry for being late to the party, let me try to clear some doubts about
Carrot2.
Do you know under what circumstances or application should we cluster the
> whole corpus of documents vs just the search results?
I think it depends on what you're trying to achieve. If you'd like to give
the
atext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> - Original Message
> > From: Jeffrey Tiong
> > To: solr-user@lucene.apache.org
> > Sent: Friday, June 12, 2009 12:44:55 AM
> > Subject: Re: Faceting on text fields
> >
> > Hi all,
&
ematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Jeffrey Tiong
> To: solr-user@lucene.apache.org
> Sent: Friday, June 12, 2009 12:44:55 AM
> Subject: Re: Faceting on text fields
>
> Hi all,
>
> We are thinking of using the carrot
Hi all,
We are thinking of using the carrot clustering too. But we saw that carrot
maybe can only cluster up to 1000 search snippets. Does anyone know how can
we cluster snippets that is much more than that ? (maybe in the million
range?)
And what is the difference between mahout and carrot?
Tha
Yao Ge schrieb:
BTW, Carrot2 has a very impressive Clustering Workbench (based on
eclipse) that has built-in integration with Solr. If you have a Solr
service running, it is a just a matter of point the workbench to it.
The clustering results and visualization are amazing.
(http://project.carrot2
exactly which algo is used under
>> the hood.
>>
>> Otis
>> --
>> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>>
>>
>>
>> - Original Message
>>> From: Michael Ludwig
>>> To: solr-user@lucene.apac
/ -- Lucene - Solr - Nutch
>
>
>
> - Original Message
>> From: Michael Ludwig
>> To: solr-user@lucene.apache.org
>> Sent: Wednesday, June 10, 2009 9:41:54 AM
>> Subject: Re: Faceting on text fields
>>
>> Otis Gospodnetic schrieb:
>> &g
o is used under the hood.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Michael Ludwig
> To: solr-user@lucene.apache.org
> Sent: Wednesday, June 10, 2009 9:41:54 AM
> Subject: Re: Faceting on text fields
>
> Otis Gosp
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> - Original Message
>> From: Yao Ge
>> To: solr-user@lucene.apache.org
>> Sent: Tuesday, June 9, 2009 3:46:13 PM
>> Subject: Re: Faceting on text fields
>>
Otis Gospodnetic schrieb:
Solr can already cluster top N hits using Carrot2:
http://wiki.apache.org/solr/ClusteringComponent
Would it be fair to say that clustering as detailed on the page you're
referring to is a kind of dynamic faceting? The faceting not being done
based on distinct values o
Yonik Seeley schrieb:
Yep, all that sounds right.
An additional optimization counts terms for the documents *not* in the
set when the base set is over half the size of the index.
Cool :-) Thanks for confirming my assumptions!
Michael Ludwig
thing like http://www.sematext.com/product-key-phrase-extractor.html could
also be used.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Yao Ge
> To: solr-user@lucene.apache.org
> Sent: Tuesday, June 9, 2009 3:46:13 PM
> Subject:
ta mining question! I'm sorry I don't have any
> answers for you. Maybe someone else does.
>
> Best,
>
> Michael Ludwig
>
>
--
View this message in context:
http://www.nabble.com/Faceting-on-text-fields-tp23872891p23950084.html
Sent from the Solr - User mailing list archive at Nabble.com.
Yep, all that sounds right.
An additional optimization counts terms for the documents *not* in the
set when the base set is over half the size of the index.
-Yonik
http://www.lucidimagination.com
On Tue, Jun 9, 2009 at 1:01 PM, Michael Ludwig wrote:
> Yonik,
>
> from your initial comment for SO
Yonik Seeley schrieb:
Are you using Solr 1.3?
You might want to try the latest 1.4 test build -
faceting has changed a lot.
I found two significant changes (but there may well be more):
[#SOLR-911] multi-select facets - ASF JIRA
https://issues.apache.org/jira/browse/SOLR-911
Yao,
it sounds l
Yao Ge schrieb:
The facet query is considerably slower comparing to other facets from
structured database fields (with highly repeated values). What I found
interesting is that even after I constrained search results to just a
few hunderd hits using other facets, these text facets are still very
filterCahce setting:
>> > autowarmCount="128"/>
>>
>> Lastly, what I really want to is to give user a chance to visualize and
>> filter on top relevant words in the free-text fields. Are there
>> alternative
>> to facet field approach? term
nt="128"/>
>
> Lastly, what I really want to is to give user a chance to visualize and
> filter on top relevant words in the free-text fields. Are there alternative
> to facet field approach? term vectors? I can do client side process based on
> top N (say 100) hits for this but it is my last option.
> --
> View this message in context:
> http://www.nabble.com/Faceting-on-text-fields-tp23872891p23872891.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>
ge in context:
http://www.nabble.com/Faceting-on-text-fields-tp23872891p23872891.html
Sent from the Solr - User mailing list archive at Nabble.com.
19 matches
Mail list logo