Re: Faceting on text fields

2009-06-12 Thread Stanislaw Osinski
Hi, Sorry for being late to the party, let me try to clear some doubts about Carrot2. Do you know under what circumstances or application should we cluster the > whole corpus of documents vs just the search results? I think it depends on what you're trying to achieve. If you'd like to give the

Re: Faceting on text fields

2009-06-11 Thread Jeffrey Tiong
atext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message > > From: Jeffrey Tiong > > To: solr-user@lucene.apache.org > > Sent: Friday, June 12, 2009 12:44:55 AM > > Subject: Re: Faceting on text fields > > > > Hi all, &

Re: Faceting on text fields

2009-06-11 Thread Otis Gospodnetic
ematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Jeffrey Tiong > To: solr-user@lucene.apache.org > Sent: Friday, June 12, 2009 12:44:55 AM > Subject: Re: Faceting on text fields > > Hi all, > > We are thinking of using the carrot

Re: Faceting on text fields

2009-06-11 Thread Jeffrey Tiong
Hi all, We are thinking of using the carrot clustering too. But we saw that carrot maybe can only cluster up to 1000 search snippets. Does anyone know how can we cluster snippets that is much more than that ? (maybe in the million range?) And what is the difference between mahout and carrot? Tha

Re: Faceting on text fields

2009-06-11 Thread Michael Ludwig
Yao Ge schrieb: BTW, Carrot2 has a very impressive Clustering Workbench (based on eclipse) that has built-in integration with Solr. If you have a Solr service running, it is a just a matter of point the workbench to it. The clustering results and visualization are amazing. (http://project.carrot2

Re: Faceting on text fields

2009-06-11 Thread Yao Ge
exactly which algo is used under >> the hood. >> >> Otis >> -- >> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch >> >> >> >> - Original Message >>> From: Michael Ludwig >>> To: solr-user@lucene.apac

Re: Faceting on text fields

2009-06-11 Thread Yao Ge
/ -- Lucene - Solr - Nutch > > > > - Original Message >> From: Michael Ludwig >> To: solr-user@lucene.apache.org >> Sent: Wednesday, June 10, 2009 9:41:54 AM >> Subject: Re: Faceting on text fields >> >> Otis Gospodnetic schrieb: >> &g

Re: Faceting on text fields

2009-06-10 Thread Otis Gospodnetic
o is used under the hood. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Michael Ludwig > To: solr-user@lucene.apache.org > Sent: Wednesday, June 10, 2009 9:41:54 AM > Subject: Re: Faceting on text fields > > Otis Gosp

Re: Faceting on text fields

2009-06-10 Thread Yao Ge
> Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message >> From: Yao Ge >> To: solr-user@lucene.apache.org >> Sent: Tuesday, June 9, 2009 3:46:13 PM >> Subject: Re: Faceting on text fields >>

Re: Faceting on text fields

2009-06-10 Thread Michael Ludwig
Otis Gospodnetic schrieb: Solr can already cluster top N hits using Carrot2: http://wiki.apache.org/solr/ClusteringComponent Would it be fair to say that clustering as detailed on the page you're referring to is a kind of dynamic faceting? The faceting not being done based on distinct values o

Re: Faceting on text fields

2009-06-10 Thread Michael Ludwig
Yonik Seeley schrieb: Yep, all that sounds right. An additional optimization counts terms for the documents *not* in the set when the base set is over half the size of the index. Cool :-) Thanks for confirming my assumptions! Michael Ludwig

Re: Faceting on text fields

2009-06-09 Thread Otis Gospodnetic
thing like http://www.sematext.com/product-key-phrase-extractor.html could also be used. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Yao Ge > To: solr-user@lucene.apache.org > Sent: Tuesday, June 9, 2009 3:46:13 PM > Subject:

Re: Faceting on text fields

2009-06-09 Thread Yao Ge
ta mining question! I'm sorry I don't have any > answers for you. Maybe someone else does. > > Best, > > Michael Ludwig > > -- View this message in context: http://www.nabble.com/Faceting-on-text-fields-tp23872891p23950084.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Faceting on text fields

2009-06-09 Thread Yonik Seeley
Yep, all that sounds right. An additional optimization counts terms for the documents *not* in the set when the base set is over half the size of the index. -Yonik http://www.lucidimagination.com On Tue, Jun 9, 2009 at 1:01 PM, Michael Ludwig wrote: > Yonik, > > from your initial comment for SO

Re: Faceting on text fields

2009-06-09 Thread Michael Ludwig
Yonik Seeley schrieb: Are you using Solr 1.3? You might want to try the latest 1.4 test build - faceting has changed a lot. I found two significant changes (but there may well be more): [#SOLR-911] multi-select facets - ASF JIRA https://issues.apache.org/jira/browse/SOLR-911 Yao, it sounds l

Re: Faceting on text fields

2009-06-09 Thread Michael Ludwig
Yao Ge schrieb: The facet query is considerably slower comparing to other facets from structured database fields (with highly repeated values). What I found interesting is that even after I constrained search results to just a few hunderd hits using other facets, these text facets are still very

Re: Faceting on text fields

2009-06-04 Thread Yao Ge
filterCahce setting: >>     > autowarmCount="128"/> >> >> Lastly, what I really want to is to give user a chance to visualize and >> filter on top relevant words in the free-text fields. Are there >> alternative >> to facet field approach? term

Re: Faceting on text fields

2009-06-04 Thread Yonik Seeley
nt="128"/> > > Lastly, what I really want to is to give user a chance to visualize and > filter on top relevant words in the free-text fields. Are there alternative > to facet field approach? term vectors? I can do client side process based on > top N (say 100) hits for this but it is my last option. > -- > View this message in context: > http://www.nabble.com/Faceting-on-text-fields-tp23872891p23872891.html > Sent from the Solr - User mailing list archive at Nabble.com. > >

Faceting on text fields

2009-06-04 Thread Yao Ge
ge in context: http://www.nabble.com/Faceting-on-text-fields-tp23872891p23872891.html Sent from the Solr - User mailing list archive at Nabble.com.