[ANN] Carrot2 version 3.1.0 released

2009-09-29 Thread Stanislaw Osinski
-new-clustering-capabilities/ ) Release notes: http://project.carrot2.org/release-3.1.0-notes.html On-line demo: http://search.carrot2.org Download: http://download.carrot2.org Project website: http://project.carrot2.org Thanks, Staszek -- Stanislaw Osinski, http://carrot2.org

Re: Scale Out

2010-02-08 Thread Stanislaw Osinski
> Any thoughts on scaling / clustering? Whether i need to use Hadoop / Carrot > etc... > Carrot2 does search results clustering (by content), while what you probably need is server/index clustering. See the other responses in this thread for suggestions. S.

[ANN] Carrot2 3.2.0 released

2010-03-03 Thread Stanislaw Osinski
at i...@carrotsearch.com for details. Carrot Search Labs shares some small pieces of software we created when working on Carrot2 and Lingo3G. Please see http://labs.carrotsearch.com for details and downloads. Thanks! Dawid Weiss, Stanislaw Osinski Carrot Search, i...@carrot-search.com

[ANN] Carrot2 3.3.0 released

2010-04-19 Thread Stanislaw Osinski
ngine from Carrot Search. Thanks! Dawid Weiss, Stanislaw Osinski Carrot Search, i...@carrot-search.com

Re: cluster documents based on fields' values

2010-08-18 Thread Stanislaw Osinski
> > A colleague of mine also discovered solr's clustering component - > http://wiki.apache.org/solr/ClusteringComponent. It's still labeled as > experimental - does anybody have experience with it? > The clustering component is based on the Carrot2 project ( project.carrot2.org). Carrot2 has been

Re: StandardTokenizer is slowing down highlighting a lot

2007-07-25 Thread Stanislaw Osinski
ups. Best, Staszek -- Stanislaw Osinski, [EMAIL PROTECTED] http://www.carrot-search.com

Re: StandardTokenizer is slowing down highlighting a lot

2007-07-25 Thread Stanislaw Osinski
ample, dropping NUM recognition if you don't need it in the current StandardAnalyzer gains like 25% speed. That's a good idea, though I'd need to check if in case of JFlex there would be considerable performance differences depending on the grammar. Staszek -- Stanislaw Osinski, [E

Re: StandardTokenizer is slowing down highlighting a lot

2007-07-26 Thread Stanislaw Osinski
I've attached a patch with the JFlex-based analyzer to https://issues.apache.org/jira/browse/LUCENE-966. The code needs some refactoring, but it shows some nice performance gains (5.5 -- 8.1 times compared to StandardAnalyzer on Sun JVMs). Staszek -- Stanislaw Osinski, [EMAIL PROTECTE

Re: StandardTokenizer is slowing down highlighting a lot

2007-07-25 Thread Stanislaw Osinski
On 25/07/07, Yonik Seeley <[EMAIL PROTECTED]> wrote: On 7/25/07, Stanislaw Osinski <[EMAIL PROTECTED]> wrote: > JavaCC is slow indeed. JavaCC is a very fast parser for a large document... the issue is small fields and JavaCC's use of an exception for flow control at the

[release announcement] Carrot2 version 2.1 released

2007-08-13 Thread Stanislaw Osinski
Hi All, A bit of self-promotion again :) I hope you don't find it out of topic, after all, some folks are using Carrot2 with Lucene and Solr, and Nutch has a Carrot2-based clustering plugin. Staszek [EMAIL PROTECTED] ___

Re: Clustering with Lucene

2005-10-17 Thread Stanislaw Osinski
ery curious how Carrot works in your application. Cheers, Stanislaw -- Stanislaw Osinski, [EMAIL PROTECTED] http://www.carrot-search.com On 10/17/05, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: > > Hi All- > > I have seen an example using carrot2 for clustering, but have not rea