Re: Newbie with Carrot2?

2012-05-22 Thread Stanislaw Osinski
Hi Bruno,

Just to confirm -- are you seeing the clusters array in the result at all
(arr name=clusters)? To get reasonable clusters, you should request at
least 30-50 documents (rows), but even with smaller values, you should see
an empty clusters array.

Staszek

On Sun, May 20, 2012 at 9:20 PM, Bruno Mannina bmann...@free.fr wrote:

 Le 20/05/2012 11:43, Stanislaw Osinski a écrit :

  Hi Bruno,

 Here's the wiki documentation for Solr's clustering component:

 http://wiki.apache.org/solr/**ClusteringComponenthttp://wiki.apache.org/solr/ClusteringComponent

 For configuration examples, take a look at the Configuration section:
 http://wiki.apache.org/solr/**ClusteringComponent#**Configurationhttp://wiki.apache.org/solr/ClusteringComponent#Configuration
 .

 If you hit any problems, let me know.

 Staszek

 On Sun, May 20, 2012 at 11:38 AM, Bruno Manninabmann...@free.fr  wrote:

  Dear all,

 I use Solr 3.6.0 and I indexed some documents (around 12000).
 Each documents contains a Abstract-en field (and some other fields).

 Is it possible to use Carrot2 to create cluster (classes) with the
 Abstract-en field?

 What must I configure in the schema.xml ? or in other files?

 Sorry for my newbie question, but I found only documentation for
 Workbench
 tool.

 Bruno

  Thx for this link but I have a problem to configure my solrconfig.xml
 in the section:
 (note I run java -Dsolr.clustering.enabled=**true)

 I have a field named abstract-en, and I would like to use only this field.

 I would like to know if my requestHandler is good?
 I have a doubt with the content of  : carrot.title, carrot.url

 and also the latest field
 str name=dfabstract-en/str
 str name=defTypeedismax/str
 str name=qf
  abstract-en^1.0
 /str
 str name=q.alt*:*/str
 str name=rows10/str
 str name=fl*,score/str

 because the result when I do a request is exactly like a search request
 (without more information)


 My entire requestHandler is:

 requestHandler name=/clustering startup=lazy
 enable=${solr.clustering.**enabled:false} class=solr.SearchHandler
 lst name=defaults
 bool name=clusteringtrue/bool
 str name=clustering.engine**default/str
 bool name=clustering.results**true/bool
 !-- The title field --
 str name=carrot.titlename/str
 str name=carrot.urlid/str
 !-- The field to cluster on --
 str name=carrot.snippet**abstract-en/str
 !-- produce summaries --
 bool name=carrot.produceSummary**true/bool
 !-- the maximum number of labels per cluster --
 !--int name=carrot.numDescriptions**5/int--
 !-- produce sub clusters --
 bool name=carrot.**outputSubClustersfalse/**bool
 str name=dfabstract-en/str
 str name=defTypeedismax/str
 str name=qf
  abstract-en^1.0
 /str
 str name=q.alt*:*/str
 str name=rows10/str
 str name=fl*,score/str
 /lst
 arr name=last-components
 strclustering/str
 /arr
 /requestHandler




Re: Newbie with Carrot2?

2012-05-22 Thread Bruno Mannina

Arfff

Clusters are at the end of my XML answer 

doc
/doc
doc
/doc
doc
/doc
doc
/doc
..
..
cluster
/cluster

ok all work fine now !


Le 22/05/2012 15:33, Stanislaw Osinski a écrit :

Hi Bruno,

Just to confirm -- are you seeing the clusters array in the result at all
(arr name=clusters)? To get reasonable clusters, you should request at
least 30-50 documents (rows), but even with smaller values, you should see
an empty clusters array.

Staszek

On Sun, May 20, 2012 at 9:20 PM, Bruno Manninabmann...@free.fr  wrote:


Le 20/05/2012 11:43, Stanislaw Osinski a écrit :

  Hi Bruno,

Here's the wiki documentation for Solr's clustering component:

http://wiki.apache.org/solr/**ClusteringComponenthttp://wiki.apache.org/solr/ClusteringComponent

For configuration examples, take a look at the Configuration section:
http://wiki.apache.org/solr/**ClusteringComponent#**Configurationhttp://wiki.apache.org/solr/ClusteringComponent#Configuration
.

If you hit any problems, let me know.

Staszek

On Sun, May 20, 2012 at 11:38 AM, Bruno Manninabmann...@free.fr   wrote:

  Dear all,

I use Solr 3.6.0 and I indexed some documents (around 12000).
Each documents contains a Abstract-en field (and some other fields).

Is it possible to use Carrot2 to create cluster (classes) with the
Abstract-en field?

What must I configure in the schema.xml ? or in other files?

Sorry for my newbie question, but I found only documentation for
Workbench
tool.

Bruno

  Thx for this link but I have a problem to configure my solrconfig.xml

in the section:
(note I run java -Dsolr.clustering.enabled=**true)

I have a field named abstract-en, and I would like to use only this field.

I would like to know if my requestHandler is good?
I have a doubt with the content of  : carrot.title, carrot.url

and also the latest field
str name=dfabstract-en/str
str name=defTypeedismax/str
str name=qf
  abstract-en^1.0
/str
str name=q.alt*:*/str
str name=rows10/str
str name=fl*,score/str

because the result when I do a request is exactly like a search request
(without more information)


My entire requestHandler is:

requestHandler name=/clustering startup=lazy
enable=${solr.clustering.**enabled:false} class=solr.SearchHandler
lst name=defaults
bool name=clusteringtrue/bool
str name=clustering.engine**default/str
bool name=clustering.results**true/bool
!-- The title field --
str name=carrot.titlename/str
str name=carrot.urlid/str
!-- The field to cluster on --
str name=carrot.snippet**abstract-en/str
!-- produce summaries --
bool name=carrot.produceSummary**true/bool
!-- the maximum number of labels per cluster --
!--int name=carrot.numDescriptions**5/int--
!-- produce sub clusters --
bool name=carrot.**outputSubClustersfalse/**bool
str name=dfabstract-en/str
str name=defTypeedismax/str
str name=qf
  abstract-en^1.0
/str
str name=q.alt*:*/str
str name=rows10/str
str name=fl*,score/str
/lst
arr name=last-components
strclustering/str
/arr
/requestHandler






Re: Newbie with Carrot2?

2012-05-21 Thread Chris Hostetter

: Subject: Newbie with Carrot2?
: References: 35E48F3294A0416A8F476E9C173321F3@msrvcn04
: In-Reply-To: 35E48F3294A0416A8F476E9C173321F3@msrvcn04

https://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists

When starting a new discussion on a mailing list, please do not reply to 
an existing message, instead start a fresh email.  Even if you change the 
subject line of your email, other mail headers still track which thread 
you replied to and your question is hidden in that thread and gets less 
attention.   It makes following discussions in the mailing list archives 
particularly difficult.



-Hoss


Re: Newbie with Carrot2?

2012-05-20 Thread Stanislaw Osinski
Hi Bruno,

Here's the wiki documentation for Solr's clustering component:

http://wiki.apache.org/solr/ClusteringComponent

For configuration examples, take a look at the Configuration section:
http://wiki.apache.org/solr/ClusteringComponent#Configuration.

If you hit any problems, let me know.

Staszek

On Sun, May 20, 2012 at 11:38 AM, Bruno Mannina bmann...@free.fr wrote:

 Dear all,

 I use Solr 3.6.0 and I indexed some documents (around 12000).
 Each documents contains a Abstract-en field (and some other fields).

 Is it possible to use Carrot2 to create cluster (classes) with the
 Abstract-en field?

 What must I configure in the schema.xml ? or in other files?

 Sorry for my newbie question, but I found only documentation for Workbench
 tool.

 Bruno



Re: Newbie with Carrot2?

2012-05-20 Thread Xue-Feng Yang
Hi Staszek,

I haven't found a way for inputting data into solr in the wiki. Does that mean 
docs can be inputted in a normal solr way after configuration? for example, DIH 
or solrj.

Thanks,

Xue-Feng




 From: Stanislaw Osinski stanis...@osinski.name
To: solr-user@lucene.apache.org 
Sent: Sunday, May 20, 2012 5:43:48 AM
Subject: Re: Newbie with Carrot2?
 
Hi Bruno,

Here's the wiki documentation for Solr's clustering component:

http://wiki.apache.org/solr/ClusteringComponent

For configuration examples, take a look at the Configuration section:
http://wiki.apache.org/solr/ClusteringComponent#Configuration.

If you hit any problems, let me know.

Staszek

On Sun, May 20, 2012 at 11:38 AM, Bruno Mannina bmann...@free.fr wrote:

 Dear all,

 I use Solr 3.6.0 and I indexed some documents (around 12000).
 Each documents contains a Abstract-en field (and some other fields).

 Is it possible to use Carrot2 to create cluster (classes) with the
 Abstract-en field?

 What must I configure in the schema.xml ? or in other files?

 Sorry for my newbie question, but I found only documentation for Workbench
 tool.

 Bruno


Re: Newbie with Carrot2?

2012-05-20 Thread Koji Sekiguchi

(12/05/20 23:21), Xue-Feng Yang wrote:

Hi Staszek,

I haven't found a way for inputting data into solr in the wiki. Does that mean 
docs can be inputted in a normal solr way after configuration? for example, DIH 
or solrj.

Thanks,

Xue-Feng


Right, because Carrot2 clustering is for search time.

koji
--
Query Log Visualizer for Apache Solr
http://soleami.com/


Re: Newbie with Carrot2?

2012-05-20 Thread Bruno Mannina

Le 20/05/2012 11:43, Stanislaw Osinski a écrit :

Hi Bruno,

Here's the wiki documentation for Solr's clustering component:

http://wiki.apache.org/solr/ClusteringComponent

For configuration examples, take a look at the Configuration section:
http://wiki.apache.org/solr/ClusteringComponent#Configuration.

If you hit any problems, let me know.

Staszek

On Sun, May 20, 2012 at 11:38 AM, Bruno Manninabmann...@free.fr  wrote:


Dear all,

I use Solr 3.6.0 and I indexed some documents (around 12000).
Each documents contains a Abstract-en field (and some other fields).

Is it possible to use Carrot2 to create cluster (classes) with the
Abstract-en field?

What must I configure in the schema.xml ? or in other files?

Sorry for my newbie question, but I found only documentation for Workbench
tool.

Bruno

Thx for this link but I have a problem to configure my solrconfig.xml in 
the section:

(note I run java -Dsolr.clustering.enabled=true)

I have a field named abstract-en, and I would like to use only this field.

I would like to know if my requestHandler is good?
I have a doubt with the content of  : carrot.title, carrot.url

and also the latest field
str name=dfabstract-en/str
str name=defTypeedismax/str
str name=qf
  abstract-en^1.0
/str
str name=q.alt*:*/str
str name=rows10/str
str name=fl*,score/str

because the result when I do a request is exactly like a search request 
(without more information)



My entire requestHandler is:

requestHandler name=/clustering startup=lazy 
enable=${solr.clustering.enabled:false} class=solr.SearchHandler

lst name=defaults
bool name=clusteringtrue/bool
str name=clustering.enginedefault/str
bool name=clustering.resultstrue/bool
!-- The title field --
str name=carrot.titlename/str
str name=carrot.urlid/str
!-- The field to cluster on --
str name=carrot.snippetabstract-en/str
!-- produce summaries --
bool name=carrot.produceSummarytrue/bool
!-- the maximum number of labels per cluster --
!--int name=carrot.numDescriptions5/int--
!-- produce sub clusters --
bool name=carrot.outputSubClustersfalse/bool
str name=dfabstract-en/str
str name=defTypeedismax/str
str name=qf
  abstract-en^1.0
/str
str name=q.alt*:*/str
str name=rows10/str
str name=fl*,score/str
/lst
arr name=last-components
strclustering/str
/arr
/requestHandler



Re: Newbie with Carrot2?

2012-05-20 Thread Xue-Feng Yang
Thanks, 

Xue-Feng




 From: Koji Sekiguchi k...@r.email.ne.jp
To: solr-user@lucene.apache.org 
Sent: Sunday, May 20, 2012 10:33:53 AM
Subject: Re: Newbie with Carrot2?
 
(12/05/20 23:21), Xue-Feng Yang wrote:
 Hi Staszek,

 I haven't found a way for inputting data into solr in the wiki. Does that 
 mean docs can be inputted in a normal solr way after configuration? for 
 example, DIH or solrj.

 Thanks,

 Xue-Feng

Right, because Carrot2 clustering is for search time.

koji
-- 
Query Log Visualizer for Apache Solr
http://soleami.com/