Hi Frank,

Thanks for the link. That was useful. It's still bit unclear on how he built 
his index. are we saying, we index  clusterId,clusterSize and clusterLable in 
the same index (where other data is indexed)? So one index will have two sets 
of Solr documents in it?  one containing cluster info? 

My requirement again; I have bunch of db columns which are being indexed. e.g. 
Title,             RiskLevel1, RiskLevel2,RiskLevel3 etc
Title1        High             Medium      Low

Current requirement is to cluster documents based on their riskLevels and NOT 
the title. 

Thanks,


________________________________
 From: Frank Scholten <fr...@frankscholten.nl>
To: user@mahout.apache.org; Vikas Pandya <vika...@yahoo.com> 
Sent: Thursday, January 19, 2012 4:24 AM
Subject: Re: How to present mahout cluster in combination with Solr results
 
Hi Vikas,

I suggest indexing the cluster label, cluster size and
cluster-document mappings so you can use that information to build a
tag cloud of your data. Checkout this presentation
http://java.dzone.com/videos/configuring-mahout-clustering

Cheers,

Frank

On Thu, Jan 19, 2012 at 4:18 AM, Vikas Pandya <vika...@yahoo.com> wrote:
> Hello,
>
> I have successfully created vectors from reading my existing Solr Index. Then 
> created sequenceFile and mahout clusters from it. As I understand that 
> currently solr and mahout clustering aren't integrated, what's the best way 
> to represent mahout clusters to the user? Mine is a search application which 
> renders results by querying solr index. Now I need to incorporate Mahout 
> created clusters in the result. While Solr-Mahout integration isn't there 
> yet, what's the best alternative way to represent this info?
>
> Thanks,

Reply via email to