Re: How to SSVD output to generate Clusters

2013-07-31 Thread Ted Dunning
Use the U matrix for clustering. Note that you are replicating what spectral clustering already does. On Wed, Jul 31, 2013 at 5:09 AM, Stuti Awasthi wrote: > Hi All, > > I wanted to group the documents with same context but which belongs to one > single domain together. I have tried KMeans

Re: How to SSVD output to generate Clusters

2013-07-31 Thread Dmitriy Lyubimov
Many people also use the PCA workflow with SSVD and then cluster the output U*Sigma, which is a dimensionally reduced representation of the original row-wise dataset. To enable PCA and U*Sigma output, use ssvd -pca true -us true -u false -v false -k=... -q=1 ... (-q=1 is recommended for accuracy).
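The PCA-then-cluster workflow described above can be sketched outside Mahout. The following is a minimal NumPy illustration, not Mahout code: an exact SVD stands in for the stochastic SVD, the data is synthetic, and the names pca_us, farthest_first_centers and kmeans are hypothetical helpers introduced only for this example. It centers the rows (the PCA step), keeps the top k columns of U*Sigma, and runs a plain k-means on the reduced rows. Note that in this sketch the rank k and the number of clusters are separate parameters.

```python
import numpy as np


def pca_us(a, k):
    """Return the first k columns of U*Sigma for the column-centered matrix a."""
    centered = a - a.mean(axis=0)  # PCA step: subtract the column means
    u, s, _vt = np.linalg.svd(centered, full_matrices=False)
    return u[:, :k] * s[:k]  # scale each kept column of U by its singular value


def farthest_first_centers(points, n_clusters):
    """Deterministic init: start at row 0, then repeatedly add the farthest row."""
    centers = [points[0]]
    while len(centers) < n_clusters:
        dists = np.min(
            [np.linalg.norm(points - c, axis=1) for c in centers], axis=0
        )
        centers.append(points[dists.argmax()])
    return np.array(centers)


def kmeans(points, n_clusters, n_iter=20):
    """Plain Lloyd iterations; returns one cluster label per row."""
    centers = farthest_first_centers(points, n_clusters)
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for c in range(n_clusters):
            members = points[labels == c]
            if len(members):  # leave a center in place if its cluster is empty
                centers[c] = members.mean(axis=0)
    return labels


# Synthetic stand-in for the document vectors: two well-separated groups
# of 20 rows each in 50 dimensions.
rng = np.random.default_rng(42)
group_a = rng.normal(0.0, 0.1, size=(20, 50))
group_b = rng.normal(0.0, 0.1, size=(20, 50)) + 3.0
a = np.vstack([group_a, group_b])

reduced = pca_us(a, k=5)              # 40 rows, 5 reduced dimensions
labels = kmeans(reduced, n_clusters=2)
print(reduced.shape, labels)
```

Clustering on U alone instead of U*Sigma would only require returning u[:, :k] without the scaling by s[:k]; the rest of the sketch stays the same.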

RE: How to SSVD output to generate Clusters

2013-08-01 Thread Stuti Awasthi
Thanks Stuti Awasthi -Original Message- From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] Sent: Wednesday, July 31, 2013 11:15 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters many people also use PCA options workflow with SSVD and then try clusterize the out

Re: How to SSVD output to generate Clusters

2013-08-01 Thread Chirag Lakhani
(My apologies, but I am having some problem in grasping the SSVD > algorithm. The concept of Rank is not clear to me) > > 4. If I generate -k =100 in SSVD, will I still be able to create say 10 > Clusters using the clustering with this data. > > Thanks > Stuti Awasthi > >

Re: How to SSVD output to generate Clusters

2013-08-01 Thread Ted Dunning
On Thu, Aug 1, 2013 at 5:49 AM, Stuti Awasthi wrote: > I think there is a problem because of NamedVector as after some search I > get this Jira. https://issues.apache.org/jira/browse/MAHOUT-1067 > Note also that this bug is fixed in 0.8

Re: How to SSVD output to generate Clusters

2013-08-01 Thread Ted Dunning
> > Number of cluster to be created . What is the purpose of -k(rank) in SSVD > > (My apologies, but I am having some problem in grasping the SSVD > > algorithm. The concept of Rank is not clear to me) > > > > 4. If I generate -k =100 in SSVD, will I still be abl

RE: How to SSVD output to generate Clusters

2013-08-02 Thread Stuti Awasthi
-Original Message- From: Ted Dunning [mailto:ted.dunn...@gmail.com] Sent: Thursday, August 01, 2013 8:37 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters On Thu, Aug 1, 2013 at 5:49 AM, Stuti Awasthi wrote: > I think there is a problem because of Named

Re: How to SSVD output to generate Clusters

2013-08-02 Thread Dmitriy Lyubimov
t > > Regards > Stuti Awasthi > > > > > > -----Original Message- > From: Ted Dunning [mailto:ted.dunn...@gmail.com] > Sent: Thursday, August 01, 2013 8:37 PM > To: user@mahout.apache.org > Subject: Re: How to SSVD output to generate Clusters > > On

Re: How to SSVD output to generate Clusters

2013-08-02 Thread Dmitriy Lyubimov
ut as : >> ,,. >> >> Please let me know if I have made any mistake in the end-to-end >> execution. Also, I'm not sure why the SSVD output is not generating the named >> vectors, as the issue is fixed. >> >> Please suggest >> >> Regards >

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Stuti Awasthi
riginal Message- From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] Sent: Friday, August 02, 2013 11:39 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters by eyeballing the code, i think i don't see a problem. if rows of A are named values, then row of U (or U*Si

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
> Stuti Awasthi > > -Original Message- > From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] > Sent: Friday, August 02, 2013 11:39 PM > To: user@mahout.apache.org > Subject: Re: How to SSVD output to generate Clusters > > by eyeballing the code, i think i don't s

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Stuti Awasthi
@mahout.apache.org Subject: RE: How to SSVD output to generate Clusters Thanks Stuti. Yes it looks like it is not there. Let me run a test. One question: did you use -q 0 or 1 or something else? On Aug 7, 2013 12:18 AM, "Stuti Awasthi" wrote: > Hey Dmitriy, > > Sorry for

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Suneel Marthi
R u working off of Mahout 0.8 or 0.7? "hadoop jar mahout-distribution-0.7/mahout-core-0.8-job.jar"  From: Stuti Awasthi To: "user@mahout.apache.org" Sent: Wednesday, August 7, 2013 5:12 AM Subject: RE: How to SSVD output to generat

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Stuti Awasthi
riginal Message- From: Suneel Marthi [mailto:suneel_mar...@yahoo.com] Sent: Wednesday, August 07, 2013 2:49 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters R u working off of Mahout 0.8 or 0.7? "hadoop jar mahout-distribution-0.7/mahout-core-0.8-job.jar"  ___

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
vectors -o /stuti/SSVD/Output -k 90 -U true -V > true --reduceTasks 1 > > Thanks > Stuti Awasthi > > -Original Message- > From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] > Sent: Wednesday, August 07, 2013 2:14 PM > To: user@mahout.apache.org > Subject: RE: Ho

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
rue -V >> true --reduceTasks 1 >> >> Thanks >> Stuti Awasthi >> >> -Original Message- >> From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] >> Sent: Wednesday, August 07, 2013 2:14 PM >> To: user@mahout.apache.org >> Subject: RE: How to SS

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
0.0787696939450401,37:0.070653 >>> > 56340476747,38:0.08437107545490818,39:0.06381670380272558,40:0.0464059 >>> > 64753673735,41:0.0601332388 >>> > >>> > 594578,42:-0.12996454299711707,43:0.10779361589915878,44:-0.0652470275 >>> > 4474347,45:0.0

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
>> > 533887785207,28:0.0019344239524856396,29:0.002936116541403362,30:-0.07 >>>> > 249587007236825,31:0.0016026176038041033,32:-0.075256224166,33:0.0 >>>> > 6603931206284432,34:0.01922 >>>> > >>>> > 6806201249697,35:0.1397278124533

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
doing this successfully. Let me know if you find something. > > Sent: Friday, August 02, 2013 11:39 PM >>>>> > To: user@mahout.apache.org >>>>> > Subject: Re: How to SSVD output to generate Clusters >>>>> > >>>>> > by