Re: How to SSVD output to generate Clusters

2013-08-08 Thread Dmitriy Lyubimov
this successfully. Let me know if you find something. Sent: Friday, August 02, 2013 11:39 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters By eyeballing the code, I think I don't see a problem. If rows of A are named values, then rows of U (or U*Sigma or U*Sigma^1

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Stuti Awasthi
Lyubimov [mailto:dlie...@gmail.com] Sent: Friday, August 02, 2013 11:39 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters By eyeballing the code, I think I don't see a problem. If rows of A are named values, then rows of U (or U*Sigma or U*Sigma^1/2) would also

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters By eyeballing the code, I think I don't see a problem. If rows of A are named values, then rows of U (or U*Sigma or U*Sigma^1/2) would also retain the names from the rows of A. Output would not contain NamedVector values

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Stuti Awasthi
@mahout.apache.org Subject: RE: How to SSVD output to generate Clusters Thanks Stuti. Yes it looks like it is not there. Let me run a test. One question: did you use -q 0 or 1 or something else? On Aug 7, 2013 12:18 AM, Stuti Awasthi stutiawas...@hcl.com wrote: Hey Dmitriy, Sorry

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Suneel Marthi
, August 07, 2013 2:14 PM To: user@mahout.apache.org Subject: RE: How to SSVD output to generate Clusters Thanks Stuti. Yes it looks like it is not there. Let me run a test. One question: did you use -q 0 or 1 or something else? On Aug 7, 2013 12:18 AM, Stuti Awasthi stutiawas...@hcl.com wrote

RE: How to SSVD output to generate Clusters

2013-08-07 Thread Stuti Awasthi
...@yahoo.com] Sent: Wednesday, August 07, 2013 2:49 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters Are you working off of Mahout 0.8 or 0.7? hadoop jar mahout-distribution-0.7/mahout-core-0.8-job.jar From: Stuti Awasthi stutiawas

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
-k 90 -U true -V true --reduceTasks 1 Thanks Stuti Awasthi -Original Message- From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] Sent: Wednesday, August 07, 2013 2:14 PM To: user@mahout.apache.org Subject: RE: How to SSVD output to generate Clusters Thanks Stuti. Yes it looks like

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
Lyubimov [mailto:dlie...@gmail.com] Sent: Wednesday, August 07, 2013 2:14 PM To: user@mahout.apache.org Subject: RE: How to SSVD output to generate Clusters Thanks Stuti. Yes it looks like it is not there. Let me run a test. One question: did you use -q 0 or 1 or something else? On Aug 7

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
/Output -k 90 -U true -V true --reduceTasks 1 Thanks Stuti Awasthi -Original Message- From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] Sent: Wednesday, August 07, 2013 2:14 PM To: user@mahout.apache.org Subject: RE: How to SSVD output to generate Clusters Thanks Stuti. Yes it looks

Re: How to SSVD output to generate Clusters

2013-08-07 Thread Dmitriy Lyubimov
true --reduceTasks 1 Thanks Stuti Awasthi -Original Message- From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] Sent: Wednesday, August 07, 2013 2:14 PM To: user@mahout.apache.org Subject: RE: How to SSVD output to generate Clusters Thanks Stuti. Yes it looks like it is not there. Let

RE: How to SSVD output to generate Clusters

2013-08-02 Thread Stuti Awasthi
Dunning [mailto:ted.dunn...@gmail.com] Sent: Thursday, August 01, 2013 8:37 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters On Thu, Aug 1, 2013 at 5:49 AM, Stuti Awasthi stutiawas...@hcl.com wrote: I think there is a problem because of NamedVector; after some

Re: How to SSVD output to generate Clusters

2013-08-02 Thread Dmitriy Lyubimov
To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters On Thu, Aug 1, 2013 at 5:49 AM, Stuti Awasthi stutiawas...@hcl.com wrote: I think there is a problem because of NamedVector; after some searching I found this Jira. https://issues.apache.org/jira/browse/MAHOUT

RE: How to SSVD output to generate Clusters

2013-08-01 Thread Stuti Awasthi
-Original Message- From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] Sent: Wednesday, July 31, 2013 11:15 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters Many people also use the PCA workflow options with SSVD and then try to cluster the output U*Sigma, which

Re: How to SSVD output to generate Clusters

2013-08-01 Thread Chirag Lakhani
the clustering with this data. Thanks Stuti Awasthi -Original Message- From: Dmitriy Lyubimov [mailto:dlie...@gmail.com] Sent: Wednesday, July 31, 2013 11:15 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters Many people also use the PCA workflow options with SSVD

Re: How to SSVD output to generate Clusters

2013-08-01 Thread Ted Dunning
On Thu, Aug 1, 2013 at 5:49 AM, Stuti Awasthi stutiawas...@hcl.com wrote: I think there is a problem because of NamedVector; after some searching I found this Jira. https://issues.apache.org/jira/browse/MAHOUT-1067 Note also that this bug is fixed in 0.8

Re: How to SSVD output to generate Clusters

2013-08-01 Thread Ted Dunning
, July 31, 2013 11:15 PM To: user@mahout.apache.org Subject: Re: How to SSVD output to generate Clusters Many people also use the PCA workflow options with SSVD and then try to cluster the output U*Sigma, which is a dimensionally reduced representation of the original row-wise dataset. To enable PCA

How to SSVD output to generate Clusters

2013-07-31 Thread Stuti Awasthi
Hi All, I wanted to group together documents that share the same context but belong to one single domain. I have tried the KMeans and LDA implementations provided in Mahout to perform the clustering, but the groups that are generated are not very good. Hence I thought to use LSA to identify the context related

Re: How to SSVD output to generate Clusters

2013-07-31 Thread Dmitriy Lyubimov
Many people also use the PCA workflow options with SSVD and then try to cluster the output U*Sigma, which is a dimensionally reduced representation of the original row-wise dataset. To enable PCA and U*Sigma output, use ssvd -pca true -us true -u false -v false -k=... -q=1 ... -q=1 recommended for
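
For readers wondering why U*Sigma works as clustering input, the relation can be sketched as below. This is only an illustration under stated assumptions, not a description of Mahout internals: A is the m x n input matrix, k is the requested rank, the exact thin SVD is used in place of the stochastic approximation, and the PCA option is assumed to behave like subtracting the column mean vector mu from every row of A (mu and A_c are names introduced here for illustration).

```latex
% Assumption: PCA mode works on the mean-centered matrix A_c.
A_c = A - \mathbf{1}\,\mu^{\top}, \qquad \mu = \tfrac{1}{m} A^{\top}\mathbf{1}

% Rank-k SVD (SSVD computes an approximation of this).
A_c \approx U_k \Sigma_k V_k^{\top}, \qquad V_k^{\top} V_k = I_k

% Multiply the exact SVD on the right by V_k:
A_c V_k = U_k \Sigma_k

% So row i of U_k \Sigma_k is row i of A_c expressed in the
% k leading principal directions (columns of V_k):
(U_k \Sigma_k)_{i,:} = (A_c)_{i,:}\, V_k \in \mathbb{R}^{k}
```

In other words, each row of U*Sigma is the corresponding input row projected onto k dimensions, so row order (and row names, where they are preserved) still line up with the rows of A when the output is fed to a clustering step.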