Issue with Partial Implementation Problem

2013-01-17 Thread Ranjitha Chandrashekar
Hi I am using Partial Implementation for Random Forest classification. I have a training dataset with labels class0, class 1, class 2. The decision forest is built on this training dataset. The classification for the test dataset is computed using the same data descriptor generated for the

Re: Issue with Partial Implementation Problem

2013-01-17 Thread deneche abdelhakim
Hi Ranjitha, just put any numerical value in the label attribute. You should be able to classify the data, but you won't be able to compute the confusion matrix or the accuracy. On Thu, Jan 17, 2013 at 12:15 PM, Ranjitha Chandrashekar ranjitha...@hcl.com wrote: Hi I am using Partial

Re: is Hadoop based SVD_ALS a complete feature?

2013-01-17 Thread Pat Ferrel
+1 this, found the same problems, same fixes. Haven't seem your last problem On Jan 11, 2013, at 1:41 PM, Ying Liao yliao...@gmail.com wrote: I am tring factorize-movielens-1M.sh. I first find a bug in the sh file. Then I find a bug in org.apache.mahout.cf.taste.hadoop.als.DatasetSplitter, the

Re: is Hadoop based SVD_ALS a complete feature?

2013-01-17 Thread Sebastian Schelter
Which version/distribution of Hadoop are you using? On 17.01.2013 16:08, Pat Ferrel wrote: +1 this, found the same problems, same fixes. Haven't seem your last problem On Jan 11, 2013, at 1:41 PM, Ying Liao yliao...@gmail.com wrote: I am tring factorize-movielens-1M.sh. I first find a bug

Re: is Hadoop based SVD_ALS a complete feature?

2013-01-17 Thread Pat Ferrel
There is a problem in factorize-movielens-1M.sh and the DataSplitter needs to initialize the args parser before it accesses the options ( I think I put a ticket in for the DataSplitter with a patch). The last problem below is Ying Liao's alone. On Jan 17, 2013, at 7:12 AM, Sebastian Schelter

Re: How to start the factorize-movielens.sh from a PHP script ?

2013-01-17 Thread Sékine Coulibaly
Sebastian, This sounds reasonable. However, I observe that running the factorize-movielens script computes recommendations for *all* users. Is there a way to compute the recommandation for only one user ? The recommenditembased recommender allows for using an external file containing the user

Re: How to start the factorize-movielens.sh from a PHP script ?

2013-01-17 Thread Sebastian Schelter
Hi Sekine, I'm not sure I understand your problem correctly, What exactly is your usecase, how many users and items do you have? The mahout commandline tools only offer Hadoop-based recommenders that are designed to recommend in batch for millions of users and will usually take minutes to hours

trying to get grouplens example to run

2013-01-17 Thread Kamal Ali
I'm starting w mahout and I'm trying to get the grouplens example to run but having some difficulty . this is using mahout0.7 using jdk1.7 on a 64bit mac os 10.8 I followed instructions at https://cwiki.apache.org/MAHOUT/recommendationexamples.html and fetched the ml-100k.zip dataset from

Re: trying to get grouplens example to run

2013-01-17 Thread Sean Owen
That's the error right there: On Thu, Jan 17, 2013 at 9:57 PM, Kamal Ali k...@grokker.com wrote: Caused by: java.io.IOException: Unexpected input format on line: 1 1 5

Re: trying to get grouplens example to run

2013-01-17 Thread Kamal Ali
right, i realize that at a low level you and i have both localized the error. i guess i wasnt clear. i had expected the data i download per instructions to work consistently with the OOB mahout0.7 code. so there is a mismatch between the format that the code expects that as is supplied from the

Re: Any utility to solve the matrix inversion in Map/Reduce Way

2013-01-17 Thread Koobas
Martix inversion, as in explicitly computing the inverse, e.g. computing variance / covariance, or matrix inversion, as in solving a linear system of equations? On Thu, Jan 17, 2013 at 7:49 PM, Colin Wang colin.bin.wang.mah...@gmail.com wrote: Hi All, I want to solve the matrix inversion,

RE: trying to get grouplens example to run

2013-01-17 Thread Utkarsh Gupta
Hi Kamal, The code is written for 1M and 10M dataset Which has delimiter :: while the file you have specified has \t as delimiter Try to run with 1 M dataset If you want to run with this file only then just use this command to change the delimiter cat ratings.dat |tr -s \t :: new-ratings.dat

RE: Issue with Partial Implementation Problem

2013-01-17 Thread Ranjitha Chandrashekar
Hi Deneche Thank you for your quick response. I tried using the numerical value in the label attribute in the test data. Original Record in KDDTest :

Re: Issue with Partial Implementation Problem

2013-01-17 Thread deneche abdelhakim
My mistake. You should put any label value available in the training set. In the previous example, putting normal in all test record should be fine. On Fri, Jan 18, 2013 at 7:26 AM, Ranjitha Chandrashekar ranjitha...@hcl.com wrote: Hi Deneche Thank you for your quick response. I tried