Re: Building Mahout Source

2015-06-12 Thread Raghuveer
is managed for released apache projects. Binary artifacts are available for downloads. Also if you are building from source, they would be found on standard places for a maven multimodule project, i.e. module-name/target/artifact-jar. On Jun 11, 2015 3:28 AM, Raghuveer alwaysra...@yahoo.com.invalid

Building Mahout Source

2015-06-11 Thread Raghuveer
I took mahout-0.10.0 source and build it using mvn install -DskipTests. I dont see any jar in root target folder or distribution target folder. Nor in the maven local repository in /.m2/repository/org/apache/mahout. And i dont see the maven repo with 0.10.0 version yet. Kindly suggest where

Analysing NDT data for POC

2015-05-22 Thread Raghuveer
I am doing a POC and have a dataset of the format ( client_ip, timestamp, bytes_transferred ) and trying to do the usecase Predict bytes_transferred for a particular client for a give timestamp. I got as dataset for example a client_ip 1.1.1.1 which has downloaded bytes 234 for timestamp

HMM Modeling

2015-05-18 Thread Raghuveer
I am running the following with HMM model:Step 1:echo 196 198 199 201 203 203 205 206 208 209 211 212 214 215 217 218 220 221 223 224 226 227 229 230 232 233 234 236 239 239 240 242 243 245 246 hmm-input Step 2:mahout baumwelch -i hmm-input -o hmm-model -nh 3 -no 250 -e .01 -m 1000 Step

HMM Model error

2015-05-11 Thread Raghuveer
of this error. Regards,Raghuveer

Re: HMM Model error

2015-05-11 Thread Raghuveer
Can you please tell me how is it 107 because i have only 64 elements and if i remove all the spaces its 90 elements, can you kindly explain. On Monday, May 11, 2015 5:21 PM, Max Heimel mhei...@gmail.com wrote: Hi Raghuveer, the crash was caused because you did not provide

Re: HMM Model error

2015-05-11 Thread Raghuveer
AM, Raghuveer alwaysra...@yahoo.com.INVALID wrote: Can you please tell me how is it 107 because i have only 64 elements and if i remove all the spaces its 90 elements, can you kindly explain.     On Monday, May 11, 2015 5:21 PM, Max Heimel mhei...@gmail.com wrote:   Hi Raghuveer

Re: Error running HMM model

2015-04-08 Thread Raghuveer
, Suneel Marthi suneel.mar...@gmail.com wrote: Could u  post the original issue more clearly formatted? its hard to discern from your earlier post as to what is wrong seems like an installation issue on ur end. On Wed, Apr 8, 2015 at 1:37 AM, Raghuveer alwaysra...@yahoo.com wrote: Same

Error running HMM model

2015-04-07 Thread Raghuveer
I am learning mahout usage and as suggested here am trying to run my sample but i get the below error, kindly suggestError: Could not find or load main class ..mahout Note: I have set MAHOUT_HOME to trunk and $PATH has $MAHOUT_HOME/bin in ~/.bashrc.Also am unable to run mahout from the

Re: Error running HMM model

2015-04-07 Thread Raghuveer
Same no change except Error: Could not find or load main class ..bin.mahout On Wednesday, April 8, 2015 10:55 AM, Suneel Marthi suneel.mar...@gmail.com wrote: From $MAHOUT_HOME try running ./bin/mahout and see if that works. On Wed, Apr 8, 2015 at 1:22 AM, Raghuveer alwaysra

Re: Error running HMM model

2015-04-07 Thread Raghuveer
Infact if i just type mahout i get the following error raghuveer@csstpdfc561:~/trunk$ mahout MAHOUT_LOCAL is set, so we don't add HADOOP_CONF_DIR to classpath. MAHOUT_LOCAL is set, running locally Error occurred during initialization of VM Could not reserve enough space for 3145728KB object heap

Re: Fw: Mahout dataset Vectorization

2015-03-25 Thread Raghuveer
IP address is not implicit infact its in the second row and is a server. Kindly suggest how i can do the kmeans clustering wrt timestamp or is there a better way? Regards,Raghuveer On Thursday, March 26, 2015 6:34 AM, Ted Dunning ted.dunn...@gmail.com wrote: This is an old question

Re: mahout failing with -c as required option

2015-03-10 Thread Raghuveer
centroids in the folder specified by -c. so yes -c is required. On Tue, Mar 10, 2015 at 1:42 AM, Raghuveer alwaysra...@yahoo.com.invalid wrote: No i have removed the -c option now so i get the mentioned exception that -c is mandatory.      On Tuesday, March 10, 2015 11:06 AM, Suneel Marthi

Re: mahout failing with -c as required option

2015-03-10 Thread Raghuveer
I see the error below: Running on hadoop, using /usr/local/hadoop/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /home/raghuveer/trunk/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar 15/03/10 11:50:20 INFO common.AbstractJob: Command line arguments: {--clustering=null, --clusters=[hdfs

Re: mahout failing with -c as required option

2015-03-10 Thread Raghuveer
the # of centroids specified as seed would be the k b) u let the algorithm choose random centroids by specifying -k, it needs -c to write the random centroids to hence -c is needed with -k. On Tue, Mar 10, 2015 at 2:09 AM, Raghuveer alwaysra...@yahoo.com wrote: ok so if -c is required then how can i

mahout failing with -c as required option

2015-03-09 Thread Raghuveer
Hi All, I am trying to run the command: ./mahout kmeans -i hdfs://master:54310/user/netlog/upload/output4/tfidf-vectors/part-r-0 -o  hdfs://master:54310//user/netlog/upload/output4/tfidf-vectors-kmeans-clusters-raghuveer -c  hdfs://master:54310/user/netlog/upload/mahoutoutput -dm

Re: mahout output of seq2sparse is empty

2015-03-03 Thread Raghuveer
andrew.mussel...@gmail.com wrote: I don't have a terminal in front of me but are you sure tfidf-vectors is a file, not a directory? On Tuesday, March 3, 2015, Raghuveer alwaysra...@yahoo.com.invalid wrote: I have data file of the formatsrc_ip,dest_ip,packet, bytes_transferred, src_port,dest_port

mahout output of seq2sparse is empty

2015-03-03 Thread Raghuveer
I have data file of the formatsrc_ip,dest_ip,packet, bytes_transferred, src_port,dest_port, start_timestamp 71.105.62.168, 38.106.70.147, 1, 54, 55704, 52747, 1341775056478 38.106.70.147, 71.105.62.168, 2, 1568, 52747, 55704, 1341775056478 Firstly the text like src_ip should be converted to a