Re: Compiling Mahout with maven in Eclipse

2014-03-13 Thread Kevin Moulart
Ok it does compile with maven in eclipse as well, but still, many imports are not recognized in the sources : - import org.apache.mahout.math.function.IntObjectProcedure; - import org.apache.mahout.math.map.OpenIntLongHashMap; - import org.apache.mahout.math.map.OpenIntObjectHashMap; - import

Re: Compiling Mahout with maven in Eclipse

2014-03-13 Thread Kevin Moulart
How can I generate them to make these errors go away then ? Or don't I have to ? Kévin Moulart 2014-03-13 9:17 GMT+01:00 Sebastian Schelter ssc.o...@googlemail.com: Those are autogenerated. On 03/13/2014 09:05 AM, Kevin Moulart wrote: Ok it does compile with maven in eclipse as well, but

Re: Compiling Mahout with maven in Eclipse

2014-03-13 Thread Sebastian Schelter
Maven should generate the classes automatically. Have you tried running mvn -DskipTests clean install on the commandline? On 03/13/2014 09:50 AM, Kevin Moulart wrote: How can I generate them to make these errors go away then ? Or don't I have to ? Kévin Moulart 2014-03-13 9:17 GMT+01:00

Re: Compiling Mahout with maven in Eclipse

2014-03-13 Thread Kevin Moulart
I did, but then it fails because of these missing files : https://gist.github.com/kmoulart/9524828 Kévin Moulart 2014-03-13 9:57 GMT+01:00 Sebastian Schelter s...@apache.org: Maven should generate the classes automatically. Have you tried running mvn -DskipTests clean install on the

Re: Compiling Mahout with maven in Eclipse

2014-03-13 Thread Sebastian Schelter
Are executing maven in the topmost directory? On 03/13/2014 10:09 AM, Kevin Moulart wrote: I did, but then it fails because of these missing files : https://gist.github.com/kmoulart/9524828 Kévin Moulart 2014-03-13 9:57 GMT+01:00 Sebastian Schelter s...@apache.org: Maven should generate

verbose output

2014-03-13 Thread Mahmood Naderan
Hi, Is there any verbosity flag for hadoop and mahout commands? I can not find such thing in the command line.   Regards, Mahmood

Re: verbose output

2014-03-13 Thread Sebastian Schelter
To my knowledge, there is no such flag for mahout. You can check hadoop's logs for further information however. On 03/13/2014 10:21 AM, Mahmood Naderan wrote: Hi, Is there any verbosity flag for hadoop and mahout commands? I can not find such thing in the command line. Regards, Mahmood

Fwd: Compiling Mahout with maven in Eclipse

2014-03-13 Thread Kevin Moulart
Here are the followings of the conversation between Sebastian and me, since I hit reply instead of reply all at some point. Kévin Moulart -- Forwarded message -- From: Sebastian Schelter s...@apache.org Date: 2014-03-13 10:33 GMT+01:00 Subject: Re: Compiling Mahout with maven

Commons IO version mismatch with CDH 4.6

2014-03-13 Thread Bikash Gupta
Hi, Running Kmeans in cluster of CDH 4.6 I have a new issue with commons-io compatibility Exception in thread main java.lang.NoSuchMethodError: org.apache.commons.io.IOUtils.closeQuietly(Ljava/io/Closeable;)V at

Re: verbose output

2014-03-13 Thread Mahmood Naderan
The hadoop-2.3.0/log is empty when I run mahout command which uses hadoop   Regards, Mahmood On Thursday, March 13, 2014 12:53 PM, Sebastian Schelter s...@apache.org wrote: To my knowledge, there is no such flag for mahout. You can check hadoop's logs for further information however. On

Re: Solving heap size error

2014-03-13 Thread Mahmood Naderan
Strange thing is that if I use either -Xmx128m of -Xmx16384m the process stops at the chunk #571 (571*64=36.5GB). Still I haven't figured out is this a problem with JVM or Hadoop or Mahout? I have tested various parameters on 16GB RAM property namemapred.map.child.java.opts/name

Re: Commons IO version mismatch with CDH 4.6

2014-03-13 Thread Bikash Gupta
My Bad!!! I was pointing to wrong jar..sorry for this... On Thu, Mar 13, 2014 at 4:11 PM, Bikash Gupta bikash.gupt...@gmail.comwrote: Hi, Running Kmeans in cluster of CDH 4.6 I have a new issue with commons-io compatibility Exception in thread main java.lang.NoSuchMethodError:

Re: Website, urgent help needed

2014-03-13 Thread Sebastian Schelter
Hi Scott, Create a jira ticket and attach your scripts and a text version of the page there. Best, Sebastian On 03/12/2014 03:27 PM, Scott C. Cote wrote: I took the tour of the text analysis and pushed through despite the problems on the page. Commiters helped me over the hump where

bug report

2014-03-13 Thread Mahmood Naderan
Hi Where can I submit a mahout bug? I am not familiar with JIRA and I see issues and agile.   Regards, Mahmood

Re: Solving heap size error

2014-03-13 Thread Mahmood Naderan
I am pretty sure that there is something wrong with hadoop/mahout/java. With any configuration, it stuck at the chunk #571. Previous chunks are created rapidly but I see it waits for bout 30 minutes on 571 and that is the reason for heap error size. I will try to submit a bug report.  

Re: bug report

2014-03-13 Thread Andrew Musselman
https://issues.apache.org/jira/browse/MAHOUT/?selectedTab=com.atlassian.jira.jira-projects-plugin:summary-panel Create Issue button at the top of the page. On Thu, Mar 13, 2014 at 11:29 AM, Mahmood Naderan nt_mahm...@yahoo.comwrote: Hi Where can I submit a mahout bug? I am not familiar with

Re: Solving heap size error

2014-03-13 Thread Andrew Musselman
What's your larger goal here; are you putting Hadoop and Mahout through paces as an exercise? If your process is blowing through data quickly up to a certain point there may be something happening with a common value, which is a data bug. I don't know what this wikipedia splitter class does but

Re: bug report

2014-03-13 Thread Ted Dunning
You have to be logged in to JIRA to do this. To log in, you may need to create an account. On Thu, Mar 13, 2014 at 11:33 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: https://issues.apache.org/jira/browse/MAHOUT/?selectedTab=com.atlassian.jira.jira-projects-plugin:summary-panel

Re: bug report

2014-03-13 Thread Andrew Musselman
That's right, thanks On Thu, Mar 13, 2014 at 11:52 AM, Ted Dunning ted.dunn...@gmail.com wrote: You have to be logged in to JIRA to do this. To log in, you may need to create an account. On Thu, Mar 13, 2014 at 11:33 AM, Andrew Musselman andrew.mussel...@gmail.com wrote:

Re: Solving heap size error

2014-03-13 Thread Mahmood Naderan
We used the wikipedia splitter as a benchmark for our simulation on hadoop 0.2. I am now trying to run that on the latest hadoop to be up to date and check some differences. For now, I have no other choice.   Regards, Mahmood On Thursday, March 13, 2014 10:12 PM, Andrew Musselman

Re: bug report

2014-03-13 Thread Mahmood Naderan
OK I see that. thanks   Regards, Mahmood On Thursday, March 13, 2014 10:25 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: That's right, thanks On Thu, Mar 13, 2014 at 11:52 AM, Ted Dunning ted.dunn...@gmail.com wrote: You have to be logged in to JIRA to do this.  To log in,

Re: Website, urgent help needed

2014-03-13 Thread Scott C. Cote
I have created issue https://issues.apache.org/jira/browse/MAHOUT-1461 Will upload shell scripts and suggested replacement text later tonight …. SCott On 3/13/14, 10:43 AM, Sebastian Schelter s...@apache.org wrote: Hi Scott, Create a jira ticket and attach your scripts and a text version of

local file input for seqdirectory

2014-03-13 Thread Steven Cullens
Hi, I have a large number of files on the order of kilobytes on my local machine that I want to convert to a sequence file on HDFS. Whenever, I try to copy the local files to HDFS, hadoop complains about bad blocks, presumably because each block is 64mb and there are more files than blocks. In

Re: local file input for seqdirectory

2014-03-13 Thread Suneel Marthi
The workaround is to add -xm sequential. A MR version of seqdirectory was introduced in 0.8 and hence the default execution mode is MR if none is specified. On Thursday, March 13, 2014 4:12 PM, Steven Cullens srcull...@gmail.com wrote: Hi, I have a large number of files on the order of

Re: local file input for seqdirectory

2014-03-13 Thread Steven Cullens
Thanks, Suneel. On Thu, Mar 13, 2014 at 4:17 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: The workaround is to add -xm sequential. A MR version of seqdirectory was introduced in 0.8 and hence the default execution mode is MR if none is specified. On Thursday, March 13, 2014 4:12