Re: Need help with unexpected ClassNotFoundException when using GNU Trove library in custom jar MapReduce job

2011-03-31 Thread John Therrell
Just found the answer. I didn't realize that I needed to do either of two options: 1. If using the GenericOptionsParser, pass a -libjars argument. 2. Merge all dependent jar libraries and main jar into one jar following suggestions in http://stackoverflow.com/questions/81260/java-easiest-way-to-m

Re: mailing list removal?

2011-03-31 Thread John Therrell
mail to mapreduce-user-unsubscr...@hadoop.apache.org to unsubscribe. On Thu, Mar 31, 2011 at 6:55 AM, David Heneghan wrote: > hi, i inadvertently got onto this list and it's not all that relevant for > me, can anyone suggest how i get off it? > > Many thanks > > - Original Message - > *

Need help with unexpected ClassNotFoundException when using GNU Trove library in custom jar MapReduce job

2011-03-31 Thread John Therrell
Hi everyone, I'm experiencing a chain of runtime exceptions which I believe starts with a ClassNotFoundException for a Trove library class I'm using in my custom jar. Here's a copy of the error output: http://pastie.org/1741485. I created a test java application that used the same Trove library a

Re: distributed cache exceeding local.cache.size

2011-03-31 Thread Allen Wittenauer
On Mar 31, 2011, at 11:45 AM, Travis Crawford wrote: > Is anyone familiar with how the distributed cache deals when datasets > larger than the total cache size are referenced? I've disabled the job > that caused this situation but am wondering if I can configure things > more defensively.

distributed cache exceeding local.cache.size

2011-03-31 Thread Travis Crawford
Hi mapreduce gurus - Today while looking into a few TaskTracker's with full disks I came across the following directory using 207GB: /data/disk3/mapred/local/taskTracker/archive/namenode.foo.com/tmp/temp-2024081/tmp942425908/tmp942425908 Digging a bit further a job did indeed reference a 200+GB

mailing list removal?

2011-03-31 Thread David Heneghan
Re: Mapper and Reducer of Terasort examplehi, i inadvertently got onto this list and it's not all that relevant for me, can anyone suggest how i get off it? Many thanks - Original Message - From: Amareshwari Sri Ramadasu To: mapreduce-user@hadoop.apache.org Sent: Thursday, Marc

Stopping hadoop from deleting job directory

2011-03-31 Thread Paweł Łoziński
Hi, is there a config param that prevents m/r from deleting the job directory after the job is finished/failed? I need to look into the jar file with which the job is executed. Best regards, PŁ

Re: Mapper and Reducer of Terasort example

2011-03-31 Thread Amareshwari Sri Ramadasu
The classes are org.apache.hadoop.mapreduce.Mapper and org.apache.hadoop.mapreduce.Reducer. Thanks Amareshwari On 3/31/11 7:13 AM, "Wantao" wrote: Hi all, I did not find where Mapper and Reducer are set in source code of the Terasort example in Hadoop 0.21.0. So I guess default Mapper and Re

Re: Need 0.20.2 new API documentation/examples, where are they?

2011-03-31 Thread Amareshwari Sri Ramadasu
John, Examples and libraries are rewritten to use new api in branch 0.21. You can have a look at them. New api in branch 0.20 is not stable yet. And old api is undeprecated in branch 0.21. So, you can use old api still. Thanks Amareshwari On 3/30/11 11:38 PM, "John Therrell" wrote: I'm lookin