JAVA_HOME not set

2010-05-18 Thread Erik Test
Hi All, I continually get this error when trying to run start-all.sh for hadoop 0.20.2 on ubuntu. What confuses me is I DO have JAVA_HOME set in hadoop-env.sh to /usr/lib/jvm/jdk1.6.0_17. I've double checked to see that JAVA_HOME is set to this by echoing the path before running the start script b

Re: JAVA_HOME not set

2010-05-18 Thread Erik Test
. > > If you think it might be the same problem, please weigh in: > > > http://getsatisfaction.com/cloudera/topics/java_home_setting_in_hadoop_env_sh_not_respected_in_cdh_3 > > > - David > > > On Tue, May 18, 2010 at 12:30 PM, Erik Test wrote: > > Hi All, >

Re: JAVA_HOME not set

2010-05-19 Thread Erik Test
on.com/cloudera/topics/java_home_setting_in_hadoop_env_sh_not_respected_in_cdh_3 >> >> >> - David >> >> >> On Tue, May 18, 2010 at 12:30 PM, Erik Test >> wrote: >> >>> Hi All, >>> >>> I continually get this error when try

TaskTracker and DataNodes cannot connect to master node (NoRouteToHostException)

2010-05-25 Thread Erik Test
Hello All, I've been unable to resolve this problem on my own so I've decided to ask for help. I've pasted the logs I have for the DataNode on one of the slave nodes. The logs for TaskTracker are essentially the same (i.e. same exception causing a shutdown). Any suggestions or hints as to what could

Re: TaskTracker and DataNodes cannot connect to master node (NoRouteToHostException)

2010-05-26 Thread Erik Test
I'm able to ssh and ping from the slave node to the master node w/o problems. I've opened the ports on the master node to receive communication from the slave nodes on the port but still no luck. I'm going to try opening the port on the slave node to communicate with the master node next. The pl

Re: TaskTracker and DataNodes cannot connect to master node (NoRouteToHostException)

2010-05-26 Thread Erik Test
tings: Direct connection Erik On 26 May 2010 10:20, Steve Loughran wrote: > Erik Test wrote: > >> I'm able to ssh and ping from the slave node to the master node w/o >> problems. I've opened the ports on the master node to receive communication >> from the sl

Re: TaskTracker and DataNodes cannot connect to master node (NoRouteToHostException)

2010-05-26 Thread Erik Test
I confirmed that the hostname for the machine in the /etc/hosts file points to the actual address of the machine and not the local loopback. However, I see that the ports reported in the log file are not available in the iptables. I'm new to configuring iptables (i.e. I made my first configuration

Re: TaskTracker and DataNodes cannot connect to master node (NoRouteToHostException)

2010-05-26 Thread Erik Test
[PROBLEM SOLVED] I'm running on an internal network so I shut down iptables on two internal nodes. I was then able to run one node as a slave and another as a master. Thanks! Erik On 26 May 2010 13:01, Eric Sammer wrote: > On Wed, May 26, 2010 at 12:38 PM, Erik Test wrote: > > I

General question for warning message.

2010-07-28 Thread Erik Test
Hello, I'm getting the following messages when I try to run a job I've developed. hadoop jar distanceCalc.jar DistanceCalc distanceCalculations distanceCalculations/output9 10/07/28 09:25:37 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement

Re: General question for warning message.

2010-07-28 Thread Erik Test
2010 11:57, Himanshu Vashishtha wrote: > Hello, > The exception is not related to the warning message. > > It is a Reflection error. You have defined a custom Input format. Provide a > no arg constructor and then test. > > On Wed, Jul 28, 2010 at 9:42 AM, Erik Test wrote: >

Distance Calculation problem

2010-07-28 Thread Erik Test
Hello, I've implemented a program using map reduce for simple distance calculations between two 2D points. I've set up my input such that all calculations should be the same but they are not. This makes me think I'm doing something wrong within a map and/or reduce function. Here is my output. 1
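For readers following this thread, here is a minimal sketch of the per-record calculation being discussed. The snippet above does not say which metric the poster used, so Euclidean distance is an assumption here, and the class and method names (DistanceSketch, distance) are hypothetical. It is plain Java with no Hadoop dependency, which makes it easy to check the arithmetic on its own before placing it inside a map function.

```java
public class DistanceSketch {

    /**
     * Euclidean distance between the 2D points (x1, y1) and (x2, y2).
     * Assumption: the thread does not name the metric; Euclidean is used here.
     */
    public static double distance(double x1, double y1, double x2, double y2) {
        // Math.hypot computes sqrt(dx*dx + dy*dy) without intermediate overflow.
        return Math.hypot(x2 - x1, y2 - y1);
    }

    public static void main(String[] args) {
        // Identical point pairs should always yield identical distances.
        System.out.println(distance(0, 0, 3, 4));
        System.out.println(distance(0, 0, 3, 4));
    }
}
```

If the same pair of points yields different results across records in a job, the arithmetic is unlikely to be the cause; how the input line is parsed or how values are grouped before the calculation would be the next things to check.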

Re: Distance Calculation problem

2010-07-30 Thread Erik Test
e.tasks*). You do not need the reduce step. > > Alex K > > On Wed, Jul 28, 2010 at 11:44 AM, Erik Test wrote: > > > Hello, > > > > I've implemented a program using map reduce for a simple distance > > calculations between two 2D points. I've set up

Set variables in mapper

2010-08-02 Thread Erik Test
Hi, I'm trying to set a variable in my mapper class by reading an argument from the command line and then passing the entry to the mapper from main. Is this possible? public static void main(String[] args) throws Exception { JobConf conf = new JobConf(DistanceCalc2.class); conf.setJob

Re: Set variables in mapper

2010-08-03 Thread Erik Test
Really? This seems pretty nice. In the future, with your implementation, would the value always have to be wrapped in a MyMapper instance? How would parameters be removed if necessary? Erik On 3 August 2010 02:37, Owen O'Malley wrote: > > On Aug 2, 2010, at 9:17 AM, Erik Test wrot

Re: Set variables in mapper

2010-08-03 Thread Erik Test
Oh, OK. Yes, this is clear now. Thanks for the explanation. Erik On 3 August 2010 11:34, Owen O'Malley wrote: > > On Aug 3, 2010, at 6:12 AM, Erik Test wrote: > > Really? This seems pretty nice. >> >> In the future, with your implementation, would the value alwa

[Input split] File manipulation

2010-08-17 Thread Erik Test
Hello, I'm trying to determine how to split a file evenly so each map task has a similar work load. The input I will have is a list of coordinates like this: 2,8 3,9 4,10 5,7 6,2 7,3 8,1 9,0 10,4 Since there are 9 inputs in this example, I would like to split the records so that there would be 3
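The arithmetic behind an even split can be sketched in plain Java. This is only an illustration of dividing a record list into contiguous groups whose sizes differ by at most one; it is not how Hadoop computes input splits. The snippet above is cut off, so the group count of 3 used in main is an assumption, and the names EvenSplitSketch and split are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class EvenSplitSketch {

    /** Splits records into `parts` contiguous groups whose sizes differ by at most one. */
    public static <T> List<List<T>> split(List<T> records, int parts) {
        if (parts <= 0) {
            throw new IllegalArgumentException("parts must be positive");
        }
        List<List<T>> groups = new ArrayList<>();
        int base = records.size() / parts;   // minimum size of every group
        int extra = records.size() % parts;  // the first `extra` groups get one more record
        int start = 0;
        for (int i = 0; i < parts; i++) {
            int size = base + (i < extra ? 1 : 0);
            groups.add(new ArrayList<>(records.subList(start, start + size)));
            start += size;
        }
        return groups;
    }

    public static void main(String[] args) {
        // The nine coordinate records from the message above.
        List<String> coords = Arrays.asList(
                "2,8", "3,9", "4,10", "5,7", "6,2", "7,3", "8,1", "9,0", "10,4");
        // Assumed group count of 3: each group receives three records.
        for (List<String> group : split(coords, 3)) {
            System.out.println(group.size() + " records: " + group);
        }
    }
}
```

Within Hadoop itself, NLineInputFormat is one place to look when each map task should receive a fixed number of input lines; whether it suits this particular job is not something the thread snippet settles.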

Re: Reg: Source code for x86 windows

2010-08-17 Thread Erik Test
Hello, You will need Cygwin installed on your Windows machine for Hadoop to run on it. Also, you can go to this link for installation instructions. http://hadoop.apache.org/common/docs/current/quickstart.html Erik On 17 August 2010 11:12, vigneshwara balaji wrote: > Hi, > > I am

Re: [Input split] File manipulation

2010-08-17 Thread Erik Test
is large enough, you do > not need to worry about the splitting; only one split (the last split) > has a different size, and all the other splits have the same size. > > > > On Tue, Aug 17, 2010 at 7:50 AM, Erik Test wrote: > > Hello, > > > > I'm trying to determin

Re: [Input split] File manipulation

2010-08-17 Thread Erik Test
lit size is 64M which is the block size, and you can > change it by configuration. > What is the file type of your input file? If it's gz, it cannot be > split, and you will always get only one mapper task. > > > On Wed, Aug 18, 2010 at 12:03 AM, Erik Test wrote: > > I

Multiple map-reduce iterations

2010-08-18 Thread Erik Test
Hello, I would like to make multiple map-reduce iterations on a job but I'm not sure how to do this. Would it be possible to run a configured job and then reconfigure the input path to the previous output path so I could run the same job with new input? I'm new to Hadoop so this looks like a poss

[chaining] manipulate job conf in reducer

2010-08-23 Thread Erik Test
Hello, I'm fairly new to Hadoop and I'm writing a k-means clustering algorithm using only Hadoop. What I would like to do is determine the new centroids in the reducer, pass the new centroid values back to main, and then run another map/reduce job. Here's the code I have for the reducer. public