Re: Remote connection bottleneck?

2010-09-25 Thread Mario M
hadoop/bin directory in my computer > > Why didn't you issue the command from window which had ssh ? > > > On Sat, Sep 25, 2010 at 6:53 PM, Mario M wrote: > >> Hi, >> what I did was this: >> >> I am working with Cygwin in Windows 7. >>

Re: Remote connection bottleneck?

2010-09-25 Thread Mario M
>>>> Please produce a jar, place it on one of the servers in the cloud and >>>> run from there. >>>> >>>> >>>> On Sat, Sep 25, 2010 at 7:46 AM, Raja Thiruvathuru < >>>> thiruvath...@gmail.com> wrote: >>>> >

Re: Log file questions

2010-09-25 Thread Mario M
Hi, it doesn't include the time needed to divide the input into splits for each map task; that I can tell you for sure (e.g. my program spends 1 minute processing and 30 minutes dividing the input, but the log shows only the one minute). Mario M 2010/9/25 Han Dong > Hi, > > I h

Re: Remote connection bottleneck?

2010-09-25 Thread Mario M
lf an hour doing that, and then the mapreduce phase runs fine in about a minute. But I believe it shouldn't take so long to divide the input, and that I am doing something wrong. 2010/9/25 Raja Thiruvathuru > Did u defined the Main Class in the manifest file? > > > On Sat, Sep 25, 2010

Re: Remote connection bottleneck?

2010-09-25 Thread Mario M
jar, place it on one of the servers in the cloud and run > from there. > > > On Sat, Sep 25, 2010 at 7:46 AM, Raja Thiruvathuru > wrote: > >> MapReduce doesn't download the actual data, but it reads meta-data before >> it starts MapReduce job >> >> >

Remote connection bottleneck?

2010-09-25 Thread Mario M
Hello, I am having a problem that might be expected behaviour. I am using a cloud with Hadoop remotely through ssh. I have a program that runs for about a minute; it processes a 200 MB file using NLineInputFormat, and the user decides how many lines the file is divided by. However, before the map-r
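
For readers following the thread, here is a rough sketch of why line-based splitting can be slow when the job is submitted from a remote machine. To cut a file every N lines, the submitting side has to scan the whole file to find the line boundaries, so the full 200 MB may travel over the connection before any map task starts. The function name `n_line_splits` and the sample data are hypothetical and only illustrate the idea; this is not Hadoop's actual code.

```python
import io


def n_line_splits(stream, lines_per_split):
    """Return (start_offset, length) byte ranges, one per group of lines.

    Illustrative only: the point is that every byte of the input must be
    read to locate the line boundaries before any split can be emitted.
    """
    if lines_per_split < 1:
        raise ValueError("lines_per_split must be at least 1")

    splits = []
    start = 0   # byte offset where the current split begins
    pos = 0     # total bytes consumed so far
    count = 0   # lines accumulated in the current split

    for line in stream:          # full sequential scan of the input
        pos += len(line)
        count += 1
        if count == lines_per_split:
            splits.append((start, pos - start))
            start = pos
            count = 0

    if count:                    # trailing partial group of lines
        splits.append((start, pos - start))
    return splits


# Five lines, two lines per split: three splits, the last one partial.
sample = io.BytesIO(b"a\nbb\nccc\ndddd\neeeee\n")
print(n_line_splits(sample, 2))  # [(0, 5), (5, 9), (14, 6)]
```

If the scan runs on a machine far from the data, its cost is dominated by the network rather than by the cluster, which would be consistent with the advice in the replies to run the jar from one of the servers in the cloud.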