Re: Running Job on Multinode Yarn Cluster

Telles Nobrega Sat, 09 Aug 2014 04:35:27 -0700

Hi Chris,

I think the problem is that I forgot to update the yarn.job.package.
I will try again to see if it works now.


I have one more question, how can I stop (command line) the jobs running in my 
topology, for the experiment that I will run, I need to run the same job in 4 
minutes intervals. So I need to kill it, clean the kafka topics and rerun.

Thanks in advance.

On 08 Aug 2014, at 12:41, Chris Riccomini <[email protected]> 
wrote:

> Hey Telles,
> 
>>> Do I need to have the job folder on each machine in my cluster?
> 
> No, you should not need to do this. There are two ways to deploy your
> tarball to the YARN grid. One is to put it in HDFS, and the other is to
> put it on an HTTP server. The link to running a Samza job in a multi-node
> YARN cluster describes how to do both (either HTTP server or HDFS).
> 
> In both cases, once the tarball is put in on the HTTP/HDFS server(s), you
> must update yarn.package.path to point to it. From there, the YARN NM
> should download it for you automatically when you start your job.
> 
> * Can you send along a paste of your job config?
> 
> Cheers,
> Chris
> 
> On 8/8/14 8:04 AM, "Claudio Martins" <[email protected]> wrote:
> 
>> Hi Telles, it looks to me that you forgot to update the
>> "yarn.package.path"
>> attribute in your config file for the task.
>> 
>> - Claudio Martins
>> Head of Engineering
>> MobileAware USA Inc. / www.mobileaware.com
>> office: +1  617 986 5060 / mobile: +1 617 480 5288
>> linkedin: www.linkedin.com/in/martinsclaudio
>> 
>> 
>> On Fri, Aug 8, 2014 at 10:55 AM, Telles Nobrega <[email protected]>
>> wrote:
>> 
>>> Hi,
>>> 
>>> this is my first time trying to run a job on a multinode environment. I
>>> have the cluster set up, I can see in the GUI that all nodes are
>>> working.
>>> Do I need to have the job folder on each machine in my cluster?
>>> - The first time I tried running with the job on the namenode machine
>>> and
>>> it failed saying:
>>> 
>>> Application application_1407509228798_0001 failed 2 times due to AM
>>> Container for appattempt_1407509228798_0001_000002 exited with exitCode:
>>> -1000 due to: File
>>> 
>>> 
>>> file:/home/ubuntu/alarm-samza/samza-job-package/target/samza-job-package-
>>> 0.7.0-dist.tar.gz
>>> does not exist
>>> 
>>> So I copied the folder to each machine in my cluster and got this error:
>>> 
>>> Application application_1407509228798_0002 failed 2 times due to AM
>>> Container for appattempt_1407509228798_0002_000002 exited with exitCode:
>>> -1000 due to: Resource
>>> 
>>> 
>>> file:/home/ubuntu/alarm-samza/samza-job-package/target/samza-job-package-
>>> 0.7.0-dist.tar.gz
>>> changed on src filesystem (expected 1407509168000, was 1407509434000
>>> 
>>> What am I missing?
>>> 
>>> p.s.: I followed this
>>> <https://github.com/yahoo/samoa/wiki/Executing-SAMOA-with-Apache-Samza>
>>> tutorial
>>> and this
>>> <
>>> 
>>> http://samza.incubator.apache.org/learn/tutorials/0.7.0/run-in-multi-node
>>> -yarn.html
>>>> 
>>> to
>>> set up the cluster.
>>> 
>>> Help is much appreciated.
>>> 
>>> Thanks in advance.
>>> 
>>> --
>>> ------------------------------------------
>>> Telles Mota Vidal Nobrega
>>> M.sc. Candidate at UFCG
>>> B.sc. in Computer Science at UFCG
>>> Software Engineer at OpenStack Project - HP/LSD-UFCG
>>> 
>

Re: Running Job on Multinode Yarn Cluster

Reply via email to