running pig against yarn cluster

Adam Silberstein Fri, 27 Jun 2014 14:51:26 -0700

Hi,
I am running Pig against a CDH5 YARN cluster.  I am not running Pig from the
command line; I am running it programmatically from Java code.


The server I am running it from does NOT actually have Pig/Hadoop etc.
installed on it, so there is no mapred-site.xml, yarn-site.xml, etc.  All
configuration is done through a set of properties set at launch time from Java.

I need some help figuring out what properties to set.  Currently I have these 
properties set as pig properties:
yarn.resourcemanager.hostname
yarn.resourcemanager.address
mapreduce.framework.name
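
For concreteness, the properties listed above could be assembled in Java
roughly as follows. This is only a sketch of the setup being described, not a
confirmed fix: the class name PigYarnProps, the host rm.example.com, and the
use of port 8032 (the stock Hadoop 2 default for the ResourceManager address)
are assumptions for illustration, and the real cluster may use other values.

```java
import java.util.Properties;

// Hypothetical helper: builds the three properties named in this post.
class PigYarnProps {

    // Placeholder host (assumption); replace with the real ResourceManager host.
    static final String RM_HOST = "rm.example.com";

    static Properties build(String rmHost) {
        Properties p = new Properties();
        // Ask Hadoop to submit jobs to YARN rather than the local runner.
        p.setProperty("mapreduce.framework.name", "yarn");
        p.setProperty("yarn.resourcemanager.hostname", rmHost);
        // 8032 is the Hadoop 2 default port; assumed here, not confirmed for this cluster.
        p.setProperty("yarn.resourcemanager.address", rmHost + ":8032");
        return p;
    }

    public static void main(String[] args) {
        Properties p = build(RM_HOST);
        // Print what would be handed to Pig so the values can be checked by eye.
        p.list(System.out);
        // With Pig on the classpath, these properties would then be passed to
        // Pig, e.g. via new PigServer(ExecType.MAPREDUCE, p). That call is left
        // as a comment so this sketch compiles without the Pig jars.
    }
}
```

Whether these three properties alone are sufficient is exactly the open
question of this post.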

When I run it this way from Java, Pig doesn’t actually fail but falls back to
running in local mode.  Of course I don’t want that, and the silent fallback
made it harder to spot what happened.


High-level summary of the question: what Pig properties should I set to run
against an external YARN cluster when I cannot rely on any local Hadoop config
that Pig might normally pick up?

Thanks!
Adam
