[
https://issues.apache.org/jira/browse/PIG-4698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14966876#comment-14966876
]
Srikanth Sundarrajan commented on PIG-4698:
-------------------------------------------
Yes. We need to enable the auxiliary shuffle service in YARN. Here is the list of
changes I had to make to get it to work:
1. Copied spark-1.4.1-yarn-shuffle.jar to the yarn/lib folder on the NodeManagers
2. Changed yarn-site.xml on each NodeManager as follows:
{noformat}
<property>
  <name>yarn.nodemanager.aux-services</name>
  <value>mapreduce_shuffle,spark_shuffle</value>
</property>
<property>
  <name>yarn.nodemanager.aux-services.spark_shuffle.class</name>
  <value>org.apache.spark.network.yarn.YarnShuffleService</value>
</property>
<property>
  <name>spark.shuffle.service.enabled</name>
  <value>true</value>
</property>
{noformat}
3. Restarted the NodeManager
The gist of the change that you are likely to see in the patch is:
{code}
// Copy all spark.* properties to SparkConf
for (String key : pigCtxtProperties.stringPropertyNames()) {
    if (key.startsWith("spark.")) {
        LOG.debug("Copying key " + key + " with value " +
                pigCtxtProperties.getProperty(key) + " to SparkConf");
        sparkConf.set(key, pigCtxtProperties.getProperty(key));
    }
}
{code}
The patch also includes a few other changes and documentation.
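For anyone who wants to try the property-copying idea outside of Pig, here is a minimal standalone sketch. It is not the patch itself. It assumes a plain java.util.Map as a stand-in for SparkConf so that it runs without Spark on the classpath, and the class name SparkPropertyCopy and method name sparkProperties are hypothetical.

```java
import java.util.Map;
import java.util.Properties;
import java.util.TreeMap;

// Hypothetical illustration only; the Map stands in for SparkConf.
final class SparkPropertyCopy {

    // Returns every property whose key starts with "spark.", sorted by key.
    static Map<String, String> sparkProperties(Properties props) {
        Map<String, String> out = new TreeMap<>();
        for (String key : props.stringPropertyNames()) {
            if (key.startsWith("spark.")) {
                out.put(key, props.getProperty(key));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Properties pigProps = new Properties();
        pigProps.setProperty("spark.dynamicAllocation.enabled", "true");
        pigProps.setProperty("spark.shuffle.service.enabled", "true");
        pigProps.setProperty("pig.temp.dir", "/tmp"); // not a spark.* key, so it is skipped

        // Only the two spark.* entries are printed.
        sparkProperties(pigProps).forEach((k, v) -> System.out.println(k + "=" + v));
    }
}
```

With real Spark on the classpath, the put call would presumably become sparkConf.set(key, value), as in the snippet above.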
I will finalize the patch and upload it sometime tomorrow. I will also verify
that scenarios where executors are removed and things fall back to the
aux-shuffle service work fine.
> Enable dynamic resource allocation/de-allocation on Yarn backends
> -----------------------------------------------------------------
>
> Key: PIG-4698
> URL: https://issues.apache.org/jira/browse/PIG-4698
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Affects Versions: spark-branch
> Reporter: Srikanth Sundarrajan
> Assignee: Srikanth Sundarrajan
> Labels: spork
> Fix For: spark-branch
>
>
> Resource elasticity needs to be enabled on Yarn backend to allow jobs to
> scale out better and provide better wall clock execution times, while unused
> resources should be released back to RM for use.