[ 
https://issues.apache.org/jira/browse/PIG-4698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14966876#comment-14966876
 ] 

Srikanth Sundarrajan commented on PIG-4698:
-------------------------------------------

Yes. We need to enable the auxiliary shuffle service in YARN. Here is the list of 
changes I had to make to get it to work.

1. Copied spark-1.4.1-yarn-shuffle.jar to the yarn/lib folder on the NodeManagers
2. Changed yarn-site.xml on the NodeManagers as follows:
{noformat}
  <property>
    <name>yarn.nodemanager.aux-services</name>
    <value>mapreduce_shuffle,spark_shuffle</value>
  </property>
  <property>
    <name>yarn.nodemanager.aux-services.spark_shuffle.class</name>
    <value>org.apache.spark.network.yarn.YarnShuffleService</value>
  </property>
  <property>
    <name>spark.shuffle.service.enabled</name>
    <value>true</value>
  </property>
{noformat}
3. Restarted the NodeManager
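
One way to sanity-check the yarn-site changes before restarting is sketched below. It is only an illustration and not part of the patch: the class `YarnSiteCheck` and both of its methods are hypothetical names, and it assumes the properties sit inside the usual `<configuration>` root element of yarn-site.xml.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

public class YarnSiteCheck {

    // Hypothetical helper: collects <property><name>/<value> pairs from yarn-site XML text.
    public static Map<String, String> parseProperties(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            Map<String, String> result = new HashMap<>();
            NodeList props = doc.getElementsByTagName("property");
            for (int i = 0; i < props.getLength(); i++) {
                Element p = (Element) props.item(i);
                String name = p.getElementsByTagName("name").item(0).getTextContent().trim();
                String value = p.getElementsByTagName("value").item(0).getTextContent().trim();
                result.put(name, value);
            }
            return result;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse yarn-site XML", e);
        }
    }

    // True when spark_shuffle is listed as an aux service and mapped to YarnShuffleService.
    public static boolean sparkShuffleConfigured(Map<String, String> props) {
        String aux = props.getOrDefault("yarn.nodemanager.aux-services", "");
        boolean listed = Arrays.asList(aux.trim().split("\\s*,\\s*")).contains("spark_shuffle");
        boolean classSet = "org.apache.spark.network.yarn.YarnShuffleService"
                .equals(props.get("yarn.nodemanager.aux-services.spark_shuffle.class"));
        return listed && classSet;
    }

    public static void main(String[] args) {
        String xml = "<configuration>"
                + "<property><name>yarn.nodemanager.aux-services</name>"
                + "<value>mapreduce_shuffle,spark_shuffle</value></property>"
                + "<property><name>yarn.nodemanager.aux-services.spark_shuffle.class</name>"
                + "<value>org.apache.spark.network.yarn.YarnShuffleService</value></property>"
                + "</configuration>";
        System.out.println("spark_shuffle configured: "
                + sparkShuffleConfigured(parseProperties(xml)));
    }
}
```

The check only looks at the file contents; it does not confirm that the shuffle jar is on the NodeManager classpath or that the service actually started.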

The gist of the change that you are likely to see in the patch is:
{code}
            //Copy all spark.* properties to SparkConf
            for (String key : pigCtxtProperties.stringPropertyNames()) {
                if (key.startsWith("spark.")) {
                    LOG.debug("Copying key " + key + " with value " +
                        pigCtxtProperties.getProperty(key) + " to SparkConf");
                    sparkConf.set(key, pigCtxtProperties.getProperty(key));
                }
            }
{code}
besides a few other changes & documentation.
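
To make the effect of that loop concrete, here is a minimal standalone sketch of the same filtering. It is not the patch itself: a plain `Map` stands in for `SparkConf` so it runs without Spark on the classpath, `SparkPropertyCopy` and `copySparkProperties` are hypothetical names, and the property keys in `main` are just sample values one might set on the Pig side.

```java
import java.util.Map;
import java.util.Properties;
import java.util.TreeMap;

public class SparkPropertyCopy {

    // Hypothetical helper: returns only the entries whose key starts with "spark.".
    // The returned Map stands in for SparkConf in this sketch.
    public static Map<String, String> copySparkProperties(Properties props) {
        Map<String, String> sparkSettings = new TreeMap<>();
        for (String key : props.stringPropertyNames()) {
            if (key.startsWith("spark.")) {
                sparkSettings.put(key, props.getProperty(key));
            }
        }
        return sparkSettings;
    }

    public static void main(String[] args) {
        Properties pigProps = new Properties();
        // Sample values only; these keys are assumptions about what a user would set.
        pigProps.setProperty("spark.dynamicAllocation.enabled", "true");
        pigProps.setProperty("spark.shuffle.service.enabled", "true");
        pigProps.setProperty("pig.exec.reducers.max", "999"); // not a spark.* key, so it is skipped
        System.out.println(copySparkProperties(pigProps));
    }
}
```

With this approach any `spark.*` setting supplied through the Pig properties is passed through unchanged, so new Spark options would not need a matching Pig-side switch.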

I will finalize the patch and upload it sometime tomorrow. I will also verify 
that the scenarios where executors are removed and things fall back to the 
aux-shuffle service work fine.

> Enable dynamic resource allocation/de-allocation on Yarn backends
> -----------------------------------------------------------------
>
>                 Key: PIG-4698
>                 URL: https://issues.apache.org/jira/browse/PIG-4698
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>    Affects Versions: spark-branch
>            Reporter: Srikanth Sundarrajan
>            Assignee: Srikanth Sundarrajan
>              Labels: spork
>             Fix For: spark-branch
>
>
> Resource elasticity needs to be enabled on Yarn backend to allow jobs to 
> scale out better and provide better wall clock execution times, while unused 
> resources should be released back to RM for use.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
