Hey Sandy, What are those sleeps for and do they still exist? We have seen about a 1min to 1:30 executor startup time, which is a large chunk for jobs that run in ~10min.
Thanks, Arun On Fri, Dec 5, 2014 at 3:20 PM, Sandy Ryza <sandy.r...@cloudera.com> wrote: > Hi Denny, > > Those sleeps were only at startup, so if jobs are taking significantly > longer on YARN, that should be a different problem. When you ran on YARN, > did you use the --executor-cores, --executor-memory, and --num-executors > arguments? When running against a standalone cluster, by default Spark > will make use of all the cluster resources, but when running against YARN, > Spark defaults to a couple tiny executors. > > -Sandy > > On Fri, Dec 5, 2014 at 11:32 AM, Denny Lee <denny.g....@gmail.com> wrote: > >> My submissions of Spark on YARN (CDH 5.2) resulted in a few thousand >> steps. If I was running this on standalone cluster mode the query finished >> in 55s but on YARN, the query was still running 30min later. Would the hard >> coded sleeps potentially be in play here? >> On Fri, Dec 5, 2014 at 11:23 Sandy Ryza <sandy.r...@cloudera.com> wrote: >> >>> Hi Tobias, >>> >>> What version are you using? In some recent versions, we had a couple of >>> large hardcoded sleeps on the Spark side. >>> >>> -Sandy >>> >>> On Fri, Dec 5, 2014 at 11:15 AM, Andrew Or <and...@databricks.com> >>> wrote: >>> >>>> Hey Tobias, >>>> >>>> As you suspect, the reason why it's slow is because the resource >>>> manager in YARN takes a while to grant resources. This is because YARN >>>> needs to first set up the application master container, and then this AM >>>> needs to request more containers for Spark executors. I think this accounts >>>> for most of the overhead. The remaining source probably comes from how our >>>> own YARN integration code polls application (every second) and cluster >>>> resource states (every 5 seconds IIRC). I haven't explored in detail >>>> whether there are optimizations there that can speed this up, but I believe >>>> most of the overhead comes from YARN itself. >>>> >>>> In other words, no I don't know of any quick fix on your end that you >>>> can do to speed this up. >>>> >>>> -Andrew >>>> >>>> >>>> 2014-12-03 20:10 GMT-08:00 Tobias Pfeiffer <t...@preferred.jp>: >>>> >>>> Hi, >>>>> >>>>> I am using spark-submit to submit my application to YARN in >>>>> "yarn-cluster" mode. I have both the Spark assembly jar file as well as my >>>>> application jar file put in HDFS and can see from the logging output that >>>>> both files are used from there. However, it still takes about 10 seconds >>>>> for my application's yarnAppState to switch from ACCEPTED to RUNNING. >>>>> >>>>> I am aware that this is probably not a Spark issue, but some YARN >>>>> configuration setting (or YARN-inherent slowness), I was just wondering if >>>>> anyone has an advice for how to speed this up. >>>>> >>>>> Thanks >>>>> Tobias >>>>> >>>> >>>> >>> >