Re: pyspark on yarn hdp hortonworks

2014-09-05 Thread Greg Hill
: Wednesday, September 3, 2014 4:19 PM To: Oleg Ruchovets oruchov...@gmail.commailto:oruchov...@gmail.com Cc: user@spark.apache.orgmailto:user@spark.apache.org user@spark.apache.orgmailto:user@spark.apache.org Subject: Re: pyspark on yarn hdp hortonworks Hi Oleg, There isn't much you need to do

pyspark on yarn hdp hortonworks

2014-09-03 Thread Oleg Ruchovets
Hi all. I am trying to run pyspark on yarn already couple of days: http://hortonworks.com/kb/spark-1-0-1-technical-preview-hdp-2-1-3/ I posted exception on previous posts. It looks that I didn't do correct configuration. I googled quite a lot and I can't find the steps should be done to

Re: pyspark on yarn hdp hortonworks

2014-09-03 Thread Andrew Or
Hi Oleg, There isn't much you need to do to setup a Yarn cluster to run PySpark. You need to make sure all machines have python installed, and... that's about it. Your assembly jar will be shipped to all containers along with all the pyspark and py4j files needed. One caveat, however, is that the