Re: Is it feasible to build and run Spark on Windows?

2019-12-09 Thread Ping Liu
Super. Thanks Deepak! On Mon, Dec 9, 2019 at 6:58 PM Deepak Vohra wrote: > Please install Apache Spark on Windows as discussed in Apache Spark on > Windows - DZone Open Source > > > Apache Spark on Windows - DZone Open Source > >

Re: Is it feasible to build and run Spark on Windows?

2019-12-09 Thread Deepak Vohra
Please install Apache Spark on Windows as discussed in Apache Spark on Windows - DZone Open Source | | | | | | | | | | | Apache Spark on Windows - DZone Open Source This article explains and provides solutions for some of the most common errors developers come across when inst...

Re: Is it feasible to build and run Spark on Windows?

2019-12-09 Thread Ping Liu
Thanks Deepak! Yes, I want to try it with Docker. But my AWS account ran out of free period. Is there a shared EC2 for Spark that we can use for free? Ping On Monday, December 9, 2019, Deepak Vohra wrote: > Haven't tested but the general procedure is to exclude all guava dependencies that

Re: Is it feasible to build and run Spark on Windows?

2019-12-09 Thread Deepak Vohra
Haven't tested but the general procedure is to exclude all guava dependencies that are not needed. The hadoop-common depedency does not have a dependency on guava according to Maven Repository: org.apache.hadoop » hadoop-common | | | | | | | | | | | Maven Repository:

Re: [VOTE] Shall we release ORC 1.4.5rc1?

2019-12-09 Thread Owen O'Malley
With four +1's and no -1's the vote passes. I'll promote the release. Thanks, Owen On Fri, Dec 6, 2019 at 6:12 PM Hyukjin Kwon wrote: > +1 (as a Spark user) > > 2019년 12월 7일 (토) 오전 11:06, Dongjoon Hyun 님이 작성: > > > +1 for Apache ORC 1.4.5 release. > > > > Thank you for making the release.

Re: Is it feasible to build and run Spark on Windows?

2019-12-09 Thread Ping Liu
Hi Deepak, I tried it. Unfortunately, it still doesn't work. 28.1-jre isn't downloaded for somehow. I'll try something else. Thank you very much for your help! Ping On Fri, Dec 6, 2019 at 5:28 PM Deepak Vohra wrote: > As multiple guava versions are found exclude guava from all the >

Re: No of cores per executor.

2019-12-09 Thread Samik Raychaudhuri
Hi, Take a look at this video: [https://www.youtube.com/watch?v=daXEp4HmS-E]. Pretty dense, but might answer some of your questions. Thanks. -Samik On 09-Dec-19 4:12 AM, Amit Sharma wrote: I have set  5 cores per executor. Is there any formula to determine best combination of executor and

unsubscribe

2019-12-09 Thread Calvin Tran
unsubscribe On Dec. 9, 2019 6:59 a.m., "Areg Baghdasaryan (BLOOMBERG/ 731 LEX)" wrote: This e-mail (and any attachments) is intended only for the use of the addressee and may contain confidential and privileged information. If you are not the intended recipient, any collection, use,

unsubscribe

2019-12-09 Thread Areg Baghdasaryan (BLOOMBERG/ 731 LEX)

[pyspark 2.3+] broadcast timeout

2019-12-09 Thread Rishi Shah
Hi All, All of a sudden recently we discovered that all of our auto broadcasts have been timing out, this started happening in our static cloudera cluster as well as databricks. Data has not changed much. Has anyone seen anything like this before? Any suggestions other than increasing the timeout