Re: [VOTE] Release Apache Spark 2.3.3 (RC1)

2019-01-30 Thread Jungtaek Lim
Please proceed without SPARK-26154 given that it is unlikely expected to get merged in one week. The patch needs some more work, and we still haven't reached consensus on the approach. Btw, could one of committer justify and modify the priority and correctness label on SPARK-26154? I mentioned

Re: Welcome Jose Torres as a Spark committer

2019-01-30 Thread Bryan Cutler
Congrats Jose! On Tue, Jan 29, 2019, 10:48 AM Shixiong Zhu Hi all, > > The Apache Spark PMC recently added Jose Torres as a committer on the > project. Jose has been a major contributor to Structured Streaming. Please > join me in welcoming him! > > Best Regards, > > Shixiong Zhu > >

Re: Welcome Jose Torres as a Spark committer

2019-01-30 Thread Stavros Kontopoulos
Congrats Jose! On Wed, Jan 30, 2019 at 10:44 AM Gabor Somogyi wrote: > Congrats Jose! > > BR, > G > > On Wed, Jan 30, 2019 at 9:05 AM Nuthan Reddy > wrote: > >> Congrats Jose, >> >> Regards, >> Nuthan Reddy >> >> >> >> On Wed, Jan 30, 2019 at 1:22 PM Marco Gaido >> wrote: >> >>> Congrats,

Re: Purpose of broadcast timeout

2019-01-30 Thread Ryan Blue
At Netflix, we disable the broadcast timeout in our defaults. I found that it never helped catch problems. With lazy evaluation, I think it is reasonable for a table that should be broadcast to take a long time to build. Just because a join uses a subset or aggregation of a large table or

Purpose of broadcast timeout

2019-01-30 Thread Justin Uang
Hi all, We have noticed a lot of broadcast timeouts on our pipelines, and from some inspection, it seems that they happen when I have two threads trying to save two different DataFrames. We use the FIFO scheduler, so if I launch a job that needs all the executors, the second DataFrame's collect

Re: [VOTE] [SPARK-25994] SPIP: DataFrame-based Property Graphs, Cypher Queries, and Algorithms

2019-01-30 Thread Xiao Li
Change my vote from +1 to ++1 Xiangrui Meng 于2019年1月30日周三 上午6:20写道: > Correction: +0 vote doesn't mean "Don't really care". Thanks Ryan for the > offline reminder! Below is the Apache official interpretation >

Re: [VOTE] [SPARK-25994] SPIP: DataFrame-based Property Graphs, Cypher Queries, and Algorithms

2019-01-30 Thread Xiangrui Meng
Correction: +0 vote doesn't mean "Don't really care". Thanks Ryan for the offline reminder! Below is the Apache official interpretation of fraction values: The in-between values are indicative of how strongly the

Re: Self join

2019-01-30 Thread Marco Gaido
Hi all, this thread got a bit stuck. Hence, if there are no objections, I'd go ahead with a design doc describing the solution/workaround I mentioned before. Any concerns? Thanks, Marco Il giorno gio 13 dic 2018 alle ore 18:15 Ryan Blue ha scritto: > Thanks for the extra context, Marco. I

Re: Welcome Jose Torres as a Spark committer

2019-01-30 Thread Gabor Somogyi
Congrats Jose! BR, G On Wed, Jan 30, 2019 at 9:05 AM Nuthan Reddy wrote: > Congrats Jose, > > Regards, > Nuthan Reddy > > > > On Wed, Jan 30, 2019 at 1:22 PM Marco Gaido > wrote: > >> Congrats, Jose! >> >> Bests, >> Marco >> >> Il giorno mer 30 gen 2019 alle ore 03:17 JackyLee ha >> scritto:

Re: [VOTE] [SPARK-25994] SPIP: DataFrame-based Property Graphs, Cypher Queries, and Algorithms

2019-01-30 Thread Martin Junghanns
Hi Dongjoon, Thanks for the hint! I updated the SPIP accordingly. I also changed the access permissions for the SPIP and design sketch docs so that anyone can comment. Best, Martin On 29.01.19 18:59, Dongjoon Hyun wrote: Hi, Xiangrui Meng. +1 for the proposal. However, please update the

Re: Welcome Jose Torres as a Spark committer

2019-01-30 Thread Nuthan Reddy
Congrats Jose, Regards, Nuthan Reddy On Wed, Jan 30, 2019 at 1:22 PM Marco Gaido wrote: > Congrats, Jose! > > Bests, > Marco > > Il giorno mer 30 gen 2019 alle ore 03:17 JackyLee ha > scritto: > >> Congrats, Joe! >> >> Best, >> Jacky >> >> >> >> -- >> Sent from: