Re: Run Python User Defined Functions / code in Spark with Scala Codebase

2018-07-09 Thread Chetan Khatri
Hello Jayant, Thank you so much for suggestion. My view was to use Python function as transformation which can take couple of column names and return object. which you explained. would that possible to point me to similiar codebase example. Thanks. On Fri, Jul 6, 2018 at 2:56 AM, Jayant

Re: [CRAN-pretest-archived] CRAN submission SparkR 2.2.2

2018-07-09 Thread Shivaram Venkataraman
I dont think we need to respin 2.2.2 -- Given that 2.3.2 is on the way we can just submit that. Shivaram On Mon, Jul 9, 2018 at 6:19 PM Tom Graves wrote: > > is there anyway to push it to CRAN without this fix, I don't really want to > respin 2.2.2 just with the test fix. > > Tom > > On Monday,

Re: [CRAN-pretest-archived] CRAN submission SparkR 2.2.2

2018-07-09 Thread Shivaram Venkataraman
Yes. I think Felix checked in a fix to ignore tests run on java versions that are not Java 8 (I think the fix was in https://github.com/apache/spark/pull/21666 which is in 2.3.2) Shivaram On Mon, Jul 9, 2018 at 5:39 PM Sean Owen wrote: > > Yes, this flavor of error should only come up in Java 9.

Re: [CRAN-pretest-archived] CRAN submission SparkR 2.2.2

2018-07-09 Thread Felix Cheung
I recall this might be a problem running Spark on java 9 From: Shivaram Venkataraman Sent: Monday, July 9, 2018 2:17 PM To: dev; Felix Cheung; Tom Graves Subject: Fwd: [CRAN-pretest-archived] CRAN submission SparkR 2.2.2 The upcoming 2.2.2 release was submitted

Fwd: [CRAN-pretest-archived] CRAN submission SparkR 2.2.2

2018-07-09 Thread Shivaram Venkataraman
The upcoming 2.2.2 release was submitted to CRAN. I think there are some knows issues on Windows, but does anybody know what the following error with Netty is ? > WARNING: Illegal reflective access by > io.netty.util.internal.PlatformDependent0$1 >

Re: [build system] taking ubuntu workers offline for docker update

2018-07-09 Thread shane knapp
this is done. On Mon, Jul 9, 2018 at 6:48 PM, shane knapp wrote: > we need to update docker to something more modern (17.05.0-ce -> > 18.03.1-ce), so i have taken the two ubuntu workers offline and once the > current builds finish, i will perform the update. > > this shouldn't take more than an

[build system] taking ubuntu workers offline for docker update

2018-07-09 Thread shane knapp
we need to update docker to something more modern (17.05.0-ce -> 18.03.1-ce), so i have taken the two ubuntu workers offline and once the current builds finish, i will perform the update. this shouldn't take more than an hour. shane -- Shane Knapp UC Berkeley EECS Research / RISELab Staff

Register catalyst expression as SQL DSL

2018-07-09 Thread geoHeil
Hi, I would like to register custom catalyst expressions as SQL DSL https://stackoverflow.com/questions/51199761/spark-register-expression-for-sql-dsl can someone shed some light here? The documentation does not seem to contain a lot of information regarding catalyst internals. Thanks a lot.

Unsubscribe

2018-07-09 Thread Prateek Goel

Register now for ApacheCon and save $250

2018-07-09 Thread Rich Bowen
Greetings, Apache software enthusiasts! (You’re getting this because you’re on one or more dev@ or users@ lists for some Apache Software Foundation project.) ApacheCon North America, in Montreal, is now just 80 days away, and early bird prices end in just two weeks - on July 21. Prices will

Re: Asking for reviewing PRs regarding structured streaming

2018-07-09 Thread Jungtaek Lim
Now I'm adding one more issue (SPARK-24763 [1]), which proposes a new option to enable optimization of state size in streaming aggregation without hurting performance. The idea is to remove data for key fields from value which is duplicated between key and value in state row. This requires