Re: How to cache SparkPlan.execute for reusing?

2017-03-02 Thread Liang-Chi Hsieh
Internally, each partition of the resulting RDD[InternalRow] hands back the same UnsafeRow object as you iterate over the rows, so a plain RDD.cache does not work for it: the cached output ends up holding the same row repeated. I am not sure why you get empty output, though. Dataset.cache() is what is meant to be used for caching SQL query results.
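To illustrate the row-reuse problem described above, here is a minimal sketch in plain Python (not Spark code). The class ReusedRowIterator and the two cache_* helpers are hypothetical names made up for this illustration; they only mimic an iterator that overwrites one mutable buffer in place. In Spark itself the analogous fix would presumably be to copy each row (InternalRow has a copy() method) before caching, but that is an assumption about what fits the original poster's case, not something stated in this thread.

```python
class ReusedRowIterator:
    """Hypothetical stand-in for an iterator that reuses one mutable row."""

    def __init__(self, values):
        self._values = values
        self._row = [None]  # single buffer shared by every iteration step

    def __iter__(self):
        for v in self._values:
            self._row[0] = v   # overwrite the buffer in place
            yield self._row    # hand out the same object each time


def cache_naive(rows):
    # Stores references only, so every entry points at the same buffer.
    return list(rows)


def cache_with_copy(rows):
    # Copies each row before storing it, so later overwrites do not leak in.
    return [list(row) for row in rows]


naive = cache_naive(ReusedRowIterator([1, 2, 3]))
copied = cache_with_copy(ReusedRowIterator([1, 2, 3]))

print(naive)   # every entry shows the last value written to the buffer
print(copied)  # each entry keeps the value it had when it was read
```

The naive version keeps three references to one buffer, which is why the cached output looks like the same row repeated; copying at read time is what breaks that sharing.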

Re: Straw poll: dropping support for things like Scala 2.10

2017-03-02 Thread Russell Spitzer
+1 on removing 2.10

On Thu, Mar 2, 2017 at 8:51 AM Koert Kuipers wrote:
> Given the issues with Scala 2.10 and Java 8, I am in favor of dropping Scala 2.10 in the next release.
>
> On Sat, Feb 25, 2017 at 2:10 PM, Sean Owen wrote:
>> I want to bring up the issue of

Re: Straw poll: dropping support for things like Scala 2.10

2017-03-02 Thread Koert Kuipers
Given the issues with Scala 2.10 and Java 8, I am in favor of dropping Scala 2.10 in the next release.

On Sat, Feb 25, 2017 at 2:10 PM, Sean Owen wrote:
> I want to bring up the issue of Scala 2.10 support again, to see how
> people feel about it. Key opinions from the previous