Re: [VOTE][SPARK-27396] SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Bryan Cutler
The Arrow data format is not yet stable, meaning there are no guarantees on backwards/forwards compatibility. Once version 1.0 is released, it will have those guarantees but it's hard to say when that will be. The remaining work to get there can be seen at

Re: [VOTE][SPARK-27396] SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Matei Zaharia
Okay, that makes sense, but is the Arrow data format stable? If not, we risk breakage when Arrow changes in the future and some libraries using this feature are begin to use the new Arrow code. Matei > On Apr 20, 2019, at 1:39 PM, Bobby Evans wrote: > > I want to be clear that this SPIP is

Re: [VOTE][SPARK-27396] SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Bobby Evans
I want to be clear that this SPIP is not proposing exposing Arrow APIs/Classes through any Spark APIs. SPARK-24579 is doing that, and because of the overlap between the two SPIPs I scaled this one back to concentrate just on the columnar processing aspects. Sorry for the confusion as I didn't

Re: [VOTE] Release Apache Spark 2.4.2

2019-04-20 Thread Sean Owen
+1 from me too. It seems like there is support for merging the Jackson change into 2.4.x (and, I think, a few more minor dependency updates) but this doesn't have to go into 2.4.2. That said, if there is another RC for any reason, I think we could include it. Otherwise can wait for 2.4.3. On

Re: [VOTE][SPARK-27396] SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Matei Zaharia
FYI, I’d also be concerned about exposing the Arrow API or format as a public API if it’s not yet stable. Is stabilization of the API and format coming soon on the roadmap there? Maybe someone can work with the Arrow community to make that happen. We’ve been bitten lots of times by API changes

Re: [VOTE][SPARK-27396] SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Bobby Evans
I think you misunderstood the point of this SPIP. I responded to your comments in the SPIP JIRA. On Sat, Apr 20, 2019 at 12:52 AM Xiangrui Meng wrote: > I posted my comment in the JIRA >