Kudu JIRA has been moved to issues.apache.org/jira/browse/KUDU

2016-02-24 Thread Mike Percy
Hi everyone, We have moved the Kudu JIRA to Apache Software Foundation (ASF) infrastructure. You can now find all of the Kudu tickets migrated to https://issues.apache.org/jira/browse/KUDU The previous Kudu JIRA project on issues.cloudera.org is now retired and has been marked READ ONLY. Going

Re: Spark on Kudu

2016-02-24 Thread Jean-Daniel Cryans
The DStream stuff isn't there at all. I'm not sure if it's needed either. The kuduRDD is just leveraging the MR input format; ideally we'd use scans directly. The SparkSQL stuff is there but it doesn't do any sort of pushdown. It's really basic. The goal was to provide something for others to

Re: Spark on Kudu

2016-02-24 Thread Benjamin Kim
J-D, It looks like it fulfills most of the basic requirements (Kudu RDD, Kudu DStream) in KUDU-1214. Am I right? Besides shoring up more Spark SQL functionality (DataFrames) and doing the documentation, what more needs to be done? Optimizations? I believe that it’s a good place to start using

Re: Spark on Kudu

2016-02-24 Thread Jean-Daniel Cryans
AFAIK no one is working on it, but we did manage to get this in for 0.7.0: https://issues.cloudera.org/browse/KUDU-1321 It's a really simple wrapper, and yes you can use SparkSQL on Kudu, but it will require a lot more work to make it fast/useful. Hope this helps, J-D On Wed, Feb 24, 2016 at

Spark on Kudu

2016-02-24 Thread Benjamin Kim
I see this KUDU-1214 targeted for 0.8.0, but I see no progress on it. When this is complete, will this mean that Spark will be able to work with Kudu both programmatically and as a client via Spark SQL? Or is there more work that needs to be done

Re: Unsubscribe

2016-02-24 Thread Todd Lipcon
Please email user-unsubscribe@ -Todd On Wed, Feb 24, 2016 at 10:48 AM, Andrea Ferretti wrote: > -- Todd Lipcon Software Engineer, Cloudera

Unsubscribe

2016-02-24 Thread Andrea Ferretti