Re: CDH 5.5 - Kudu error not enough space remaining in buffer for op

2016-05-18 Thread Abhi Basu
I have tried with batch_size=500 and still get same error. For your reference are attached info that may help diagnose. Error: Error while applying Kudu session.: Incomplete: not enough space remaining in buffer for op (required 46.7K, 7.00M already used Config settings: Kudu Tablet Server

Re: Spark on Kudu

2016-05-18 Thread Chris George
There is some code in review that needs some more refinement. It will allow upsert/insert from a dataframe using the datasource api. It will also allow the creation and deletion of tables from a dataframe http://gerrit.cloudera.org:8080/#/c/2992/ Example usages will look something like:

Re: Spark on Kudu

2016-05-18 Thread Benjamin Kim
Can someone tell me what the state is of this Spark work? Also, does anyone have any sample code on how to update/insert data in Kudu using DataFrames? Thanks, Ben > On Apr 13, 2016, at 8:22 AM, Chris George wrote: > > SparkSQL cannot support these type of

Re: CDH 5.5 - Kudu error not enough space remaining in buffer for op

2016-05-18 Thread Abhi Basu
Thanks for the updates. I will give both options a try and report back. If you are interested in testing with such datasets, I can help. Thanks, Abhi On Wed, May 18, 2016 at 6:25 AM, Todd Lipcon wrote: > Hi Abhi, > > Will is right that the error is client-side, and