Re: Sparse Data

2016-05-12 Thread Chris George
I've used kudu with an EAV model for sparse data and that worked extremely well for us with billions of rows and the correct partitioning. -Chris On 5/12/16, 3:21 PM, "Dan Burkert" <d...@cloudera.com<mailto:d...@cloudera.com>> wrote: Hi Ben, Kudu doesn't support spa

Re: Sparse Data

2016-05-12 Thread Dan Burkert
info. - Dan On Thu, May 12, 2016 at 2:08 PM, Benjamin Kim <bbuil...@gmail.com> wrote: > Can Kudu handle the use case where sparse data is involved? In many of our > processes, we deal with data that can have any number of columns and many > previously unknown column names de

Sparse Data

2016-05-12 Thread Benjamin Kim
Can Kudu handle the use case where sparse data is involved? In many of our processes, we deal with data that can have any number of columns and many previously unknown column names depending on what attributes are brought in at the time. Currently, we use HBase to handle this. Since Kudu