Re: Spark on Kudu

2016-10-10 Thread Mark Hamstra
I realize that the Spark on Kudu work to date has been based on Spark 1.6, where your statement about Spark SQL relying on Hive is true. In Spark 2.0, however, that dependency no longer exists since Spark SQL essentially copied over the parts of Hive that were needed into Spark itself, and has

Re: Schema Normalization

2016-10-10 Thread Todd Lipcon
On Mon, Oct 10, 2016 at 4:44 PM, Benjamin Kim wrote:
> Todd,
>
> We are not going crazy with normalization. Actually, we are only
> normalizing where necessary. For example, we have a table for profiles and
> behaviors. They are joined together by a behavior status table.