Did anybody try to convert HiveQL queries to SparkSQL? If so, would you
share the experience, pros cons please? Thank you.
On Thu, Jul 30, 2015 at 10:37 AM, Bigdata techguy bigdatatech...@gmail.com
wrote:
Thanks Jorn for the response and for the pointer questions to Hive
optimization tips
.
Caching will only help for iterative stuff. You may think about
denormalizing the model even more to avoid joins as much as possible.
Bigdata techguy bigdatatech...@gmail.com schrieb am Mi., 29.07.2015,
18:49:
Hi All,
I have a fairly complex HiveQL data processing which I am trying
Hi All,
I have a fairly complex HiveQL data processing which I am trying to convert
to SparkSQL to improve performance. Below is what it does.
Select around 100 columns including Aggregates
From a FACT_TABLE
Joined to the summary of the same FACT_TABLE
Joined to 2 smaller DIMENSION tables.
The