Re: Need advice for Spark newbie

2015-02-26 Thread Dean Wampler
Historically, many orgs. have replaced data warehouses with Hadoop clusters and used Hive along with Impala (on Cloudera deployments) or Drill (on MapR deployments) for SQL. Hive is older and slower, while Impala and Drill are newer and faster, but you typically need both for their complementary

RE: Need advice for Spark newbie

2015-02-26 Thread Steve Nunez
works best for you. HDInsights works well for mixing matching tools. HTH, - SteveN -Original Message- From: Dean Wampler [mailto:deanwamp...@gmail.com] Sent: Thursday, 26 February, 2015 8:54 To: Vikram Kone Cc: dev@spark.apache.org Subject: Re: Need advice for Spark newbie

Re: Need advice for Spark newbie

2015-02-26 Thread Vikram Kone
Dean Thanks for the info. Are you saying that we can create star/snowflake data models using spark so they can be queried from tableau ? On Thursday, February 26, 2015, Dean Wampler deanwamp...@gmail.com wrote: Historically, many orgs. have replaced data warehouses with Hadoop clusters and

Re: Need advice for Spark newbie

2015-02-26 Thread Vikram Kone
','dev@spark.apache.org'); Subject: Re: Need advice for Spark newbie Historically, many orgs. have replaced data warehouses with Hadoop clusters and used Hive along with Impala (on Cloudera deployments) or Drill (on MapR deployments) for SQL. Hive is older and slower, while Impala and Drill

Re: Need advice for Spark newbie

2015-02-26 Thread Dean Wampler
There's no support for star or snowflake models, per se. What you get with Hadoop is access to all your data and the processing power to build the ad hoc queries you want, when you need them, rather than having to figure out a schema/model in advance. I recommend that you also ask your questions

Need advice for Spark newbie

2015-02-25 Thread Vikram Kone
Hi, I'm a newbie when it comes to Spark and Hadoop eco system in general. Our team has been predominantly a Microsoft shop that uses MS stack for most of their BI needs. So we are talking SQL server for storing relational data and SQL Server Analysis services for building MOLAP cubes for