Hello, I work for an eCommerce company. Currently we are looking at building a Data warehouse platform as described below:
DW as a Service | REST API | SQL On No SQL (Drill/Pig/Hive/Spark SQL) | No SQL databases (One or more. May be RDBMS directly too) | (Bulk load) My SQL Database I wish to get a few clarifications on Apache Drill as follows: 1) Can we use Spark for SQL on No SQL or do we need to mix them with Pig/Hive or any other for any reason? 2) Can Spark SQL be used a query interface for Business Intelligence, Analytics and Reporting 3) Is Spark supports only Hadoop, HBase?. We may use Cassandra/MongoDb/CouchBase as well. 4) Is Spark supports RDBMS too?. We can have a single interface to pull out data from multiple data sources? 5) Any recommendations(not limited to usage of Spark) for our specific requirement described above. Thanks Ajay Note : I have posted a similar post on the Drill User list as well as I am not sure which one best fits for our usecase. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Clarifications-on-Spark-tp20440.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org