Within the context of your question, Spark SQL utilizing the Hive context is primarily about very fast queries. If you want to use real-time queries, I would utilize Spark Streaming. A couple of great resources on this topic include Guest Lecture on Spark Streaming in Stanford CME 323: Distributed Algorithms and Optimization <http://www.slideshare.net/tathadas/guest-lecture-on-spark-streaming-in-standford> and Recipes for Running Spark Streaming Applications in Production <https://spark-summit.org/2015/events/recipes-for-running-spark-streaming-applications-in-production/> (from the recent Spark Summit 2015)
HTH! On Mon, Jul 6, 2015 at 3:23 PM spierki <florian.spierc...@crisalid.com> wrote: > Hello, > > I'm actually asking my self about performance of using Spark SQL with Hive > to do real time analytics. > I know that Hive has been created for batch processing, and Spark is use to > do fast queries. > > But, use Spark SQL with Hive will allow me to do real time queries ? Or it > just will make fastest queries but not real time. > Should I use an other datawarehouse, like Hbase ? > > Thanks in advance for your time and consideration, > Florian > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Spark-SQL-queries-hive-table-real-time-tp23642.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > >