Actually, with HiveContext, you can join hive tables with registered
temporary tables.
On Fri, Aug 22, 2014 at 9:07 PM, chutium teng@gmail.com wrote:
oops, thanks Yan, you are right, i got
scala sqlContext.sql(select * from a join b).take(10)
java.lang.RuntimeException: Table Not Found:
as far as i know, HQL queries try to find the schema info of all the tables
in this query from hive metastore, so it is not possible to join tables from
sqlContext using hiveContext.hql
but this should work:
hiveContext.hql(select ...).regAsTable(a)
sqlContext.jsonFile(xxx).regAsTable(b)
then
[mailto:teng@gmail.com]
Sent: Thursday, August 21, 2014 4:35 AM
To: d...@spark.incubator.apache.org
Subject: Re: Spark SQL Query and join different data sources.
as far as i know, HQL queries try to find the schema info of all the tables in
this query from hive metastore, so it is not possible
Presto is so far good at joining different sources/databases.
I tried a simple join query in Spark SQL, it fails as the followings errors
val a = cql(select test.a from test JOIN test1 on test.a = test1.a)
a: org.apache.spark.sql.SchemaRDD =
SchemaRDD[0] at RDD at SchemaRDD.scala:104
== Query