Does anyone have some advice on the best way to deploy a Hive UDF for use
with a Spark SQL Thriftserver where the client is Tableau using Simba ODBC
Spark SQL driver.
I have seen the hive documentation that provides an example of creating the
function using a hive client ie: CREATE FUNCTION
I am building an analytics environment based on Spark and want to use HIVE in
multi-user mode i.e. not use the embedded derby database but use Postgres
and HDFS instead. I am using the included Spark Thrift Server to process
queries using Spark SQL.
The documentation gives me the impression that
I'm a bit confused about the documentation in the area of Hive support.
I want to use a remote Hive metastore/hdfs server and the documentation says
that we need to build Spark from source due to the large number of
dependencies Hive requires.
Specifically the documentation says:
Hive has a