[jira] [Commented] (SPARK-19076) Upgrade Hive dependence to Hive 2.x

William Handy (JIRA) Thu, 18 May 2017 12:23:36 -0700

    [ https://issues.apache.org/jira/browse/SPARK-19076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16016309#comment-16016309 ]


William Handy commented on SPARK-19076:
---------------------------------------

It seems it was decided that this upgrade was too difficult, but I wanted to 
point out that Hive 2.1 has multithreaded writes, controlled by the settings 
hive.mv.files.thread and hive.metastore.fshandler.threads. If you happen to be 
using Spark on S3, these settings would be a significant performance boost.
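
To illustrate why a thread count matters here, below is a minimal sketch of 
moving output files with a pool of workers. It is not Hive's implementation: 
the names move_files and demo are hypothetical, and it moves local files 
rather than S3 objects. The assumption behind it is that on S3 a rename is 
carried out as a copy followed by a delete, so each move is slow and 
overlapping many of them is where the gain would come from.

```python
import os
import shutil
import tempfile
from concurrent.futures import ThreadPoolExecutor


def move_files(src_dir, dst_dir, threads):
    """Move every file in src_dir to dst_dir using `threads` workers.

    Hypothetical helper: `threads` plays the role that a setting such as
    hive.mv.files.thread is described as playing, but this is not Hive code.
    """
    os.makedirs(dst_dir, exist_ok=True)
    names = sorted(os.listdir(src_dir))

    def move_one(name):
        # Each move is independent of the others, so they can overlap.
        shutil.move(os.path.join(src_dir, name), os.path.join(dst_dir, name))
        return name

    with ThreadPoolExecutor(max_workers=threads) as pool:
        # pool.map returns results in the same order as `names`.
        return list(pool.map(move_one, names))


def demo():
    """Create a few staging files, move them, and report what ended up where."""
    with tempfile.TemporaryDirectory() as root:
        src = os.path.join(root, "staging")
        dst = os.path.join(root, "table")
        os.makedirs(src)
        for i in range(8):
            with open(os.path.join(src, f"part-{i:05d}"), "w") as f:
                f.write("row\n")
        moved = move_files(src, dst, threads=4)
        return moved, sorted(os.listdir(dst)), os.listdir(src)
```

With a single worker the moves run one after another; with several, the 
per-file latency overlaps, which is the effect I would expect the two Hive 
settings to give on an object store.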

Several articles discuss using these settings in the context of "Hive on 
Spark", but I would like to see them in "Hive _in_ Spark" instead :-/

> Upgrade Hive dependence to Hive 2.x
> -----------------------------------
>
>                 Key: SPARK-19076
>                 URL: https://issues.apache.org/jira/browse/SPARK-19076
>             Project: Spark
>          Issue Type: Improvement
>            Reporter: Dapeng Sun
>
> Currently upstream Spark depends on Hive 1.2.1 to build its package. Hive 
> 2.0 was released in February 2016, and Hive 2.0.1 and 2.1.0 have also been 
> out for a long time, so it would be better for Spark to support Hive 2.0 
> and above.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
