[jira] [Commented] (SPARK-19256) Hive bucketing support

Tejas Patil (JIRA) Tue, 24 Jan 2017 23:04:17 -0800

    [ 
https://issues.apache.org/jira/browse/SPARK-19256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15837302#comment-15837302
 ]


Tejas Patil commented on SPARK-19256:
-------------------------------------

BTW: In its current state, Spark writes data to hive bucketed tables but the 
outputs will not conform with hive's bucketing semantics. This can lead to data 
corruption if Spark outputs are used by Hive. I feel disabling writes to hive 
bucketing tables via Spark might be a good initial step to add. What do you 
think ?

> Hive bucketing support
> ----------------------
>
>                 Key: SPARK-19256
>                 URL: https://issues.apache.org/jira/browse/SPARK-19256
>             Project: Spark
>          Issue Type: Umbrella
>          Components: SQL
>    Affects Versions: 2.1.0
>            Reporter: Tejas Patil
>            Priority: Minor
>
> JIRA to track design discussions and tasks related to Hive bucketing support 
> in Spark.
> Proposal : 
> https://docs.google.com/document/d/1a8IDh23RAkrkg9YYAeO51F4aGO8-xAlupKwdshve2fc/edit?usp=sharing



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-19256) Hive bucketing support

Reply via email to