[ https://issues.apache.org/jira/browse/SPARK-19256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15837302#comment-15837302 ]
Tejas Patil commented on SPARK-19256: ------------------------------------- BTW: In its current state, Spark writes data to hive bucketed tables but the outputs will not conform with hive's bucketing semantics. This can lead to data corruption if Spark outputs are used by Hive. I feel disabling writes to hive bucketing tables via Spark might be a good initial step to add. What do you think ? > Hive bucketing support > ---------------------- > > Key: SPARK-19256 > URL: https://issues.apache.org/jira/browse/SPARK-19256 > Project: Spark > Issue Type: Umbrella > Components: SQL > Affects Versions: 2.1.0 > Reporter: Tejas Patil > Priority: Minor > > JIRA to track design discussions and tasks related to Hive bucketing support > in Spark. > Proposal : > https://docs.google.com/document/d/1a8IDh23RAkrkg9YYAeO51F4aGO8-xAlupKwdshve2fc/edit?usp=sharing -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org