[jira] [Commented] (SPARK-19256) Hive bucketing support

Tejas Patil (JIRA) Sun, 13 Aug 2017 16:37:31 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-19256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16125084#comment-16125084
 ]


Tejas Patil commented on SPARK-19256:
-------------------------------------

After the refactoring of the insertion plan node has been done, I revisited 
this and picked up where I left off. I have a unpolished but working version 
for both reader and writer side changes. Its too big of a patch, I am finding 
it hard to keep in sync with master plus self-review. I have split it into 2 
parts : reader side change and writer side change (the later is needed first as 
one cannot write test cases for reader if there is no way to populate bucketed 
data). Will send a PR for write side this week after more testing. Sharing the 
link to my personal branch if anyone wants to see : 
https://github.com/tejasapatil/spark/tree/bucket_write (it still needs more 
work in terms of handing one corner case, test cases, refactoring + polishing 
but the core functionality works)

> Hive bucketing support
> ----------------------
>
>                 Key: SPARK-19256
>                 URL: https://issues.apache.org/jira/browse/SPARK-19256
>             Project: Spark
>          Issue Type: Umbrella
>          Components: SQL
>    Affects Versions: 2.1.0
>            Reporter: Tejas Patil
>            Priority: Minor
>
> JIRA to track design discussions and tasks related to Hive bucketing support 
> in Spark.
> Proposal : 
> https://docs.google.com/document/d/1a8IDh23RAkrkg9YYAeO51F4aGO8-xAlupKwdshve2fc/edit?usp=sharing



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-19256) Hive bucketing support

Reply via email to