[ 
https://issues.apache.org/jira/browse/HIVE-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Wilfong resolved HIVE-3593.
---------------------------------

    Resolution: Not A Problem

Actually, the regex to get the task ID already avoids this problem.
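
As a rough illustration of why a suffix-anchored regex sidesteps the growth, here is a minimal sketch. It is not Hive's actual code: the pattern, the helper names (task_id_from_filename, naive_task_id, output_name) and the "(ds=2012-10-01)" prefix format are all assumptions made up for the example. The point is only that extracting the trailing digits before the attempt suffix discards any prefix, so re-prefixing the name on a second join does not compound.

```python
import re

# Hypothetical pattern (not Hive's real regex): capture the run of digits
# that sits immediately before the trailing "_<attempt>" suffix.
TASK_ID_PATTERN = re.compile(r"(\d+)_\d+$")


def task_id_from_filename(name: str) -> str:
    """Return only the numeric task ID, ignoring any prefix in the name."""
    match = TASK_ID_PATTERN.search(name)
    if match is None:
        raise ValueError(f"no task ID found in {name!r}")
    return match.group(1)


def naive_task_id(name: str) -> str:
    """Treat everything before the last underscore as the task ID.

    This keeps any prefix, which is the behaviour the issue worried about.
    """
    return name.rsplit("_", 1)[0]


def output_name(prefix: str, task_id: str, attempt: int = 0) -> str:
    """Build an output file name: prefix, then task ID, then attempt suffix."""
    return f"{prefix}{task_id}_{attempt}"


if __name__ == "__main__":
    prefix = "(ds=2012-10-01)"  # assumed partition-spec prefix format
    name = "000001_0"

    # Chain two joins using the regex-based extraction: the name is stable.
    first = output_name(prefix, task_id_from_filename(name))
    second = output_name(prefix, task_id_from_filename(first))
    print(first, second)

    # Chain two joins using the naive extraction: the prefix compounds.
    grown = output_name(prefix, naive_task_id(first))
    print(grown)
```

With the regex-based helper the second-generation name equals the first-generation one, while the naive helper carries the old prefix along and adds another on top.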
                
> Output files of SMB join grow indefinitely
> ------------------------------------------
>
>                 Key: HIVE-3593
>                 URL: https://issues.apache.org/jira/browse/HIVE-3593
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.10.0
>            Reporter: Kevin Wilfong
>            Assignee: Kevin Wilfong
>
> The output files of an SMB join are prefixed by the big table's partition spec 
> that was used to create them.  The bucket number portion of the file name is 
> adjusted to match the length of the task ID.  Since the task ID is derived 
> from the name of the file, this means that if the output of an SMB join is 
> used as the big table of another SMB join, the output file names will grow by 
> the length of the original partition spec.  Compound this and the file names 
> can grow indefinitely.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
