[ https://issues.apache.org/jira/browse/FALCON-143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14279966#comment-14279966 ]
Adam Kawa commented on FALCON-143: ---------------------------------- Created! https://issues.apache.org/jira/browse/FALCON-996 > Enable Late data handling for hive tables > ----------------------------------------- > > Key: FALCON-143 > URL: https://issues.apache.org/jira/browse/FALCON-143 > Project: Falcon > Issue Type: Sub-task > Affects Versions: 0.3 > Reporter: Venkatesh Seetharam > Assignee: Venkatesh Seetharam > Fix For: 0.4 > > Attachments: FALCON-143-r0.patch, FALCON-143.patch > > > HCat nor Hive APIs expose internal stats about a given partition. The only > way to get the partition size is to get the location of the partition on HDFS > and then use globStatus and contentSummary APIs. > With the addition of HIVE-5317, this is going to get more complicated with > deltas and minor and major compactions with no locking. > Need to work with hive to see if there will be an API or Falcon needs to > understand the structure of the layout of the data on the file system. -- This message was sent by Atlassian JIRA (v6.3.4#6332)