Eugene Koifman created HIVE-16722:
-------------------------------------

             Summary: Converting non-acid table to acid should perform 
validation
                 Key: HIVE-16722
                 URL: https://issues.apache.org/jira/browse/HIVE-16722
             Project: Hive
          Issue Type: Sub-task
    Affects Versions: 1.0.0
            Reporter: Eugene Koifman


Converting a non acid table to acid only performs metadata validation (in 
_TransactionalValidationListener_).
The data read code path only understands certain directory layouts and file 
names and ignores (generally) files that don't match the expected format.

In Hive, directory layout and bucket file naming (especially older releases) is 
poorly enforced.

Need to add a validation step on 
{noformat}
alter table T SET TBLPROPERTIES ('transactional'='true')
{noformat}
to 
scan the file system and report any possible data loss scenarios.

Currently Acid understands bucket files name like "00000_0" and (with 
HIVE-16177) 00000_0_copy1" etc at the root of the partition.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to