Siddharth Seth created HIVE-12624:
-------------------------------------
Summary: HYBRID ORC split strategy can generate inconsistent
splits across two otherwise identical runs
Key: HIVE-12624
URL: https://issues.apache.org/jira/browse/HIVE-12624
Project: Hive
Issue Type: Bug
Reporter: Siddharth Seth
When using the HYBRID strategy to execute the same query twice - the raw splits
generated in the two runs can differ. Wondering if this has something to do
with footer caching.
cc [~prasanth_j]
14: [
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs:///...date=2015-11-03/000000_0:3+367753]
!= [
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs://...date=2015-11-03/000000_0:0+367958]
65: [
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs://...date=2015-11-10/000002_0:0+387632]
!= [
org.apache.hadoop.hive.ql.io.orc.OrcInputFormat:hdfs://...date=2015-11-10/000002_0:3+387427]
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)