satish created HUDI-1539:
----------------------------

             Summary: Bug in HoodieCombineRealtimeRecordReader returns wrong 
results
                 Key: HUDI-1539
                 URL: https://issues.apache.org/jira/browse/HUDI-1539
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: satish


https://github.com/apache/hudi/issues/2346#issuecomment-758591316 

1. The query runs against a realtime (rt) table.
2. The Hive query uses predicate pushdown.
3. There are at least 3 splits (and therefore at least 3 recordReaders in
   HoodieCombineRealtimeRecordReader), and the records that satisfy the
   predicate are in a split near the end of the list.
4. For 2 recordReaders in succession, this.currentRecordReader.next(key, value)
   returns false, because predicate pushdown has filtered out the base file.

In step 4, HoodieCombineRealtimeRecordReader::next(NullWritable key,
ArrayWritable value) returns false, so the reader stops reading. As a result,
the records that satisfy the predicate remain in the later recordReaders and
are never read.
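
The failure mode can be modelled outside Hudi with a small sketch. This is
not the Hudi source: CombineReaderSketch, SplitReader, readerOf and drain are
hypothetical names, and a null return stands in for next(key, value)
returning false. With skipAllEmpty set to false the combined reader advances
past only one exhausted reader, as described in the report; with it set to
true it keeps advancing until a reader yields a record or none are left,
which is one possible way to avoid the problem.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.Iterator;
import java.util.List;

// Simplified model of a combined reader over several per-split readers.
// All names here are hypothetical and are not taken from the Hudi code base.
class CombineReaderSketch {

    // Stand-in for a per-split reader: returns the next record,
    // or null once the reader has nothing (more) to return.
    interface SplitReader {
        String next();
    }

    // Builds a reader over a fixed set of records; no arguments models a
    // split whose records were all filtered out by predicate pushdown.
    static SplitReader readerOf(String... records) {
        Iterator<String> it = Arrays.asList(records).iterator();
        return () -> it.hasNext() ? it.next() : null;
    }

    // Reads every record the combined reader would return.
    // skipAllEmpty == false: move to the next reader only once per call,
    //   so two empty readers in a row end the scan early (reported behaviour).
    // skipAllEmpty == true: keep moving until a record is found or no
    //   readers remain.
    static List<String> drain(List<SplitReader> readers, boolean skipAllEmpty) {
        Deque<SplitReader> pending = new ArrayDeque<>(readers);
        List<String> out = new ArrayList<>();
        if (pending.isEmpty()) {
            return out;
        }
        SplitReader current = pending.poll();
        while (true) {
            String rec = current.next();
            if (rec == null && skipAllEmpty) {
                while (rec == null && !pending.isEmpty()) {
                    current = pending.poll();
                    rec = current.next();
                }
            } else if (rec == null && !pending.isEmpty()) {
                current = pending.poll();
                rec = current.next();
            }
            if (rec == null) {
                return out; // the combined reader reports "no more records"
            }
            out.add(rec);
        }
    }
}
```

Calling drain with two empty readers followed by a reader holding the
matching records reproduces the scenario in steps 3 and 4: the single-advance
variant returns nothing, while the looping variant returns the records from
the third reader.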



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
