[ 
https://issues.apache.org/jira/browse/TEZ-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rajesh Balamohan updated TEZ-2348:
----------------------------------
    Attachment: TEZ-2348.1.patch

[~jdere] - The exception from IFile is valid, as the higher-level API is not 
expected to call nextRawKey() once the end of file is reached.  Can you please 
check whether your Hive patch calls UnorderedKVReader.next() even after it has 
returned false?  In that situation, this error is possible.

For instance, you can check for "readers.UnorderedKVReader: Num Records read:" 
in the task attempt log. This indicates that the UnorderedKVReader has finished 
processing and no more data is available. However, if higher-level APIs invoke 
UnorderedKVReader.next() again after that point, the call ends up throwing an 
EOF exception from IFile.
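The failure mode described above can be sketched without any Tez dependency. 
StrictReader below is a hypothetical stand-in, not the Tez UnorderedKVReader; 
it only mimics the behaviour discussed here (next() returns false once, and a 
further call raises an IOException). The drain() helper shows the caller-side 
pattern that avoids the error: stop as soon as next() returns false.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ReaderLoopSketch {

    /** Hypothetical stand-in for a key/value reader; NOT the Tez class. */
    static final class StrictReader {
        private final Iterator<String> source;
        private boolean exhausted;
        private String current;

        StrictReader(List<String> records) {
            this.source = records.iterator();
        }

        /** Returns false exactly once at end of input; later calls throw. */
        boolean next() throws IOException {
            if (exhausted) {
                // Assumed behaviour, modelled on the EOF error in this issue.
                throw new IOException("Reached EOF");
            }
            if (source.hasNext()) {
                current = source.next();
                return true;
            }
            exhausted = true;
            return false;
        }

        String getCurrentValue() {
            return current;
        }
    }

    /** Caller-side pattern: never call next() again after it returns false. */
    static int drain(StrictReader reader) throws IOException {
        int count = 0;
        while (reader.next()) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) throws IOException {
        StrictReader reader = new StrictReader(Arrays.asList("a", "b", "c"));
        System.out.println("records read: " + drain(reader));
        try {
            reader.next(); // the misuse: calling next() after it returned false
        } catch (IOException e) {
            System.out.println("extra call failed: " + e.getMessage());
        }
    }
}
```

If the Hive code has more than one place that can call next() on the same 
reader, each of them needs to respect the earlier false return.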

Attaching a patch that handles this situation on the Tez side.  [~sseth] - Can 
you please review?
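The patch itself is attached to the issue and is not reproduced in this 
message. Purely as an illustration of what a reader-side guard could look 
like, the sketch below remembers that the input is exhausted and keeps 
returning false on later calls. GuardedReader and its fields are hypothetical 
names, and the actual patch may take a different approach (for example, 
throwing a more descriptive exception instead).

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class GuardedReaderSketch {

    /** Hypothetical reader that tolerates next() calls after completion. */
    static final class GuardedReader {
        private final Iterator<String> source;
        private boolean completed;   // set once the input is exhausted
        private long numRecordsRead;
        private String current;

        GuardedReader(List<String> records) {
            this.source = records.iterator();
        }

        boolean next() {
            if (completed) {
                // Guard: do not touch the underlying input again.
                return false;
            }
            if (source.hasNext()) {
                current = source.next();
                numRecordsRead++;
                return true;
            }
            completed = true;
            return false;
        }

        String getCurrentValue() {
            return current;
        }

        long getNumRecordsRead() {
            return numRecordsRead;
        }
    }

    public static void main(String[] args) {
        GuardedReader reader = new GuardedReader(Arrays.asList("k1", "k2"));
        while (reader.next()) {
            System.out.println(reader.getCurrentValue());
        }
        // Extra calls are harmless with the guard in place.
        System.out.println(reader.next());
        System.out.println(reader.getNumRecordsRead());
    }
}
```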

> EOF exception during UnorderedKVReader.next()
> ---------------------------------------------
>
>                 Key: TEZ-2348
>                 URL: https://issues.apache.org/jira/browse/TEZ-2348
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.5.2
>            Reporter: Jason Dere
>         Attachments: TEZ-2348.1.patch, _tez_session_dir.tgz
>
>
> {noformat}
> Caused by: java.lang.RuntimeException: java.io.IOException: Reached EOF. Completed reading 516605
>       at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.pushRecord(ReduceRecordSource.java:278)
>       at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run(ReduceRecordProcessor.java:184)
>       at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:148)
>       ... 13 more
> Caused by: java.io.IOException: Reached EOF. Completed reading 516605
>       at org.apache.tez.runtime.library.common.sort.impl.IFile.checkState(IFile.java:817)
>       at org.apache.tez.runtime.library.common.sort.impl.IFile$Reader.positionToNextRecord(IFile.java:698)
>       at org.apache.tez.runtime.library.common.sort.impl.IFile$Reader.readRawKey(IFile.java:731)
>       at org.apache.tez.runtime.library.common.sort.impl.IFile$Reader.nextRawKey(IFile.java:727)
>       at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.readNextFromCurrentReader(UnorderedKVReader.java:151)
>       at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.next(UnorderedKVReader.java:112)
>       at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource$KeyValuesFromKeyValue.next(ReduceRecordSource.java:439)
>       at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.pushRecord(ReduceRecordSource.java:232)
>       ... 15 more
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
