[
https://issues.apache.org/jira/browse/DRILL-8486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
James Turton closed DRILL-8486.
-------------------------------
> ParquetDecodingException: could not read bytes at offset
> ---------------------------------------------------------
>
> Key: DRILL-8486
> URL: https://issues.apache.org/jira/browse/DRILL-8486
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Parquet
> Affects Versions: 1.21.1
> Reporter: Maksym Rymar
> Assignee: Maksym Rymar
> Priority: Major
> Fix For: 1.21.2
>
> Attachments: test.parquet
>
>
> Drill fails to read a Parquet file with the following exception:
>
> {code:java}
> Caused by: org.apache.parquet.io.ParquetDecodingException: could not read bytes at offset 591804
> at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:42)
> at org.apache.drill.exec.store.parquet.columnreaders.VarLenColumnBulkInput$ValuesReaderWrapper.getNextEntry(VarLenColumnBulkInput.java:754)
> ... 43 common frames omitted
> Caused by: java.io.EOFException: null
> at org.apache.parquet.bytes.SingleBufferInputStream.read(SingleBufferInputStream.java:52)
> at org.apache.parquet.bytes.BytesUtils.readIntLittleEndian(BytesUtils.java:83)
> at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:39)
> ... 44 common frames omitted
> {code}
>
>
> This issue only affects queries run with {{store.parquet.flat.reader.bulk}} set
> to {{true}} (the default).
> A Parquet file that reproduces the issue is attached: [^test.parquet].
> Query: {{select log, app_name from dfs.tmp.`test.parquet`}}
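For readers unfamiliar with the frames in the trace: in Parquet's PLAIN encoding, each BYTE_ARRAY value is stored as a 4-byte little-endian length followed by that many bytes, which is why BinaryPlainValuesReader.readBytes calls BytesUtils.readIntLittleEndian first. The sketch below is not Drill or parquet-mr code; the class PlainBinarySketch and its helpers encode and readNext are hypothetical names used only for illustration. It shows how asking for one more value than the buffer holds ends in an EOFException at the length prefix, which matches the shape of the trace above. The ticket itself does not state the root cause, so this illustrates the failure mode rather than explaining the bug.

```java
import java.io.EOFException;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;

public class PlainBinarySketch {

    // Hypothetical helper: lay out values the way PLAIN stores BYTE_ARRAY,
    // a 4-byte little-endian length followed by the raw bytes.
    public static byte[] encode(String... values) {
        int size = 0;
        for (String v : values) {
            size += 4 + v.getBytes(StandardCharsets.UTF_8).length;
        }
        ByteBuffer out = ByteBuffer.allocate(size).order(ByteOrder.LITTLE_ENDIAN);
        for (String v : values) {
            byte[] bytes = v.getBytes(StandardCharsets.UTF_8);
            out.putInt(bytes.length);
            out.put(bytes);
        }
        return out.array();
    }

    // Hypothetical helper: read one length-prefixed value at the current position.
    public static String readNext(ByteBuffer page) throws EOFException {
        if (page.remaining() < 4) {
            // Nothing left to hold a length prefix: the caller asked for too many values.
            throw new EOFException("no length prefix at offset " + page.position());
        }
        int length = page.getInt();
        if (length < 0 || length > page.remaining()) {
            throw new EOFException("value of length " + length + " overruns the buffer");
        }
        byte[] bytes = new byte[length];
        page.get(bytes);
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws EOFException {
        ByteBuffer page = ByteBuffer.wrap(encode("log line", "my_app"))
                .order(ByteOrder.LITTLE_ENDIAN);
        System.out.println(readNext(page));
        System.out.println(readNext(page));
        try {
            readNext(page); // a third read, as if the caller expected more values
        } catch (EOFException e) {
            System.out.println("EOF: " + e.getMessage());
        }
    }
}
```

The bounds checks are explicit here so the failure is easy to see; a real reader may surface the same condition through a stream read instead.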
--
This message was sent by Atlassian Jira
(v8.20.10#820010)