Maksym Rymar created DRILL-8486:
-----------------------------------
Summary: ParquetDecodingException: could not read bytes at offset
Key: DRILL-8486
URL: https://issues.apache.org/jira/browse/DRILL-8486
Project: Apache Drill
Issue Type: Bug
Components: Storage - Parquet
Affects Versions: 1.21.1
Reporter: Maksym Rymar
Assignee: Maksym Rymar
Attachments: test.parquet
Drill fails to read a parquet file with the following exception:
{code:java}
Caused by: org.apache.parquet.io.ParquetDecodingException: could not read bytes
at offset 591804
at
org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:42)
at
org.apache.drill.exec.store.parquet.columnreaders.VarLenColumnBulkInput$ValuesReaderWrapper.getNextEntry(VarLenColumnBulkInput.java:754)
... 43 common frames omitted
Caused by: java.io.EOFException: null
at
org.apache.parquet.bytes.SingleBufferInputStream.read(SingleBufferInputStream.java:52)
at
org.apache.parquet.bytes.BytesUtils.readIntLittleEndian(BytesUtils.java:83)
at
org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:39)
... 44 common frames omitted {code}
This issue only affects queries with {{store.parquet.flat.reader.bulk}} set to
{{{}true{}}}(by default).
Attaching the parquet file for the reproduce: [^test.parquet].
Query: {{select log, app_name from dfs.tmp.`test.parquet`}}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)