[ https://issues.apache.org/jira/browse/DRILL-8139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Turton resolved DRILL-8139. --------------------------------- Resolution: Fixed > Parquet CodecFactory thread safety bug > -------------------------------------- > > Key: DRILL-8139 > URL: https://issues.apache.org/jira/browse/DRILL-8139 > Project: Apache Drill > Issue Type: Bug > Components: Storage - Parquet > Affects Versions: 1.19.0 > Reporter: James Turton > Assignee: James Turton > Priority: Blocker > Attachments: recording.mp4 > > > Update: PARQUET-2126 has been opened to describe the underlying thread safety > problem in parquet-mr. The rdblue/brotli-codec is also affected. > In previously released versions of Drill, back to at least 1.17, this bug > only appears under the combination of the async column reader and the _sync_ > page reader, as per the reproduction script below. In master, the bug > appears under the async column reader and both the sync and async page > readers. > {code:java} > set `store.parquet.compression` = 'gzip'; > drop table if exists dfs.tmp.m; > create table dfs.tmp.m as select * from cp.`tpch/supplier.parquet`; > set `store.parquet.reader.pagereader.async` = false; > set `store.parquet.reader.columnreader.async` = true; > select * from dfs.tmp.m order by s_suppkey; – repeat this last query and > watch the returned data. Eventually you will also failed queries or JVM > crashes > {code} > -- This message was sent by Atlassian Jira (v8.20.1#820001)