[jira] [Commented] (PARQUET-2249) Parquet spec (parquet.thrift) is inconsistent w.r.t. ColumnIndex + NaNs

2023-02-19 Thread Gang Wu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691009#comment-17691009 ] Gang Wu commented on PARQUET-2249: -- When a page only contains NaN, the page statistics

[jira] [Commented] (PARQUET-2237) Improve performance when filters in RowGroupFilter can match exactly

2023-02-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691008#comment-17691008 ] ASF GitHub Bot commented on PARQUET-2237: - wgtmac commented on PR #1023: URL: h

[GitHub] [parquet-mr] wgtmac commented on pull request #1023: PARQUET-2237 Improve performance when filters in RowGroupFilter can match exactly

2023-02-19 Thread via GitHub
wgtmac commented on PR #1023: URL: https://github.com/apache/parquet-mr/pull/1023#issuecomment-1436339171 cc @zhongyujiang Not sure if you are interested in reviewing this PR.

[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-02-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17690886#comment-17690886 ] ASF GitHub Bot commented on PARQUET-2160: - pan3793 commented on PR #982: URL: h

[GitHub] [parquet-mr] pan3793 commented on pull request #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time.

2023-02-19 Thread via GitHub
pan3793 commented on PR #982: URL: https://github.com/apache/parquet-mr/pull/982#issuecomment-1435989086 I also encountered this memory leak when migrating data from parquet/snappy to parquet/zstd. Spark executors always occupy unreasonable off-heap memory and have a high risk of being kill

[jira] [Created] (PARQUET-2249) Parquet spec (parquet.thrift) is inconsistent w.r.t. ColumnIndex + NaNs

2023-02-19 Thread Jan Finis (Jira)
Jan Finis created PARQUET-2249: -- Summary: Parquet spec (parquet.thrift) is inconsistent w.r.t. ColumnIndex + NaNs Key: PARQUET-2249 URL: https://issues.apache.org/jira/browse/PARQUET-2249 Project: Parque