[jira] [Created] (PARQUET-2356) Fix typo in DeltaBinaryPackingValuesWriter

2023-09-29 Thread Xuwei Fu (Jira)
Xuwei Fu created PARQUET-2356: - Summary: Fix typo in DeltaBinaryPackingValuesWriter Key: PARQUET-2356 URL: https://issues.apache.org/jira/browse/PARQUET-2356 Project: Parquet Issue Type: Improvem

[jira] [Commented] (PARQUET-2222) [Format] RLE encoding spec incorrect for v2 data pages

2023-06-14 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17732652#comment-17732652 ] Xuwei Fu commented on PARQUET-: --- Yes this answers my question. I think arrow parq

[jira] [Commented] (PARQUET-2222) [Format] RLE encoding spec incorrect for v2 data pages

2023-06-09 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17730992#comment-17730992 ] Xuwei Fu commented on PARQUET-: --- [~gszadovszky] Hi Gabor maybe I'm misleading. 1

[jira] [Commented] (PARQUET-2222) [Format] RLE encoding spec incorrect for v2 data pages

2023-06-09 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17730983#comment-17730983 ] Xuwei Fu commented on PARQUET-: --- I think in cpp, in 12.0.0, even if it's Format V

[jira] [Commented] (PARQUET-2256) Adding Compression for BloomFilter

2023-03-17 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17701577#comment-17701577 ] Xuwei Fu commented on PARQUET-2256: --- [~gszadovszky] Yes, I'd like to. I think having

[jira] [Created] (PARQUET-2256) Adding Compression for BloomFilter

2023-03-13 Thread Xuwei Fu (Jira)
Xuwei Fu created PARQUET-2256: - Summary: Adding Compression for BloomFilter Key: PARQUET-2256 URL: https://issues.apache.org/jira/browse/PARQUET-2256 Project: Parquet Issue Type: Improvement

[jira] [Created] (PARQUET-2255) BloomFilter and float point is ambiguous

2023-03-13 Thread Xuwei Fu (Jira)
Xuwei Fu created PARQUET-2255: - Summary: BloomFilter and float point is ambiguous Key: PARQUET-2255 URL: https://issues.apache.org/jira/browse/PARQUET-2255 Project: Parquet Issue Type: Improvemen

[jira] [Commented] (PARQUET-2222) [Format] RLE encoding spec incorrect for v2 data pages

2023-02-27 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693874#comment-17693874 ] Xuwei Fu commented on PARQUET-: --- ok, I got it. Previously I found `RLE` format re

[jira] [Commented] (PARQUET-2222) [Format] RLE encoding spec incorrect for v2 data pages

2023-02-26 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693798#comment-17693798 ] Xuwei Fu commented on PARQUET-: --- I don't understand. Isn't length the part of enc

[jira] [Commented] (PARQUET-2249) Parquet spec (parquet.thrift) is inconsistent w.r.t. ColumnIndex + NaNs

2023-02-20 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691257#comment-17691257 ] Xuwei Fu commented on PARQUET-2249: --- I guess maybe we can take a look at: # [https:/

[jira] [Commented] (PARQUET-2249) Parquet spec (parquet.thrift) is inconsistent w.r.t. ColumnIndex + NaNs

2023-02-20 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691256#comment-17691256 ] Xuwei Fu commented on PARQUET-2249: --- I guess NaN is not always larger than all values

[jira] [Comment Edited] (PARQUET-2249) Parquet spec (parquet.thrift) is inconsistent w.r.t. ColumnIndex + NaNs

2023-02-20 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691179#comment-17691179 ] Xuwei Fu edited comment on PARQUET-2249 at 2/20/23 1:30 PM:

[jira] [Commented] (PARQUET-2249) Parquet spec (parquet.thrift) is inconsistent w.r.t. ColumnIndex + NaNs

2023-02-20 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691179#comment-17691179 ] Xuwei Fu commented on PARQUET-2249: --- Seems that iceberg provides NaN counts. And min-

[jira] [Created] (PARQUET-2241) ByteStreamSplitDecoder broken in presence of nulls

2023-02-08 Thread Xuwei Fu (Jira)
Xuwei Fu created PARQUET-2241: - Summary: ByteStreamSplitDecoder broken in presence of nulls Key: PARQUET-2241 URL: https://issues.apache.org/jira/browse/PARQUET-2241 Project: Parquet Issue Type:

[jira] [Commented] (PARQUET-1622) Add BYTE_STREAM_SPLIT encoding

2023-01-17 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17678055#comment-17678055 ] Xuwei Fu commented on PARQUET-1622: --- [~gszadovszky] [~martinradev] Hi all, I meet a