[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695499#comment-17695499
]
ASF GitHub Bot commented on PARQUET-2159:
-
wgtmac commented on PR #1011:
URL: https://github.com/apache/parquet-mr/pull/1011#issuecomment-1451385421
I'd request sign off from @gszadovszky @shangxinli
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695418#comment-17695418
]
ASF GitHub Bot commented on PARQUET-2159:
-
jiangjiguang commented on code in PR #1011:
URL: https://github.com/apache/parquet-mr/pull/1011#discussion_r1122538089
##
.github/workflows/vector-plugins.yml:
##
@@ -0,0 +1,56 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreem
[
https://issues.apache.org/jira/browse/PARQUET-2251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mars resolved PARQUET-2251.
---
Resolution: Fixed
> Avoid generating Bloomfilter when all pages of a column are encoded by
> dictionary
>
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695408#comment-17695408
]
ASF GitHub Bot commented on PARQUET-2159:
-
jiangjiguang commented on code in PR
[
https://issues.apache.org/jira/browse/PARQUET-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695407#comment-17695407
]
ASF GitHub Bot commented on PARQUET-2252:
-
wgtmac commented on code in PR #1038:
URL: https://github.com/apache/parquet-mr/pull/1038#discussion_r1122511179
##
parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetFileReader.java:
##
@@ -1011,6 +1012,35 @@ public PageReadStore readFilteredRowGroup(int blockIndex)
jiangjiguang commented on code in PR #1011:
URL: https://github.com/apache/parquet-mr/pull/1011#discussion_r1121226362
##
.github/workflows/vector-plugins.yml:
##
@@ -0,0 +1,56 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreem
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695406#comment-17695406
]
ASF GitHub Bot commented on PARQUET-2159:
-
jiangjiguang commented on code in PR #1011:
URL: https://github.com/apache/parquet-mr/pull/1011#discussion_r1121226362
##
.github/workflows/vector-plugins.yml:
##
@@ -0,0 +1,56 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreem
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695405#comment-17695405
]
ASF GitHub Bot commented on PARQUET-2159:
-
jiangjiguang commented on code in PR #1011:
URL: https://github.com/apache/parquet-mr/pull/1011#discussion_r1121226362
##
.github/workflows/vector-plugins.yml:
##
@@ -0,0 +1,56 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreem
wgtmac commented on code in PR #190:
URL: https://github.com/apache/parquet-format/pull/190#discussion_r1122508344
##
README.md:
##
@@ -132,6 +132,7 @@ readers and writers for the format. The types are:
- FLOAT: IEEE 32-bit floating point values
- DOUBLE: IEEE 64-bit floa
[
https://issues.apache.org/jira/browse/PARQUET-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695180#comment-17695180
]
ASF GitHub Bot commented on PARQUET-2252:
-
gszadovszky commented on PR #1038:
URL: https://github.com/apache/parquet-mr/pull/1038#issuecomment-1450409997
Since these are already used in Iceberg, I think it is better to keep them
public and maintain backward compatibility.
--
[
https://issues.apache.org/jira/browse/PARQUET-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695171#comment-17695171
]
ASF GitHub Bot commented on PARQUET-2230:
-
wgtmac merged PR #1036:
URL: https://github.com/apache/parquet-mr/pull/1036
--
To unsubscribe, e-mail: dev-unsubscr...@parquet.apac
[
https://issues.apache.org/jira/browse/PARQUET-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695170#comment-17695170
]
ASF GitHub Bot commented on PARQUET-2230:
-
wgtmac commented on PR #1036:
URL: https://github.com/apache/parquet-mr/pull/1036#issuecomment-1450365870
> (Congrats for the committership! From now on I won't push your PRs. 😉 )
Thank you for your help all the time! @gszadovszky
--
[
https://issues.apache.org/jira/browse/PARQUET-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695166#comment-17695166
]
ASF GitHub Bot commented on PARQUET-2252:
-
wgtmac commented on PR #1038:
URL: https://github.com/apache/parquet-mr/pull/1038#issuecomment-1450359195
@gszadovszky @shangxinli Do you have any concerns?
--
> What are the reasons for forcing the dictionary to be the first page?
This is by design. I guess it benefits sequential scans: the dictionary
page is read first, followed by the data pages that hold its encoded
indices. Otherwise we would need to seek anyway.
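The sequential-scan argument can be illustrated with a small sketch. This is a toy model only, not the parquet-mr API: the page tuples, the "dict"/"data" tags, and the function name scan_column_chunk are all hypothetical and chosen for illustration. It assumes a column chunk is a list of pages in file order and that each data page stores indices into the dictionary.

```python
from typing import Iterable, Iterator, List, Tuple

# Hypothetical page model: ("dict", values) or ("data", indices).
# This is an assumption for illustration, not the real Parquet page layout.
Page = Tuple[str, List]


def scan_column_chunk(pages: Iterable[Page]) -> Iterator:
    """Decode pages in file order using a single forward pass."""
    dictionary = None
    for kind, payload in pages:
        if kind == "dict":
            # The dictionary is loaded before any data page that refers to it.
            dictionary = payload
        elif kind == "data":
            if dictionary is None:
                # With the dictionary stored later, the reader would have to
                # seek ahead to fetch it before decoding this page.
                raise ValueError("data page seen before its dictionary page")
            for index in payload:
                yield dictionary[index]
        else:
            raise ValueError(f"unknown page kind: {kind!r}")


if __name__ == "__main__":
    chunk = [("dict", ["a", "b"]), ("data", [0, 1, 1]), ("data", [1, 0])]
    print(list(scan_column_chunk(chunk)))
```

With the dictionary page first, the reader never has to look backward or jump forward; in this toy model, a chunk that starts with a data page fails immediately, which stands in for the extra seek mentioned above.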
> can this be changed to allow for
[
https://issues.apache.org/jira/browse/PARQUET-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695073#comment-17695073
]
ASF GitHub Bot commented on PARQUET-2230:
-
gszadovszky commented on PR #1036:
URL: https://github.com/apache/parquet-mr/pull/1036#issuecomment-1450142055
(Congrats for the committership! From now on I won't push your PRs. :wink: )
--
[
https://issues.apache.org/jira/browse/PARQUET-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695070#comment-17695070
]
ASF GitHub Bot commented on PARQUET-2230:
-
gszadovszky commented on PR #1036:
URL: https://github.com/apache/parquet-mr/pull/1036#issuecomment-1450139433
Thanks a lot, @wgtmac. It looks good to me.
--
[
https://issues.apache.org/jira/browse/PARQUET-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695023#comment-17695023
]
ASF GitHub Bot commented on PARQUET-2252:
-
zhongyujiang commented on PR #1038:
URL: https://github.com/apache/parquet-mr/pull/1038#issuecomment-1449991193
@wgtmac @rdblue Can you please help review this?
--
[
https://issues.apache.org/jira/browse/PARQUET-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695022#comment-17695022
]
ASF GitHub Bot commented on PARQUET-2252:
-
zhongyujiang opened a new pull request, #1038:
URL: https://github.com/apache/parquet-mr/pull/1038
…implement page skipping.
Issue: [PARQUET-2252](https://issues.apache.org/jira/browse/PARQUET-2252)
This PR makes some methods required to implement column index filter public
to
Yujiang Zhong created PARQUET-2252:
--
Summary: Make some methods public to allow external projects to
implement page skipping
Key: PARQUET-2252
URL: https://issues.apache.org/jira/browse/PARQUET-2252
Hi Gang,
thanks for your reply.
On 01.03.23 03:09, Gang Wu wrote:
If at least one record in the beginning 2 rows is not null, then the
encoded size will be much better.
That is the workaround I have been using for the past weeks, although my
tests show that at least two values are require