[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705195#comment-17705195 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148822969 ## src/main/thrift/parquet.thrift: ## @@ -190,6 +190,35 @@ enum FieldRepetitionType { /** The field is repeated and can contain 0 or more values */ REPEA

Re: [VOTE] Release Apache Parquet 1.12.4 RC0

2023-03-26 Thread Wang, Yuming
+1. Tested this release through Spark UT: https://github.com/apache/spark/pull/40555. From: Gang Wu Date: Sunday, March 26, 2023 at 22:42 To: dev@parquet.apache.org Subject: [VOTE] Release Apache Parquet 1.12.4 RC0 Hi everyone, I propose the following RC to be released as the

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705180#comment-17705180 ] ASF GitHub Bot commented on PARQUET-2261: - wgtmac commented on code in PR #197:

[GitHub] [parquet-format] wgtmac commented on a diff in pull request #197: PARQUET-2261: Proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
wgtmac commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148792682 ## src/main/thrift/parquet.thrift: ## @@ -190,6 +190,35 @@ enum FieldRepetitionType { /** The field is repeated and can contain 0 or more values */ REPEATED =

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705179#comment-17705179 ] ASF GitHub Bot commented on PARQUET-2261: - wgtmac commented on code in PR #197:

[GitHub] [parquet-format] wgtmac commented on a diff in pull request #197: PARQUET-2261: Proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
wgtmac commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148791153 ## src/main/thrift/parquet.thrift: ## @@ -190,6 +190,35 @@ enum FieldRepetitionType { /** The field is repeated and can contain 0 or more values */ REPEATED =

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705176#comment-17705176 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148775015 ## src/main/thrift/parquet.thrift: ## @@ -190,6 +190,35 @@ enum FieldRepetitionType { /** The field is repeated and can contain 0 or more values */ REPEA

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705175#comment-17705175 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148770596 ## src/main/thrift/parquet.thrift: ## @@ -190,6 +190,35 @@ enum FieldRepetitionType { /** The field is repeated and can contain 0 or more values */ REPEA

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705171#comment-17705171 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148766758 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; +

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705170#comment-17705170 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148766758 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; +

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705167#comment-17705167 ] ASF GitHub Bot commented on PARQUET-2261: - wgtmac commented on PR #197: URL: ht

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705166#comment-17705166 ] ASF GitHub Bot commented on PARQUET-2261: - wgtmac commented on code in PR #197:

[GitHub] [parquet-format] wgtmac commented on pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
wgtmac commented on PR #197: URL: https://github.com/apache/parquet-format/pull/197#issuecomment-1484449457 @gszadovszky @shangxinli Do you have time to take a look? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [parquet-format] wgtmac commented on a diff in pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
wgtmac commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148758018 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; + /** T

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705161#comment-17705161 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148748310 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; +

[jira] [Commented] (PARQUET-2262) Fix local build failure from maven-surefire-plugin due to missing surefire.argLine

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705156#comment-17705156 ] ASF GitHub Bot commented on PARQUET-2262: - wgtmac opened a new pull request, #1

[GitHub] [parquet-mr] wgtmac opened a new pull request, #1045: PARQUET-2262: Fix local build failure due to missing surefire.argLine

2023-03-26 Thread via GitHub
wgtmac opened a new pull request, #1045: URL: https://github.com/apache/parquet-mr/pull/1045 ### Jira https://issues.apache.org/jira/browse/PARQUET-2262 ### Tests Tested it locally. ### Commits Add a default `surefire.argLine` property to make local test h

[jira] [Created] (PARQUET-2262) Fix local build failure from maven-surefire-plugin due to missing surefire.argLine

2023-03-26 Thread Gang Wu (Jira)
Gang Wu created PARQUET-2262: Summary: Fix local build failure from maven-surefire-plugin due to missing surefire.argLine Key: PARQUET-2262 URL: https://issues.apache.org/jira/browse/PARQUET-2262 Project:

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705153#comment-17705153 ] ASF GitHub Bot commented on PARQUET-2261: - wgtmac commented on code in PR #197:

[GitHub] [parquet-format] wgtmac commented on a diff in pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
wgtmac commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148704782 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; + /** T

[jira] [Commented] (PARQUET-2224) Publish SBOM artifacts

2023-03-26 Thread Gang Wu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705145#comment-17705145 ] Gang Wu commented on PARQUET-2224: -- Thanks for reminding me. I have assigned it to you

[jira] [Assigned] (PARQUET-2224) Publish SBOM artifacts

2023-03-26 Thread Gang Wu (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Wu reassigned PARQUET-2224: Assignee: Dongjoon Hyun > Publish SBOM artifacts > -- > > Ke

[jira] [Commented] (PARQUET-2224) Publish SBOM artifacts

2023-03-26 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705142#comment-17705142 ] Dongjoon Hyun commented on PARQUET-2224: Thank you for resolving this, [~wgtmac

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705113#comment-17705113 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148608048 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; +

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705106#comment-17705106 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148600428 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; +

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705104#comment-17705104 ] ASF GitHub Bot commented on PARQUET-2261: - emkornfield commented on code in PR

[GitHub] [parquet-format] emkornfield commented on a diff in pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
emkornfield commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148598484 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; +

[jira] [Commented] (PARQUET-2261) [Format] Add statistics that reflect decoded size to metadata

2023-03-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705095#comment-17705095 ] ASF GitHub Bot commented on PARQUET-2261: - wgtmac commented on code in PR #197:

[GitHub] [parquet-format] wgtmac commented on a diff in pull request #197: PARQUET-2261: Initial proposal for unencoded/uncompressed statistics

2023-03-26 Thread via GitHub
wgtmac commented on code in PR #197: URL: https://github.com/apache/parquet-format/pull/197#discussion_r1148580752 ## src/main/thrift/parquet.thrift: ## @@ -223,6 +223,17 @@ struct Statistics { */ 5: optional binary max_value; 6: optional binary min_value; + /** T

[VOTE] Release Apache Parquet 1.12.4 RC0

2023-03-26 Thread Gang Wu
Hi everyone, I propose the following RC to be released as the official Apache Parquet 1.12.4 release. The commit id is 22069e58494e7cb5d50e664c7ffa1cf1468404f8 * This corresponds to the tag: apache-parquet-1.12.4-rc0 * https://github.com/apache/parquet-mr/tree/22069e58494e7cb5d50e664c7ffa1cf14684