[jira] [Commented] (PARQUET-1381) Add merge blocks command to parquet-tools

2023-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17745963#comment-17745963 ] ASF GitHub Bot commented on PARQUET-1381: - wgtmac commented on code in PR #1121

[GitHub] [parquet-mr] wgtmac commented on a diff in pull request #1121: PARQUET-1381: Support merging of rowgroups during file rewrite

2023-07-22 Thread via GitHub
wgtmac commented on code in PR #1121: URL: https://github.com/apache/parquet-mr/pull/1121#discussion_r1271309683 ## parquet-hadoop/src/main/java/org/apache/parquet/hadoop/rewrite/RowGroupMerger.java: ## @@ -0,0 +1,657 @@ +/* + * Licensed to the Apache Software Foundation (ASF) u

Rewrite Parquet List columns

2023-07-22 Thread Rajesh Mahindra
Hey folks, I have a bunch of parquets written with Level 2 list columns (among other columns). I was trying to extend the Parquet Rewrite tool to be able to read those parquet and only rewrite the list columns as Level 3. Any pointers on which classes or APIs i should leverage for this purpose? An