Could someone add me to the sync invite?

2020-12-15 Thread Jason Altekruse
Hi All, I'm interested in attending the next sync to discuss the possibility of not deprecating summary files, for the use case described in this previous thread. While Delta.io is in the state that it is without an open source implementation of partition pruning, summary files are a pretty easy

[jira] [Commented] (PARQUET-1954) TCP connection leak in parquet dump

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17250075#comment-17250075 ] ASF GitHub Bot commented on PARQUET-1954: - StefanXiepj commented on a change in pull request

[GitHub] [parquet-mr] StefanXiepj commented on a change in pull request #849: PARQUET-1954: TCP connection leak in parquet dump

2020-12-15 Thread GitBox
StefanXiepj commented on a change in pull request #849: URL: https://github.com/apache/parquet-mr/pull/849#discussion_r543875687 ## File path: parquet-tools/src/main/java/org/apache/parquet/tools/command/DumpCommand.java ## @@ -165,48 +165,47 @@ public static void

[jira] [Commented] (PARQUET-1776) Add Java NIO Avro OutputFile InputFile

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249930#comment-17249930 ] ASF GitHub Bot commented on PARQUET-1776: - fommil commented on a change in pull request #815:

[GitHub] [parquet-mr] fommil commented on a change in pull request #815: PARQUET-1776: NIO wrapper for Output/Input File

2020-12-15 Thread GitBox
fommil commented on a change in pull request #815: URL: https://github.com/apache/parquet-mr/pull/815#discussion_r543682706 ## File path: parquet-common/src/main/java/org/apache/parquet/io/LocalSeekableInputStream.java ## @@ -0,0 +1,47 @@ +/* + * Licensed to the Apache

[jira] [Commented] (PARQUET-1776) Add Java NIO Avro OutputFile InputFile

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249929#comment-17249929 ] ASF GitHub Bot commented on PARQUET-1776: - fommil commented on a change in pull request #815:

[GitHub] [parquet-mr] fommil commented on a change in pull request #815: PARQUET-1776: NIO wrapper for Output/Input File

2020-12-15 Thread GitBox
fommil commented on a change in pull request #815: URL: https://github.com/apache/parquet-mr/pull/815#discussion_r543682706 ## File path: parquet-common/src/main/java/org/apache/parquet/io/LocalSeekableInputStream.java ## @@ -0,0 +1,47 @@ +/* + * Licensed to the Apache

[jira] [Commented] (PARQUET-1950) Define core features / compliance level

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249899#comment-17249899 ] ASF GitHub Bot commented on PARQUET-1950: - timarmstrong commented on a change in pull request

[GitHub] [parquet-format] timarmstrong commented on a change in pull request #164: PARQUET-1950: Define core features

2020-12-15 Thread GitBox
timarmstrong commented on a change in pull request #164: URL: https://github.com/apache/parquet-format/pull/164#discussion_r543638383 ## File path: CoreFeatures.md ## @@ -0,0 +1,181 @@ + + +# Parquet Core Features + +This document lists the core features for each

[jira] [Commented] (PARQUET-1126) make it easy to read and write parquet files in java without depending on hadoop

2020-12-15 Thread David Mollitor (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249875#comment-17249875 ] David Mollitor commented on PARQUET-1126: - Also check out some work done (Waiting in GitHub PR)

[jira] [Commented] (PARQUET-1126) make it easy to read and write parquet files in java without depending on hadoop

2020-12-15 Thread Sam Halliday (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249856#comment-17249856 ] Sam Halliday commented on PARQUET-1126: --- in PARQUET-1953 I left some code that means that the

[jira] [Resolved] (PARQUET-1953) hadoop-common is not an optional dependency

2020-12-15 Thread Sam Halliday (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sam Halliday resolved PARQUET-1953. --- Resolution: Fixed > hadoop-common is not an optional dependency >

[jira] [Commented] (PARQUET-1953) hadoop-common is not an optional dependency

2020-12-15 Thread Sam Halliday (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249852#comment-17249852 ] Sam Halliday commented on PARQUET-1953: --- yup, fair enough, this is a dupe. > hadoop-common is

[jira] [Created] (PARQUET-1955) ParquetThriftOutputFormat does not store any optional primitive values in parquet file

2020-12-15 Thread Daniel Morales (Jira)
Daniel Morales created PARQUET-1955: --- Summary: ParquetThriftOutputFormat does not store any optional primitive values in parquet file Key: PARQUET-1955 URL: https://issues.apache.org/jira/browse/PARQUET-1955

[jira] [Commented] (PARQUET-1954) TCP connection leak in parquet dump

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249775#comment-17249775 ] ASF GitHub Bot commented on PARQUET-1954: - gszadovszky commented on a change in pull request

[GitHub] [parquet-mr] gszadovszky commented on a change in pull request #849: PARQUET-1954: TCP connection leak in parquet dump

2020-12-15 Thread GitBox
gszadovszky commented on a change in pull request #849: URL: https://github.com/apache/parquet-mr/pull/849#discussion_r543485871 ## File path: parquet-tools/src/main/java/org/apache/parquet/tools/command/DumpCommand.java ## @@ -165,48 +165,47 @@ public static void

[jira] [Commented] (PARQUET-1954) TCP connection leak in parquet dump

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249773#comment-17249773 ] ASF GitHub Bot commented on PARQUET-1954: - StefanXiepj commented on pull request #849: URL:

[GitHub] [parquet-mr] StefanXiepj commented on pull request #849: PARQUET-1954: TCP connection leak in parquet dump

2020-12-15 Thread GitBox
StefanXiepj commented on pull request #849: URL: https://github.com/apache/parquet-mr/pull/849#issuecomment-745393587 @Pavitheran Could you review this PR , thanks. This is an automated message from the Apache Git Service.

[jira] [Commented] (PARQUET-1954) TCP connection leak in parquet dump

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249726#comment-17249726 ] ASF GitHub Bot commented on PARQUET-1954: - StefanXiepj opened a new pull request #849: URL:

[GitHub] [parquet-mr] StefanXiepj opened a new pull request #849: PARQUET-1954: TCP connection leak in parquet dump

2020-12-15 Thread GitBox
StefanXiepj opened a new pull request #849: URL: https://github.com/apache/parquet-mr/pull/849 Make sure you have checked _all_ steps below. ### Jira My PR addresses the following [PARQUET-1954](https://issues.apache.org/jira/browse/PARQUET-1954T/) ### Tests does not

[jira] [Commented] (PARQUET-1953) hadoop-common is not an optional dependency

2020-12-15 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249700#comment-17249700 ] Gabor Szadovszky commented on PARQUET-1953: --- There are a couple of jiras about similar

[jira] [Commented] (PARQUET-1954) TCP connection leak in parquet dump

2020-12-15 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249689#comment-17249689 ] Gabor Szadovszky commented on PARQUET-1954: --- Good catch, [~xiepengjie]. Would you like to

[jira] [Updated] (PARQUET-1954) TCP connection leak in parquet dump

2020-12-15 Thread xiepengjie (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xiepengjie updated PARQUET-1954: Description: Hi, When i'm trying to dump a parquet file, i find the TCP connection leak inĀ 

[jira] [Created] (PARQUET-1954) TCP connection leak in parquet dump

2020-12-15 Thread xiepengjie (Jira)
xiepengjie created PARQUET-1954: --- Summary: TCP connection leak in parquet dump Key: PARQUET-1954 URL: https://issues.apache.org/jira/browse/PARQUET-1954 Project: Parquet Issue Type: Bug

[jira] [Updated] (PARQUET-1953) hadoop-common is not an optional dependency

2020-12-15 Thread Sam Halliday (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sam Halliday updated PARQUET-1953: -- Description: parquet-hadoop provides the only mechanism to load .parquet files and has an

[jira] [Created] (PARQUET-1953) hadoop-common is not an optional dependency

2020-12-15 Thread Sam Halliday (Jira)
Sam Halliday created PARQUET-1953: - Summary: hadoop-common is not an optional dependency Key: PARQUET-1953 URL: https://issues.apache.org/jira/browse/PARQUET-1953 Project: Parquet Issue

[jira] [Commented] (PARQUET-1950) Define core features / compliance level

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249643#comment-17249643 ] ASF GitHub Bot commented on PARQUET-1950: - pitrou commented on a change in pull request #164:

[GitHub] [parquet-format] pitrou commented on a change in pull request #164: PARQUET-1950: Define core features

2020-12-15 Thread GitBox
pitrou commented on a change in pull request #164: URL: https://github.com/apache/parquet-format/pull/164#discussion_r543278543 ## File path: CoreFeatures.md ## @@ -0,0 +1,178 @@ + + +# Parquet Core Features + +This document lists the core features for each parquet-format

[jira] [Commented] (PARQUET-1950) Define core features / compliance level

2020-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249640#comment-17249640 ] ASF GitHub Bot commented on PARQUET-1950: - pitrou commented on a change in pull request #164:

[GitHub] [parquet-format] pitrou commented on a change in pull request #164: PARQUET-1950: Define core features

2020-12-15 Thread GitBox
pitrou commented on a change in pull request #164: URL: https://github.com/apache/parquet-format/pull/164#discussion_r543277575 ## File path: CoreFeatures.md ## @@ -0,0 +1,181 @@ + + +# Parquet Core Features + +This document lists the core features for each parquet-format