[jira] [Created] (PARQUET-1264) Update Javadoc for Java 1.8

2018-03-30 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1264: -- Summary: Update Javadoc for Java 1.8 Key: PARQUET-1264 URL: https://issues.apache.org/jira/browse/PARQUET-1264 Project: Parquet Issue Type: Improvement

[jira] [Resolved] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1263. Resolution: Fixed Assignee: Ryan Blue Merged #464. > ParquetReader's builder should use Con

[jira] [Commented] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421024#comment-16421024 ] ASF GitHub Bot commented on PARQUET-1263: - rdblue closed pull request #464: PARQ

[jira] [Commented] (PARQUET-1183) AvroParquetWriter needs OutputFile based Builder

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421017#comment-16421017 ] ASF GitHub Bot commented on PARQUET-1183: - rdblue closed pull request #460: PARQ

[jira] [Commented] (PARQUET-1183) AvroParquetWriter needs OutputFile based Builder

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421018#comment-16421018 ] ASF GitHub Bot commented on PARQUET-1183: - rdblue closed pull request #446: PARQ

[jira] [Resolved] (PARQUET-1183) AvroParquetWriter needs OutputFile based Builder

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1183. Resolution: Fixed Assignee: Ryan Blue Merged #460. Thanks [~zi] for reviewing! > AvroParque

[jira] [Commented] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421014#comment-16421014 ] ASF GitHub Bot commented on PARQUET-1263: - danielcweeks commented on issue #464:

[jira] [Commented] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421004#comment-16421004 ] ASF GitHub Bot commented on PARQUET-1263: - rdblue opened a new pull request #464

[jira] [Created] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread Ryan Blue (JIRA)
Ryan Blue created PARQUET-1263: -- Summary: ParquetReader's builder should use Configuration from the InputFile Key: PARQUET-1263 URL: https://issues.apache.org/jira/browse/PARQUET-1263 Project: Parquet

[jira] [Updated] (PARQUET-1263) ParquetReader's builder should use Configuration from the InputFile

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1263: --- Fix Version/s: 1.10.0 > ParquetReader's builder should use Configuration from the InputFile > --

[jira] [Resolved] (PARQUET-1184) Make DelegatingPositionOutputStream a concrete class

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1184. Resolution: Won't Fix Fix Version/s: (was: 1.10.0) > Make DelegatingPositionOutputStrea

[jira] [Commented] (PARQUET-1184) Make DelegatingPositionOutputStream a concrete class

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16420982#comment-16420982 ] Ryan Blue commented on PARQUET-1184: The reason why this is an abstract class is so

[jira] [Updated] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1028: --- Fix Version/s: 1.10.0 > [JAVA] When reading old Spark-generated files with INT96, stats are reported

[jira] [Resolved] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-1028. Resolution: Fixed Assignee: Zoltan Ivanfi > [JAVA] When reading old Spark-generated files wi

[jira] [Commented] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16420962#comment-16420962 ] Ryan Blue commented on PARQUET-1028: This was fixed by PARQUET-1065. The expected so

[jira] [Updated] (PARQUET-1055) Improve the creation of ExecutorService when reading footers

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1055: --- Fix Version/s: (was: 1.9.1) > Improve the creation of ExecutorService when reading footers > ---

[jira] [Updated] (PARQUET-1028) [JAVA] When reading old Spark-generated files with INT96, stats are reported as valid when they aren't

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1028: --- Fix Version/s: (was: 1.9.1) > [JAVA] When reading old Spark-generated files with INT96, stats ar

[jira] [Updated] (PARQUET-1174) Concurrent read micro benchmarks

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1174: --- Fix Version/s: (was: 1.9.1) > Concurrent read micro benchmarks > ---

[jira] [Updated] (PARQUET-796) Delta Encoding is not used when dictionary enabled

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-796: -- Fix Version/s: (was: 1.9.1) > Delta Encoding is not used when dictionary enabled >

[jira] [Updated] (PARQUET-1153) Parquet-thrift doesn't compile with Thrift 0.10.0

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1153: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Parquet-thrift doesn't compile with Thri

[jira] [Resolved] (PARQUET-777) Add new Parquet CLI tools

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue resolved PARQUET-777. --- Resolution: Fixed > Add new Parquet CLI tools > - > > Key: PA

[jira] [Updated] (PARQUET-1152) Parquet-thrift doesn't compile with Thrift 0.9.3

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1152: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Parquet-thrift doesn't compile with Thri

[jira] [Updated] (PARQUET-777) Add new Parquet CLI tools

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-777: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Add new Parquet CLI tools > ---

[jira] [Updated] (PARQUET-1135) upgrade thrift and protobuf dependencies

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1135: --- Fix Version/s: (was: 1.9.1) 1.10.0 > upgrade thrift and protobuf dependencies

[jira] [Updated] (PARQUET-1115) Warn users when misusing parquet-tools merge

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1115: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Warn users when misusing parquet-tools m

[jira] [Updated] (PARQUET-1149) Upgrade Avro dependancy to 1.8.2

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1149: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Upgrade Avro dependancy to 1.8.2 > -

[jira] [Updated] (PARQUET-1141) IDs are dropped in metadata conversion

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1141: --- Fix Version/s: (was: 1.9.1) 1.10.0 > IDs are dropped in metadata conversion >

[jira] [Updated] (PARQUET-1025) Support new min-max statistics in parquet-mr

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1025: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Support new min-max statistics in parque

[jira] [Updated] (PARQUET-1077) [MR] Switch to long key ids in KEYs file

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1077: --- Fix Version/s: (was: 1.9.1) > [MR] Switch to long key ids in KEYs file > ---

[jira] [Updated] (PARQUET-791) Predicate pushing down on missing columns should work on UserDefinedPredicate too

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-791: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Predicate pushing down on missing columns s

[jira] [Updated] (PARQUET-1024) allow for case insensitive parquet-xxx prefix in PR title

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1024: --- Fix Version/s: (was: 1.9.1) 1.10.0 > allow for case insensitive parquet-xxx p

[jira] [Updated] (PARQUET-1005) Fix DumpCommand parsing to allow column projection

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1005: --- Fix Version/s: (was: 1.9.1) 1.10.0 > Fix DumpCommand parsing to allow column

[jira] [Updated] (PARQUET-1026) allow unsigned binary stats when min == max

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-1026: --- Fix Version/s: (was: 1.9.1) 1.10.0 > allow unsigned binary stats when min ==

[jira] [Updated] (PARQUET-801) Allow UserDefinedPredicates in DictionaryFilter

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-801: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Allow UserDefinedPredicates in DictionaryFi

[jira] [Updated] (PARQUET-321) Set the HDFS padding default to 8MB

2018-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated PARQUET-321: -- Fix Version/s: (was: 1.9.1) 1.10.0 > Set the HDFS padding default to 8MB > -

[jira] [Commented] (PARQUET-1251) Clarify ambiguous min/max stats for FLOAT/DOUBLE

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16420942#comment-16420942 ] ASF GitHub Bot commented on PARQUET-1251: - rdblue commented on issue #88: PARQUE

Re: parquet-mr next release with PARQUET-1217?

2018-03-30 Thread Ryan Blue
I have no plan for 1.9.1. On Fri, Mar 30, 2018 at 10:42 AM, Henry Robinson wrote: > Great! Do you know of any plans to do a 1.9.1? > > On 30 March 2018 at 09:35, Ryan Blue wrote: > >> I'm planning on getting a 1.10.0 rc out today, if I don't find problems >> with the stats changes. >> >> On Thu

Re: parquet-mr next release with PARQUET-1217?

2018-03-30 Thread Henry Robinson
Great! Do you know of any plans to do a 1.9.1? On 30 March 2018 at 09:35, Ryan Blue wrote: > I'm planning on getting a 1.10.0 rc out today, if I don't find problems > with the stats changes. > > On Thu, Mar 29, 2018 at 4:18 PM, Henry Robinson wrote: > > > Hi all - > > > > While using Spark, I g

Re: parquet-mr next release with PARQUET-1217?

2018-03-30 Thread Ryan Blue
I'm planning on getting a 1.10.0 rc out today, if I don't find problems with the stats changes. On Thu, Mar 29, 2018 at 4:18 PM, Henry Robinson wrote: > Hi all - > > While using Spark, I got hit by PARQUET-1217 today on some data written by > Impala. This is a pretty nasty bug, and one that affe

[jira] [Commented] (PARQUET-1143) Update Java for format 2.4.0 changes

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16420676#comment-16420676 ] ASF GitHub Bot commented on PARQUET-1143: - rdblue commented on issue #430: PARQU

[jira] [Commented] (PARQUET-1143) Update Java for format 2.4.0 changes

2018-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16420243#comment-16420243 ] ASF GitHub Bot commented on PARQUET-1143: - scottcarey commented on issue #430: P