[jira] [Commented] (PARQUET-284) Should use ConcurrentHashMap instead of HashMap in ParquetMetadataConverter

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592964#comment-14592964 ] Alex Levenson commented on PARQUET-284: --- welp, I'm wrong: https://gist.github.com/r

[jira] [Commented] (PARQUET-284) Should use ConcurrentHashMap instead of HashMap in ParquetMetadataConverter

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592963#comment-14592963 ] Alex Levenson commented on PARQUET-284: --- [~yanzt], I've sent a PR to clean up this

[jira] [Commented] (PARQUET-284) Should use ConcurrentHashMap instead of HashMap in ParquetMetadataConverter

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592925#comment-14592925 ] Alex Levenson commented on PARQUET-284: --- Update: the cache was actually never being

[jira] [Created] (PARQUET-310) ParquetMetadataConverter.java is too long

2015-06-18 Thread Alex Levenson (JIRA)
Alex Levenson created PARQUET-310: - Summary: ParquetMetadataConverter.java is too long Key: PARQUET-310 URL: https://issues.apache.org/jira/browse/PARQUET-310 Project: Parquet Issue Type: Bug

[jira] [Commented] (PARQUET-284) Should use ConcurrentHashMap instead of HashMap in ParquetMetadataConverter

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592873#comment-14592873 ] Alex Levenson commented on PARQUET-284: --- How important is it to cache this state? S

[jira] [Assigned] (PARQUET-284) Should use ConcurrentHashMap instead of HashMap in ParquetMetadataConverter

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Levenson reassigned PARQUET-284: - Assignee: Alex Levenson > Should use ConcurrentHashMap instead of HashMap in ParquetMeta

[jira] [Assigned] (PARQUET-201) Column with OriginalType INT_8 failed at filtering

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Levenson reassigned PARQUET-201: - Assignee: Alex Levenson > Column with OriginalType INT_8 failed at filtering > -

RE: Cannot assign Parquet issues to myself

2015-06-18 Thread Xu, Cheng A
Hi Ryan, Could you help me with the permission issue as well? My username is Ferd. Thank you! -Original Message- From: Nezih Yigitbasi [mailto:nyigitb...@netflix.com.INVALID] Sent: Friday, June 19, 2015 4:41 AM To: dev@parquet.apache.org Subject: Re: Cannot assign Parquet issues to myself

[jira] [Commented] (PARQUET-297) created_by in file meta data doesn't contain parquet library version

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592763#comment-14592763 ] Alex Levenson commented on PARQUET-297: --- tests: https://github.com/apache/parquet-m

[jira] [Resolved] (PARQUET-297) created_by in file meta data doesn't contain parquet library version

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Levenson resolved PARQUET-297. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 213 [https://git

[jira] [Assigned] (PARQUET-303) [Vectorized Reader] Add Testcases/Benchmarks for ParquetVectorizedReader

2015-06-18 Thread Nezih Yigitbasi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nezih Yigitbasi reassigned PARQUET-303: --- Assignee: Nezih Yigitbasi > [Vectorized Reader] Add Testcases/Benchmarks for Parquet

[jira] [Assigned] (PARQUET-299) [Vectorized Reader] ColumnVector length should be in terms of rows, not DataPages

2015-06-18 Thread Zhenxiao Luo (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenxiao Luo reassigned PARQUET-299: Assignee: Nezih Yigitbasi > [Vectorized Reader] ColumnVector length should be in terms of

[jira] [Assigned] (PARQUET-298) [Vectorized Reader] Support Complex Types (Map, Array, Struct) in Parquet Vectorized Reader

2015-06-18 Thread Zhenxiao Luo (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenxiao Luo reassigned PARQUET-298: Assignee: Nezih Yigitbasi > [Vectorized Reader] Support Complex Types (Map, Array, Struct)

Re: Cannot assign Parquet issues to myself

2015-06-18 Thread Nezih Yigitbasi
Thanks Ryan. On Thu, Jun 18, 2015 at 1:31 PM, Ryan Blue wrote: > On 06/18/2015 01:28 PM, Nezih Yigitbasi wrote: > >> Hi, >> As part of our vectorization work I want to assign some Parquet jira tasks >> to me but I guess I don't have permissions for that. Can someone help >> giving me permissions

[jira] [Commented] (PARQUET-152) Encoding issue with fixed length byte arrays

2015-06-18 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592474#comment-14592474 ] Ryan Blue commented on PARQUET-152: --- I think the RLE_DICTIONARY behavior is probably be

[jira] [Assigned] (PARQUET-300) [Vectorized Reader] Make sure all encodings work in Parquet Vectorized Reader

2015-06-18 Thread Nezih Yigitbasi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nezih Yigitbasi reassigned PARQUET-300: --- Assignee: Nezih Yigitbasi > [Vectorized Reader] Make sure all encodings work in Parq

Re: problem reading parquet file

2015-06-18 Thread Sergio Pena
The second bug is on https://issues.apache.org/jira/browse/PARQUET-152 The problem is that the dictionary page size is less than the fixed byte array. Just make it equals, and you will be able to read that file. - Sergio On Thu, Jun 18, 2015 at 3:36 PM, Nezih Yigitbasi < nyigitb...@netflix.com.i

Re: problem reading parquet file

2015-06-18 Thread Nezih Yigitbasi
Yep I will, seemed like a bug to me too. Thanks, Nezih On Thu, Jun 18, 2015 at 1:33 PM, Ryan Blue wrote: > The first issue looks like the delta byte array problem: > > https://issues.apache.org/jira/browse/PARQUET-246 > > The second one looks like the write side uses delta_byte_array for fixe

Re: problem reading parquet file

2015-06-18 Thread Ryan Blue
The first issue looks like the delta byte array problem: https://issues.apache.org/jira/browse/PARQUET-246 The second one looks like the write side uses delta_byte_array for fixed, but the read side doesn't expect it. File a bug? rb On 06/18/2015 12:50 PM, Nezih Yigitbasi wrote: Hi all,

Re: Cannot assign Parquet issues to myself

2015-06-18 Thread Ryan Blue
On 06/18/2015 01:28 PM, Nezih Yigitbasi wrote: Hi, As part of our vectorization work I want to assign some Parquet jira tasks to me but I guess I don't have permissions for that. Can someone help giving me permissions for that? Thanks, Nezih Fixed. I added both of your JIRA accounts as a cont

Cannot assign Parquet issues to myself

2015-06-18 Thread Nezih Yigitbasi
Hi, As part of our vectorization work I want to assign some Parquet jira tasks to me but I guess I don't have permissions for that. Can someone help giving me permissions for that? Thanks, Nezih

problem reading parquet file

2015-06-18 Thread Nezih Yigitbasi
Hi all, I have generated some test data using the method here . What I notice is if I use WriterVersion.PARQUET_2_0, the default block and page sizes, and GZIP co

[jira] [Resolved] (PARQUET-264) Update README docs for graduation

2015-06-18 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Levenson resolved PARQUET-264. --- Resolution: Fixed Fix Version/s: (was: 1.8.0) 2.0.0 Issue resol

[jira] [Commented] (PARQUET-246) ArrayIndexOutOfBoundsException with Parquet write version v2

2015-06-18 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592162#comment-14592162 ] Michael Allman commented on PARQUET-246: [~spena] I tried your patch and was able

[jira] [Commented] (PARQUET-300) [Vectorized Reader] Make sure all encodings work in Parquet Vectorized Reader

2015-06-18 Thread Nezih Yigitbasi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592121#comment-14592121 ] Nezih Yigitbasi commented on PARQUET-300: - My latest PR already addresses this is

[jira] [Commented] (PARQUET-299) [Vectorized Reader] ColumnVector length should be in terms of rows, not DataPages

2015-06-18 Thread Nezih Yigitbasi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592109#comment-14592109 ] Nezih Yigitbasi commented on PARQUET-299: - My latest changes include fix for this

[jira] [Commented] (PARQUET-298) [Vectorized Reader] Support Complex Types (Map, Array, Struct) in Parquet Vectorized Reader

2015-06-18 Thread Nezih Yigitbasi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592106#comment-14592106 ] Nezih Yigitbasi commented on PARQUET-298: - Added support for complex types with

[jira] [Commented] (PARQUET-246) ArrayIndexOutOfBoundsException with Parquet write version v2

2015-06-18 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14591390#comment-14591390 ] Michael Allman commented on PARQUET-246: [~spena] Thanks for helping with this. I