[jira] [Created] (PARQUET-479) Add regression tests to the build process

2016-01-29 Thread Aliaksei Sandryhaila (JIRA)
Aliaksei Sandryhaila created PARQUET-479: Summary: Add regression tests to the build process Key: PARQUET-479 URL: https://issues.apache.org/jira/browse/PARQUET-479 Project: Parquet

[jira] [Commented] (PARQUET-479) Add regression tests to the build process

2016-01-29 Thread Aliaksei Sandryhaila (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15123570#comment-15123570 ] Aliaksei Sandryhaila commented on PARQUET-479: -- In our case, regression testing will consist

[jira] [Commented] (PARQUET-479) Add regression tests to the build process

2016-01-29 Thread Aliaksei Sandryhaila (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15123583#comment-15123583 ] Aliaksei Sandryhaila commented on PARQUET-479: -- Ah, I missed that you've already added it in

[jira] [Created] (PARQUET-481) Refactor and expand reader-test

2016-01-29 Thread Aliaksei Sandryhaila (JIRA)
Aliaksei Sandryhaila created PARQUET-481: Summary: Refactor and expand reader-test Key: PARQUET-481 URL: https://issues.apache.org/jira/browse/PARQUET-481 Project: Parquet Issue

[jira] [Updated] (PARQUET-472) Clean up InputStream ownership semantics in ColumnReader

2016-01-29 Thread Julien Le Dem (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Le Dem updated PARQUET-472: -- Fix Version/s: (was: format-2.4.0) > Clean up InputStream ownership semantics in

[jira] [Created] (PARQUET-482) Organize src code file structure to have a very clear folder with public headers.

2016-01-29 Thread Nong Li (JIRA)
Nong Li created PARQUET-482: --- Summary: Organize src code file structure to have a very clear folder with public headers. Key: PARQUET-482 URL: https://issues.apache.org/jira/browse/PARQUET-482 Project:

[jira] [Updated] (PARQUET-482) Organize src code file structure to have a very clear folder with public headers.

2016-01-29 Thread Nong Li (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nong Li updated PARQUET-482: Component/s: parquet-cpp > Organize src code file structure to have a very clear folder with public >

Organizing functional components and a bottom-up testing plan for parquet-cpp

2016-01-29 Thread Wes McKinney
hi folks, Since there are so many moving pieces in creating a full-featured Parquet reader-writer, I propose we start working out a plan to create test fixtures and tools to enable us to develop faster. Specifically, we need to achieve maximum decoupling between functional components. Every

[jira] [Updated] (PARQUET-472) Clean up InputStream ownership semantics in ColumnReader

2016-01-29 Thread Julien Le Dem (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Le Dem updated PARQUET-472: -- Fix Version/s: cpp-0.1 > Clean up InputStream ownership semantics in ColumnReader >

[jira] [Commented] (PARQUET-481) Refactor and expand reader-test

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15123827#comment-15123827 ] Wes McKinney commented on PARQUET-481: -- I feel very strongly about keeping the low-level unit tests

[jira] [Updated] (PARQUET-483) Write tests investigating failure modes with malformed encoded levels in data pages

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-483: - Summary: Write tests investigating failure modes with malformed encoded levels in data pages

[jira] [Commented] (PARQUET-438) Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

2016-01-29 Thread Deepak Majeti (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124538#comment-15124538 ] Deepak Majeti commented on PARQUET-438: --- [~wesmckinn] I misunderstood the parquet-mr

[jira] [Created] (PARQUET-483) Write tests investigate failure modes with malformed encoded levels in data pages

2016-01-29 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-483: Summary: Write tests investigate failure modes with malformed encoded levels in data pages Key: PARQUET-483 URL: https://issues.apache.org/jira/browse/PARQUET-483

[jira] [Resolved] (PARQUET-450) Small typos/issues in parquet-format documentation

2016-01-29 Thread Julien Le Dem (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Le Dem resolved PARQUET-450. --- Resolution: Fixed Fix Version/s: format-2.4.0 Issue resolved by pull request 36

Re: Organizing functional components and a bottom-up testing plan for parquet-cpp

2016-01-29 Thread Julien Le Dem
Sounds good to me. At some point (later) we'll have to do some cross-compatibility testing with parquet-mr as well to make sure everything is on the same page. CC'ing some folks who should probably chime in. On Fri, Jan 29, 2016 at 10:21 AM, Wes McKinney wrote: > hi folks, >

[jira] [Updated] (PARQUET-438) Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-438: - Summary: Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

[jira] [Commented] (PARQUET-438) Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124123#comment-15124123 ] Wes McKinney commented on PARQUET-438: -- See https://github.com/apache/parquet-cpp/pull/31 I'm

[jira] [Updated] (PARQUET-462) Implement a LevelDecoder class (like Impala) which dispatches to RLE or BIT_PACKED decoding as appropriate

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-462: - Summary: Implement a LevelDecoder class (like Impala) which dispatches to RLE or BIT_PACKED

[jira] [Resolved] (PARQUET-432) Complete a todo for method ColumnDescriptor.compareTo()

2016-01-29 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved PARQUET-432. Resolution: Fixed Issue resolved by pull request 314

[jira] [Commented] (PARQUET-467) Check for and raise error for deprecated BIT_PACKED encoding

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124182#comment-15124182 ] Wes McKinney commented on PARQUET-467: -- Per PARQUET-462 we can go ahead and implement this level

[jira] [Commented] (PARQUET-438) Update RLE encoder/decoder modules from Impala upstream changes and adapt unit tests

2016-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124173#comment-15124173 ] Wes McKinney commented on PARQUET-438: -- [~mdeepak] If you identify a specific problem with the