[jira] [Commented] (PARQUET-1084) Parquet-C++ doesn't selectively read columns with mmap'ed files

2017-12-12 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16288314#comment-16288314 ] Wes McKinney commented on PARQUET-1084: --- It seems this is related to the use of mm

[jira] [Resolved] (PARQUET-1177) [C++] Add more extensive compiler warnings when using Clang

2017-12-13 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1177. --- Resolution: Fixed Issue resolved by pull request 425 [https://github.com/apache/parquet-cpp/

[jira] [Updated] (PARQUET-1171) [C++] Clarify valid uses for RLE, BIT_PACKED encodings

2017-12-14 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1171: -- Fix Version/s: format-2.4.0 > [C++] Clarify valid uses for RLE, BIT_PACKED encodings > ---

[jira] [Assigned] (PARQUET-1171) [C++] Clarify valid uses for RLE, BIT_PACKED encodings

2017-12-14 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1171: - Assignee: Wes McKinney > [C++] Clarify valid uses for RLE, BIT_PACKED encodings > -

[jira] [Commented] (PARQUET-1171) [C++] Clarify valid uses for RLE, BIT_PACKED encodings

2017-12-14 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16291210#comment-16291210 ] Wes McKinney commented on PARQUET-1171: --- Per discussion here and on the mailing li

[jira] [Updated] (PARQUET-1171) [C++] Clarify valid uses for RLE, BIT_PACKED encodings

2017-12-14 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1171: -- Summary: [C++] Clarify valid uses for RLE, BIT_PACKED encodings (was: [C++] Support RLE and B

[jira] [Commented] (PARQUET-1122) [C++] Support 2-level list encoding in Arrow decoding

2017-12-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293959#comment-16293959 ] Wes McKinney commented on PARQUET-1122: --- Moving to 1.4.0 milestone. Could someone

[jira] [Updated] (PARQUET-1122) [C++] Support 2-level list encoding in Arrow decoding

2017-12-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1122: -- Fix Version/s: (was: cpp-1.3.1) cpp-1.4.0 > [C++] Support 2-level list

[jira] [Comment Edited] (PARQUET-1179) boost::shared_ptr has no candidate constructor

2017-12-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16300372#comment-16300372 ] Wes McKinney edited comment on PARQUET-1179 at 12/21/17 6:00 PM: -

[jira] [Updated] (PARQUET-1179) [C++] Support Apache Thrift 0.11

2017-12-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1179: -- Summary: [C++] Support Apache Thrift 0.11 (was: boost::shared_ptr has no candidate constructo

[jira] [Commented] (PARQUET-1179) boost::shared_ptr has no candidate constructor

2017-12-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16300372#comment-16300372 ] Wes McKinney commented on PARQUET-1179: --- Are you using Thrift 0.11? It looks like

[jira] [Updated] (PARQUET-1179) [C++] Support Apache Thrift 0.11

2017-12-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1179: -- Fix Version/s: cpp-1.4.0 > [C++] Support Apache Thrift 0.11 >

[jira] [Created] (PARQUET-1181) [C++] Devise way to store Arrow logical types not available in the Parquet metadata

2017-12-29 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1181: - Summary: [C++] Devise way to store Arrow logical types not available in the Parquet metadata Key: PARQUET-1181 URL: https://issues.apache.org/jira/browse/PARQUET-1181

[jira] [Commented] (PARQUET-1179) [C++] Support Apache Thrift 0.11

2018-01-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16308182#comment-16308182 ] Wes McKinney commented on PARQUET-1179: --- Can you show the entire build log when ru

[jira] [Commented] (PARQUET-1182) Parquet-cpp version 1.3.1 not tagged in git repo

2018-01-03 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16310648#comment-16310648 ] Wes McKinney commented on PARQUET-1182: --- You may need to {{git fetch origin --tags

[jira] [Created] (PARQUET-1186) [C++] Handling Arrow reads that overflow a BinaryArray capacity

2018-01-05 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1186: - Summary: [C++] Handling Arrow reads that overflow a BinaryArray capacity Key: PARQUET-1186 URL: https://issues.apache.org/jira/browse/PARQUET-1186 Project: Parquet

[jira] [Resolved] (PARQUET-1171) [C++] Clarify valid uses for RLE, BIT_PACKED encodings

2018-01-09 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1171. --- Resolution: Fixed Resolved by PR https://github.com/apache/parquet-format/commit/2696f9e0a9

[jira] [Resolved] (PARQUET-1086) [C++] Remove usage of arrow/util/compiler-util.h after 1.3.0 release

2018-01-15 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1086. --- Resolution: Fixed Issue resolved by pull request 428 [https://github.com/apache/parquet-cpp/

[jira] [Commented] (PARQUET-1084) Parquet-C++ doesn't selectively read columns with mmap'ed files

2018-01-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16327462#comment-16327462 ] Wes McKinney commented on PARQUET-1084: --- {{MAP_PRIVATE}} is the only flag being us

[jira] [Resolved] (PARQUET-1084) Parquet-C++ doesn't selectively read columns with mmap'ed files

2018-01-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1084. --- Resolution: Not A Problem Assignee: Wes McKinney Thanks [~jpivarski]. I agree with you

[jira] [Commented] (PARQUET-1084) Parquet-C++ doesn't selectively read columns with mmap'ed files

2018-01-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16328893#comment-16328893 ] Wes McKinney commented on PARQUET-1084: --- I see. I would say in that file we should

[jira] [Created] (PARQUET-1199) [C++] Support writing (and test reading) boolean values with RLE encoding

2018-01-19 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1199: - Summary: [C++] Support writing (and test reading) boolean values with RLE encoding Key: PARQUET-1199 URL: https://issues.apache.org/jira/browse/PARQUET-1199 Project

[jira] [Commented] (PARQUET-1199) [C++] Support writing (and test reading) boolean values with RLE encoding

2018-01-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1645#comment-1645 ] Wes McKinney commented on PARQUET-1199: --- We'll have to do some work in parquet-cpp

[jira] [Assigned] (PARQUET-1179) [C++] Support Apache Thrift 0.11

2018-01-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1179: - Assignee: Wes McKinney > [C++] Support Apache Thrift 0.11 > ---

[jira] [Commented] (PARQUET-1179) [C++] Support Apache Thrift 0.11

2018-01-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16335202#comment-16335202 ] Wes McKinney commented on PARQUET-1179: --- PR: [https://github.com/apache/parquet-cp

[jira] [Commented] (PARQUET-1186) [C++] Handling Arrow reads that overflow a BinaryArray capacity

2018-02-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16349785#comment-16349785 ] Wes McKinney commented on PARQUET-1186: --- What version of pyarrow are you using?

[jira] [Created] (PARQUET-1204) [C++] Less verbose logging from thirdparty toolchain

2018-02-07 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1204: - Summary: [C++] Less verbose logging from thirdparty toolchain Key: PARQUET-1204 URL: https://issues.apache.org/jira/browse/PARQUET-1204 Project: Parquet Is

[jira] [Commented] (PARQUET-1122) [C++] Support 2-level list encoding in Arrow decoding

2018-02-08 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16357530#comment-16357530 ] Wes McKinney commented on PARQUET-1122: --- Any chance of uploading a Parquet file or

[jira] [Commented] (PARQUET-1209) locally defined symbol ... imported in function ..

2018-02-12 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16361478#comment-16361478 ] Wes McKinney commented on PARQUET-1209: --- The Status dtor is inlined from {{arrow/s

[jira] [Commented] (PARQUET-1209) locally defined symbol ... imported in function ..

2018-02-12 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16361487#comment-16361487 ] Wes McKinney commented on PARQUET-1209: --- SO suggests this might be a static vs dyn

[jira] [Commented] (PARQUET-1209) locally defined symbol ... imported in function ..

2018-02-13 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16362593#comment-16362593 ] Wes McKinney commented on PARQUET-1209: --- OK, so when {{PARQUET_ARROW_LINKAGE}} is

[jira] [Commented] (PARQUET-1209) locally defined symbol ... imported in function ..

2018-02-14 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16364796#comment-16364796 ] Wes McKinney commented on PARQUET-1209: --- Maybe it's a documentation issue. I'm won

[jira] [Commented] (PARQUET-1122) [C++] Support 2-level list encoding in Arrow decoding

2018-02-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16367763#comment-16367763 ] Wes McKinney commented on PARQUET-1122: --- I am unable to investigate in detail righ

[jira] [Created] (PARQUET-1226) [C++] Fix new build warnings with clang 5.0

2018-02-19 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1226: - Summary: [C++] Fix new build warnings with clang 5.0 Key: PARQUET-1226 URL: https://issues.apache.org/jira/browse/PARQUET-1226 Project: Parquet Issue Type:

[jira] [Updated] (PARQUET-1226) [C++] Fix new build warnings with clang 5.0

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1226: -- Fix Version/s: (was: cpp-1.5.0) cpp-1.4.0 > [C++] Fix new build warning

[jira] [Resolved] (PARQUET-1226) [C++] Fix new build warnings with clang 5.0

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1226. --- Resolution: Fixed Resolved by PR https://github.com/apache/parquet-cpp/commit/c405bf36506ec

[jira] [Assigned] (PARQUET-1221) [C++] Extend release README

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1221: - Assignee: Uwe L. Korn > [C++] Extend release README > --- > >

[jira] [Resolved] (PARQUET-1221) [C++] Extend release README

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1221. --- Resolution: Fixed Fix Version/s: cpp-1.4.0 Issue resolved by pull request 441 [https:

[jira] [Resolved] (PARQUET-1220) [C++] Don't build Thrift examples and tutorials in the ExternalProject

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1220. --- Resolution: Fixed Fix Version/s: cpp-1.4.0 Issue resolved by pull request 440 [https:

[jira] [Resolved] (PARQUET-1219) [C++] Update release-candidate script links to gitbox

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1219. --- Resolution: Fixed Fix Version/s: cpp-1.4.0 Issue resolved by pull request 439 [https:

[jira] [Resolved] (PARQUET-1218) [C++] More informative error message on too short pages

2018-02-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1218. --- Resolution: Fixed Resolved by PR https://github.com/apache/parquet-cpp/commit/741012f64ee66

[jira] [Resolved] (PARQUET-1233) [CPP] Enable option to switch between stl classes and boost classes for thrift header

2018-02-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1233. --- Resolution: Fixed Issue resolved by pull request 443 [https://github.com/apache/parquet-cpp/

[jira] [Updated] (PARQUET-1239) [C++] Arrow table reads error when overflowing capacity of BinaryArray

2018-03-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1239: -- Summary: [C++] Arrow table reads error when overflowing capacity of BinaryArray (was: [Python

[jira] [Moved] (PARQUET-1239) [Python] ParquetFile.read does not accommodate large binary data

2018-03-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-2242 to PARQUET-1239: -- Fix Version/s: (was: 0.9.0) cpp-1.5.0

[jira] [Commented] (PARQUET-1239) [C++] Arrow table reads error when overflowing capacity of BinaryArray

2018-03-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16382757#comment-16382757 ] Wes McKinney commented on PARQUET-1239: --- I moved this issue to Apache Parquet beca

[jira] [Updated] (PARQUET-1243) [C++] Improve quality of error message for zero-length files, otherwise corrupted files

2018-03-11 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1243: -- Description: Currently, the error looks like, even for zero-size files: {code} if (file_s

[jira] [Created] (PARQUET-1243) [C++] Improve quality of error message for zero-length files, otherwise corrupted files

2018-03-11 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1243: - Summary: [C++] Improve quality of error message for zero-length files, otherwise corrupted files Key: PARQUET-1243 URL: https://issues.apache.org/jira/browse/PARQUET-1243

[jira] [Assigned] (PARQUET-1245) [Python] Segfault when writing Arrow table with duplicate columns

2018-03-12 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1245: - Assignee: (was: Antoine Pitrou) Fix Version/s: (was: 0.9.0)

[jira] [Updated] (PARQUET-1245) [C++] Segfault when writing Arrow table with duplicate columns

2018-03-12 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1245: -- Summary: [C++] Segfault when writing Arrow table with duplicate columns (was: [Python] Segfau

[jira] [Assigned] (PARQUET-1245) [C++] Segfault when writing Arrow table with duplicate columns

2018-03-12 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1245: - Assignee: Antoine Pitrou > [C++] Segfault when writing Arrow table with duplicate colum

[jira] [Resolved] (PARQUET-1245) [C++] Segfault when writing Arrow table with duplicate columns

2018-03-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1245. --- Resolution: Fixed Issue resolved by pull request 447 [https://github.com/apache/parquet-cpp/

[jira] [Commented] (PARQUET-1249) Clarify encoding schemes for boolean types

2018-03-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16411940#comment-16411940 ] Wes McKinney commented on PARQUET-1249: --- Added you > Clarify encoding schemes for

[jira] [Resolved] (PARQUET-1166) [API Proposal] Add GetRecordBatchReader in parquet/arrow/reader.h

2018-03-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1166. --- Resolution: Fixed Fix Version/s: cpp-1.5.0 Issue resolved by pull request 445 [https:

[jira] [Assigned] (PARQUET-1166) [API Proposal] Add GetRecordBatchReader in parquet/arrow/reader.h

2018-03-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1166: - Assignee: Xianjin YE > [API Proposal] Add GetRecordBatchReader in parquet/arrow/reader.

[jira] [Created] (PARQUET-1306) [C++] Improve code reuse and reduce redundancy between Arrow and Parquet C++ build systems

2018-05-21 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1306: - Summary: [C++] Improve code reuse and reduce redundancy between Arrow and Parquet C++ build systems Key: PARQUET-1306 URL: https://issues.apache.org/jira/browse/PARQUET-1306

[jira] [Updated] (PARQUET-1306) [C++] Improve code reuse and reduce redundancy between Arrow and Parquet C++ build systems

2018-05-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1306: -- Description: I would like to see if it's possible to modularize the build system in Apache Ar

[jira] [Created] (PARQUET-1323) [C++] Fix compiler warnings with clang-6.0

2018-06-11 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1323: - Summary: [C++] Fix compiler warnings with clang-6.0 Key: PARQUET-1323 URL: https://issues.apache.org/jira/browse/PARQUET-1323 Project: Parquet Issue Type:

[jira] [Commented] (PARQUET-1324) [C++] Read Parquet dictionary encoded ColumnChunks directly into an Arrow DictionaryArray

2018-06-13 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16511551#comment-16511551 ] Wes McKinney commented on PARQUET-1324: --- This has been discussed many other place

[jira] [Created] (PARQUET-1345) [C++] It is possible to overflow a TMemoryBuffer when serializing the file metadata

2018-07-09 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1345: - Summary: [C++] It is possible to overflow a TMemoryBuffer when serializing the file metadata Key: PARQUET-1345 URL: https://issues.apache.org/jira/browse/PARQUET-1345

[jira] [Resolved] (PARQUET-1349) [C++] PARQUET_RPATH_ORIGIN is not picked by the build

2018-07-14 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1349. --- Resolution: Fixed Issue resolved by pull request 476 [https://github.com/apache/parquet-cpp

[jira] [Resolved] (PARQUET-1350) [C++] Use abstract ResizableBuffer instead of concrete PoolBuffer

2018-07-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1350. --- Resolution: Fixed Fix Version/s: cpp-1.5.0 Issue resolved by pull request 477 [https

[jira] [Assigned] (PARQUET-1323) [C++] Fix compiler warnings with clang-6.0

2018-07-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1323: - Assignee: Wes McKinney > [C++] Fix compiler warnings with clang-6.0 >

[jira] [Resolved] (PARQUET-1323) [C++] Fix compiler warnings with clang-6.0

2018-07-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1323. --- Resolution: Fixed Issue resolved by pull request 478 [https://github.com/apache/parquet-cpp

[jira] [Created] (PARQUET-1354) [C++] Fix deprecated Arrow builder API usages

2018-07-23 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1354: - Summary: [C++] Fix deprecated Arrow builder API usages Key: PARQUET-1354 URL: https://issues.apache.org/jira/browse/PARQUET-1354 Project: Parquet Issue Typ

[jira] [Assigned] (PARQUET-1354) [C++] Fix deprecated Arrow builder API usages

2018-07-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1354: - Assignee: Wes McKinney > [C++] Fix deprecated Arrow builder API usages > -

[jira] [Created] (PARQUET-1360) [C++] Minor API + style changes follow up to PARQUET-1348

2018-07-28 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1360: - Summary: [C++] Minor API + style changes follow up to PARQUET-1348 Key: PARQUET-1360 URL: https://issues.apache.org/jira/browse/PARQUET-1360 Project: Parquet

[jira] [Updated] (PARQUET-1360) [C++] Minor API + style changes follow up to PARQUET-1348

2018-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1360: -- Description: see comments in https://github.com/apache/parquet-cpp/pull/481 > [C++] Minor API

[jira] [Assigned] (PARQUET-1360) [C++] Minor API + style changes follow up to PARQUET-1348

2018-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1360: - Assignee: Wes McKinney > [C++] Minor API + style changes follow up to PARQUET-1348 > -

[jira] [Created] (PARQUET-1362) [C++] Validate or add option to validate arrow::Table schema in parquet::arrow::FileWriter::WriteTable

2018-07-29 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1362: - Summary: [C++] Validate or add option to validate arrow::Table schema in parquet::arrow::FileWriter::WriteTable Key: PARQUET-1362 URL: https://issues.apache.org/jira/browse/PARQ

[jira] [Resolved] (PARQUET-1360) [C++] Minor API + style changes follow up to PARQUET-1348

2018-07-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1360. --- Resolution: Fixed Resolved by https://github.com/apache/parquet-cpp/commit/853abb96e95bd44

[jira] [Commented] (PARQUET-1361) [C++] 1.4.1 library allows creation of parquet file w/NULL values for INT types

2018-07-31 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16564043#comment-16564043 ] Wes McKinney commented on PARQUET-1361: --- The problem is that the user's data has

[jira] [Resolved] (PARQUET-1366) [C++] Streamline use of Arrow bit-util.h

2018-08-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1366. --- Resolution: Fixed Fix Version/s: cpp-1.5.0 Issue resolved by pull request 483 [https

[jira] [Updated] (PARQUET-1370) [C++] Read consecutive column chunks in a single scan

2018-08-04 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1370: -- Summary: [C++] Read consecutive column chunks in a single scan (was: Read consecutive column

[jira] [Commented] (PARQUET-1370) Read consecutive column chunks in a single scan

2018-08-04 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16569210#comment-16569210 ] Wes McKinney commented on PARQUET-1370: --- I have opened some issues related to buf

[jira] [Commented] (PARQUET-1352) [CPP] Trying to write an arrow table with structs to a parquet file

2018-08-08 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16573720#comment-16573720 ] Wes McKinney commented on PARQUET-1352: --- Either you can contribute to the nested

[jira] [Commented] (PARQUET-1241) Use LZ4 frame format

2018-08-14 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16579852#comment-16579852 ] Wes McKinney commented on PARQUET-1241: --- [~ee07b291] would you be able to contrib

[jira] [Resolved] (PARQUET-1378) [c++] Allow RowGroups with zero rows to be written

2018-08-15 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1378. --- Resolution: Fixed Fix Version/s: (was: 1.5.0) cpp-1.5.0 Issue

[jira] [Updated] (PARQUET-1380) move Bloom filter test binary to parquet-testing repo

2018-08-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1380: -- Fix Version/s: cpp-1.5.0 > move Bloom filter test binary to parquet-testing repo > --

[jira] [Created] (PARQUET-1384) [C++] Clang compiler warnings in bloom_filter-test.cc

2018-08-16 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1384: - Summary: [C++] Clang compiler warnings in bloom_filter-test.cc Key: PARQUET-1384 URL: https://issues.apache.org/jira/browse/PARQUET-1384 Project: Parquet I

[jira] [Updated] (PARQUET-1384) [C++] Clang compiler warnings in bloom_filter-test.cc

2018-08-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1384: -- Description: {code} [69/95] Building CXX object src/parquet/CMakeFiles/bloom_filter-test.dir

[jira] [Updated] (PARQUET-1380) [C++] move Bloom filter test binary to parquet-testing repo

2018-08-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1380: -- Summary: [C++] move Bloom filter test binary to parquet-testing repo (was: move Bloom filter

[jira] [Created] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-16 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1385: - Summary: [C++] bloom_filter-test is very slow under valgrind Key: PARQUET-1385 URL: https://issues.apache.org/jira/browse/PARQUET-1385 Project: Parquet Iss

[jira] [Assigned] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2018-08-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1385: - Assignee: Wes McKinney > [C++] bloom_filter-test is very slow under valgrind > ---

[jira] [Resolved] (PARQUET-1308) [C++] parquet::arrow should use thread pool, not ParallelFor

2018-08-16 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1308. --- Resolution: Fixed Fix Version/s: cpp-1.5.0 Issue resolved by pull request 467 [https

[jira] [Resolved] (PARQUET-1382) [C++] Prepare for arrow::test namespace removal

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1382. --- Resolution: Fixed Fix Version/s: cpp-1.5.0 Issue resolved by pull request 487 [https

[jira] [Resolved] (PARQUET-1256) [C++] Add --print-key-value-metadata option to parquet_reader tool

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1256. --- Resolution: Fixed Issue resolved by pull request 450 [https://github.com/apache/parquet-cpp

[jira] [Assigned] (PARQUET-1256) [C++] Add --print-key-value-metadata option to parquet_reader tool

2018-08-17 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1256: - Assignee: Jacek Pliszka > [C++] Add --print-key-value-metadata option to parquet_reade

[jira] [Updated] (PARQUET-1390) Upgrade to Arrow 0.10.0

2018-08-18 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1390: -- Component/s: parquet-mr > Upgrade to Arrow 0.10.0 > --- > >

[jira] [Updated] (PARQUET-1390) [Java] Upgrade to Arrow 0.10.0

2018-08-18 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1390: -- Summary: [Java] Upgrade to Arrow 0.10.0 (was: Upgrade to Arrow 0.10.0) > [Java] Upgrade to A

[jira] [Commented] (PARQUET-1160) [C++] Implement BYTE_ARRAY-backed Decimal reads

2018-08-18 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16584937#comment-16584937 ] Wes McKinney commented on PARQUET-1160: --- Can you submit a pull request? > [C++]

[jira] [Commented] (PARQUET-1394) [C++] Linking errors with boost_regex

2018-08-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16586053#comment-16586053 ] Wes McKinney commented on PARQUET-1394: --- This may be related to the Anaconda comp

[jira] [Commented] (PARQUET-1241) Use LZ4 frame format

2018-08-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16587678#comment-16587678 ] Wes McKinney commented on PARQUET-1241: --- Please either add a new codec or add an

[jira] [Updated] (PARQUET-1403) [C++] Coerce Arrow half-precision float to float32

2018-08-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1403: -- Summary: [C++] Coerce Arrow half-precision float to float32 (was: Can't save a df using Parq

[jira] [Commented] (PARQUET-1403) [C++] Coerce Arrow half-precision float to float32

2018-08-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16590290#comment-16590290 ] Wes McKinney commented on PARQUET-1403: --- Agreed, updated the issue title > [C++]

[jira] [Updated] (PARQUET-1370) [C++] Read consecutive column chunks in a single scan

2018-08-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1370: -- Fix Version/s: cpp-1.6.0 > [C++] Read consecutive column chunks in a single scan > --

[jira] [Commented] (PARQUET-1370) [C++] Read consecutive column chunks in a single scan

2018-08-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16592961#comment-16592961 ] Wes McKinney commented on PARQUET-1370: --- That would be a question for [~pitrou] o

[jira] [Updated] (PARQUET-1405) [C++] 'Couldn't deserialize thrift' error when reading large binary column

2018-08-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1405: -- Fix Version/s: cpp-1.6.0 > [C++] 'Couldn't deserialize thrift' error when reading large binar

[jira] [Moved] (PARQUET-1405) [C++] 'Couldn't deserialize thrift' error when reading large binary column

2018-08-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-3138 to PARQUET-1405: -- Fix Version/s: (was: 0.12.0) Affects Version/s: (was:

[jira] [Commented] (PARQUET-1405) [C++] 'Couldn't deserialize thrift' error when reading large binary column

2018-08-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16595670#comment-16595670 ] Wes McKinney commented on PARQUET-1405: --- Moved this to Parquet > [C++] 'Couldn't

[jira] [Commented] (PARQUET-1406) unit test fails on some cases

2018-08-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16596768#comment-16596768 ] Wes McKinney commented on PARQUET-1406: --- Can you post the output of {{ctest -VV}}
