[jira] [Assigned] (PARQUET-1463) [C++] Utilize revamped common hashing machinery for dictionary encoding

2018-11-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1463: - Assignee: Antoine Pitrou > [C++] Utilize revamped common hashing machinery for diction

[jira] [Commented] (PARQUET-1463) [C++] Utilize revamped common hashing machinery for dictionary encoding

2018-11-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16704070#comment-16704070 ] Wes McKinney commented on PARQUET-1463: --- Using the following setup {code} import

[jira] [Created] (PARQUET-1467) [C++] Remove ChunkedAllocator code, now unused

2018-12-01 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1467: - Summary: [C++] Remove ChunkedAllocator code, now unused Key: PARQUET-1467 URL: https://issues.apache.org/jira/browse/PARQUET-1467 Project: Parquet Issue Ty

[jira] [Assigned] (PARQUET-1467) [C++] Remove ChunkedAllocator code, now unused

2018-12-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1467: - Assignee: Wes McKinney > [C++] Remove ChunkedAllocator code, now unused >

[jira] [Resolved] (PARQUET-1467) [C++] Remove ChunkedAllocator code, now unused

2018-12-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1467. --- Resolution: Fixed Issue resolved by pull request 3069 [https://github.com/apache/arrow/pull

[jira] [Created] (PARQUET-1468) [C++] Consolidate RecordReader, ColumnReader code paths

2018-12-03 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1468: - Summary: [C++] Consolidate RecordReader, ColumnReader code paths Key: PARQUET-1468 URL: https://issues.apache.org/jira/browse/PARQUET-1468 Project: Parquet

[jira] [Created] (PARQUET-1469) [C++] DefinitionLevelsToBitmap can overwrite prior decoded data

2018-12-03 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1469: - Summary: [C++] DefinitionLevelsToBitmap can overwrite prior decoded data Key: PARQUET-1469 URL: https://issues.apache.org/jira/browse/PARQUET-1469 Project: Parquet

[jira] [Commented] (PARQUET-1469) [C++] DefinitionLevelsToBitmap can overwrite prior decoded data

2018-12-03 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16707983#comment-16707983 ] Wes McKinney commented on PARQUET-1469: --- This is only likely to impact people rea

[jira] [Resolved] (PARQUET-1469) [C++] DefinitionLevelsToBitmap can overwrite prior decoded data

2018-12-03 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1469. --- Resolution: Fixed Issue resolved by pull request 3080 [https://github.com/apache/arrow/pull

[jira] [Created] (PARQUET-1471) [C++] Out of bounds access in statistics UpdateSpaced when writing optional list with null list slots

2018-12-05 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1471: - Summary: [C++] Out of bounds access in statistics UpdateSpaced when writing optional list with null list slots Key: PARQUET-1471 URL: https://issues.apache.org/jira/browse/PARQU

[jira] [Commented] (PARQUET-1352) [CPP] Trying to write an arrow table with structs to a parquet file

2018-12-06 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16711838#comment-16711838 ] Wes McKinney commented on PARQUET-1352: --- The issue is still open. Not going to ge

[jira] [Comment Edited] (PARQUET-1352) [CPP] Trying to write an arrow table with structs to a parquet file

2018-12-06 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16711838#comment-16711838 ] Wes McKinney edited comment on PARQUET-1352 at 12/6/18 6:36 PM: -

[jira] [Resolved] (PARQUET-1471) [C++] Out of bounds access in statistics UpdateSpaced when writing optional list with null list slots

2018-12-06 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1471. --- Resolution: Fixed Issue resolved by pull request 3114 [https://github.com/apache/arrow/pull

[jira] [Commented] (PARQUET-1470) Inputstream leakage in ParquetFileWriter.appendFile

2018-12-11 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16717363#comment-16717363 ] Wes McKinney commented on PARQUET-1470: --- [~ArnaudL] we all propose changes to the

[jira] [Resolved] (PARQUET-1473) [C++] Add helper function that converts ParquetVersion to human-friendly string

2018-12-13 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1473. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 3148 [http

[jira] [Updated] (PARQUET-1479) [Java] Arrow read write support

2018-12-15 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1479: -- Component/s: parquet-mr > [Java] Arrow read write support > --- >

[jira] [Assigned] (PARQUET-1481) [C++] SEGV when reading corrupt parquet file

2018-12-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1481: - Assignee: Wes McKinney (was: Hatem Helal) > [C++] SEGV when reading corrupt parquet f

[jira] [Commented] (PARQUET-1481) [C++] SEGV when reading corrupt parquet file

2018-12-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16726806#comment-16726806 ] Wes McKinney commented on PARQUET-1481: --- The Thrift metadata is corrupt, but it i

[jira] [Updated] (PARQUET-1482) [C++] Unable to read data from parquet file generated with parquetjs

2018-12-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1482: -- Component/s: parquet-cpp > [C++] Unable to read data from parquet file generated with parquet

[jira] [Updated] (PARQUET-1481) [C++] SEGV when reading corrupt parquet file

2018-12-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1481: -- Affects Version/s: cpp-1.6.0 > [C++] SEGV when reading corrupt parquet file > ---

[jira] [Resolved] (PARQUET-1481) [C++] SEGV when reading corrupt parquet file

2018-12-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1481. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Resolved in https://github.com/apache/ar

[jira] [Updated] (PARQUET-474) [C++] InputStream and RandomAccessSource classes are not threadsafe

2018-12-27 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-474: - Summary: [C++] InputStream and RandomAccessSource classes are not threadsafe (was: InputStream

[jira] [Commented] (PARQUET-474) [C++] InputStream and RandomAccessSource classes are not threadsafe

2018-12-27 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16729683#comment-16729683 ] Wes McKinney commented on PARQUET-474: -- This will be resolved by ARROW-1422 (the bu

[jira] [Resolved] (PARQUET-1484) [C++] Improve memory usage of FileMetaDataBuilder

2018-12-31 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1484. --- Resolution: Fixed Issue resolved by pull request 3277 [https://github.com/apache/arrow/pull

[jira] [Updated] (PARQUET-1482) [C++] Unable to read data from parquet file generated with parquetjs

2019-01-03 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1482: -- Fix Version/s: cpp-1.6.0 > [C++] Unable to read data from parquet file generated with parquet

[jira] [Commented] (PARQUET-1482) [C++] Unable to read data from parquet file generated with parquetjs

2019-01-03 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16733266#comment-16733266 ] Wes McKinney commented on PARQUET-1482: --- Done > [C++] Unable to read data from p

[jira] [Assigned] (PARQUET-1482) [C++] Unable to read data from parquet file generated with parquetjs

2019-01-03 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1482: - Assignee: Rylan Dmello (was: Hatem Helal) > [C++] Unable to read data from parquet fi

[jira] [Resolved] (PARQUET-690) [C++] Investigate / improve performance of Thrift utilities

2019-01-04 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-690. -- Resolution: Fixed Issue resolved by pull request 3268 [https://github.com/apache/arrow/pull/32

[jira] [Moved] (PARQUET-1494) [C++] Can't access parquet statistics on binary columns

2019-01-18 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-4293 to PARQUET-1494: -- Workflow: patch-available, re-open possible (was: jira) Key

[jira] [Updated] (PARQUET-1494) [C++] Can't access parquet statistics on binary columns

2019-01-18 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1494: -- Component/s: parquet-avro > [C++] Can't access parquet statistics on binary columns > ---

[jira] [Updated] (PARQUET-1494) [C++] Can't access parquet statistics on binary columns

2019-01-18 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1494: -- Component/s: (was: parquet-avro) parquet-cpp > [C++] Can't access parque

[jira] [Resolved] (PARQUET-1494) [C++] Can't access parquet statistics on binary columns

2019-01-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1494. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 3441 [http

[jira] [Assigned] (PARQUET-1494) [C++] Can't access parquet statistics on binary columns

2019-01-23 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1494: - Assignee: Ildar > [C++] Can't access parquet statistics on binary columns > --

[jira] [Created] (PARQUET-1508) [C++] Enable reading from ByteArray and FixedLenByteArray decoders directly into arrow::BinaryBuilder or arrow::BinaryDictionaryBuilder

2019-01-24 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1508: - Summary: [C++] Enable reading from ByteArray and FixedLenByteArray decoders directly into arrow::BinaryBuilder or arrow::BinaryDictionaryBuilder Key: PARQUET-1508 URL: https://i

[jira] [Resolved] (PARQUET-1508) [C++] Enable reading from ByteArray and FixedLenByteArray decoders directly into arrow::BinaryBuilder or arrow::BinaryDictionaryBuilder

2019-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1508. --- Resolution: Fixed Issue resolved by pull request 3492 [https://github.com/apache/arrow/pull

[jira] [Created] (PARQUET-1519) [C++] Remove use of "extern template class" from parquet/column_reader.h

2019-01-29 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1519: - Summary: [C++] Remove use of "extern template class" from parquet/column_reader.h Key: PARQUET-1519 URL: https://issues.apache.org/jira/browse/PARQUET-1519 Project:

[jira] [Assigned] (PARQUET-1519) [C++] Remove use of "extern template class" from parquet/column_reader.h

2019-01-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1519: - Assignee: Wes McKinney > [C++] Remove use of "extern template class" from parquet/colu

[jira] [Resolved] (PARQUET-1519) [C++] Remove use of "extern template class" from parquet/column_reader.h

2019-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1519. --- Resolution: Fixed Issue resolved by pull request 3525 [https://github.com/apache/arrow/pull

[jira] [Commented] (PARQUET-1519) [C++] Remove use of "extern template class" from parquet/column_reader.h

2019-01-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16756242#comment-16756242 ] Wes McKinney commented on PARQUET-1519: --- See macros in https://github.com/apache

[jira] [Created] (PARQUET-1521) [C++] Do not use "extern template class" with parquet::ColumnWriter

2019-02-04 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1521: - Summary: [C++] Do not use "extern template class" with parquet::ColumnWriter Key: PARQUET-1521 URL: https://issues.apache.org/jira/browse/PARQUET-1521 Project: Parq

[jira] [Created] (PARQUET-1523) [C++] Vectorize comparator interface

2019-02-04 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1523: - Summary: [C++] Vectorize comparator interface Key: PARQUET-1523 URL: https://issues.apache.org/jira/browse/PARQUET-1523 Project: Parquet Issue Type: Improv

[jira] [Moved] (PARQUET-1525) [C++] remove dependency on getopt in parquet tools

2019-02-05 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-4456 to PARQUET-1525: -- Fix Version/s: (was: 0.13.0) cpp-1.6.0 C

[jira] [Deleted] (PARQUET-1524) [C++] remove dependency on getopt in parquet tools

2019-02-05 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney deleted PARQUET-1524: -- > [C++] remove dependency on getopt in parquet tools > -

[jira] [Resolved] (PARQUET-1525) [C++] remove dependency on getopt in parquet tools

2019-02-05 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1525. --- Resolution: Fixed Issue resolved by pull request 3545 [https://github.com/apache/arrow/pull

[jira] [Commented] (PARQUET-1523) [C++] Vectorize comparator interface

2019-02-05 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16761048#comment-16761048 ] Wes McKinney commented on PARQUET-1523: --- OK. Do you want to try to remove the usa

[jira] [Updated] (PARQUET-1521) [C++] Do not use "extern template class" with parquet::ColumnWriter

2019-02-05 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1521: -- Description: As continued cleaning, similar to parquet::TypedColumnReader I will do similar r

[jira] [Moved] (PARQUET-1526) [C++] parquet cpp - improve examples

2019-02-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-3949 to PARQUET-1526: -- Component/s: (was: C++) parquet-cpp Workf

[jira] [Commented] (PARQUET-1537) [C++] The patch for PARQUET-1508 leads to infinite loop and infinite memory allocation when reading very sparse ByteArray columns

2019-02-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16774511#comment-16774511 ] Wes McKinney commented on PARQUET-1537: --- I moved this issue to PARQUET > [C++] T

[jira] [Updated] (PARQUET-1537) [C++] The patch for PARQUET-1508 leads to infinite loop and infinite memory allocation when reading very sparse ByteArray columns

2019-02-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1537: -- Fix Version/s: cpp-1.6.0 > [C++] The patch for PARQUET-1508 leads to infinite loop and infini

[jira] [Moved] (PARQUET-1537) [C++] The patch for PARQUET-1508 leads to infinite loop and infinite memory allocation when reading very sparse ByteArray columns

2019-02-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-4650 to PARQUET-1537: -- Fix Version/s: (was: 0.13.0) Component/s: (was: C++)

[jira] [Resolved] (PARQUET-1482) [C++] Unable to read data from parquet file generated with parquetjs

2019-03-06 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1482. --- Resolution: Fixed Issue resolved by pull request 3312 [https://github.com/apache/arrow/pull

[jira] [Commented] (PARQUET-1022) [C++] Append mode in parquet-cpp

2019-03-12 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16790596#comment-16790596 ] Wes McKinney commented on PARQUET-1022: --- I think theoretically you can write row

[jira] [Commented] (PARQUET-1411) [C++] Upgrade to use LogicalType annotations instead of ConvertedType

2019-04-22 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823563#comment-16823563 ] Wes McKinney commented on PARQUET-1411: --- [~tpboudreau] I'll add you to the Contri

[jira] [Commented] (PARQUET-1405) [C++] 'Couldn't deserialize thrift' error when reading large binary column

2019-04-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16829735#comment-16829735 ] Wes McKinney commented on PARQUET-1405: --- We can add an option to not write statis

[jira] [Created] (PARQUET-1566) [C++] Indicate if null count, distinct count are present in column statistics

2019-05-01 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1566: - Summary: [C++] Indicate if null count, distinct count are present in column statistics Key: PARQUET-1566 URL: https://issues.apache.org/jira/browse/PARQUET-1566 Pro

[jira] [Assigned] (PARQUET-1413) [C++] Remove virtual calls from parquet::Comparator hot paths on writing file

2019-05-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1413: - Assignee: Wes McKinney > [C++] Remove virtual calls from parquet::Comparator hot paths

[jira] [Closed] (PARQUET-1413) [C++] Remove virtual calls from parquet::Comparator hot paths on writing file

2019-05-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-1413. - > [C++] Remove virtual calls from parquet::Comparator hot paths on writing file > -

[jira] [Resolved] (PARQUET-1413) [C++] Remove virtual calls from parquet::Comparator hot paths on writing file

2019-05-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1413. --- Resolution: Duplicate Duplicate of PARQUET-1523 > [C++] Remove virtual calls from parquet:

[jira] [Closed] (PARQUET-567) C++: Add a "file iterator" abstraction and API-compatible interface for scanning directories of Parquet files

2019-05-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-567. > C++: Add a "file iterator" abstraction and API-compatible interface for > scanning directories of P

[jira] [Resolved] (PARQUET-567) C++: Add a "file iterator" abstraction and API-compatible interface for scanning directories of Parquet files

2019-05-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-567. -- Resolution: Won't Fix Assignee: Wes McKinney We are planning to address this use case do

[jira] [Closed] (PARQUET-810) [C++] Read from file with schema evolution

2019-05-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-810. > [C++] Read from file with schema evolution > -- > >

[jira] [Resolved] (PARQUET-810) [C++] Read from file with schema evolution

2019-05-01 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-810. -- Resolution: Won't Fix I think this is an application-level concern. We provide both low-leve

[jira] [Closed] (PARQUET-476) Add a utility function to print the raw repetition / definition levels to an std::ostream

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-476. > Add a utility function to print the raw repetition / definition levels to an > std::ostream > -

[jira] [Resolved] (PARQUET-476) Add a utility function to print the raw repetition / definition levels to an std::ostream

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-476. -- Resolution: Won't Fix Assignee: (was: Deepak Majeti) > Add a utility function to pri

[jira] [Updated] (PARQUET-1546) [C++] page level min / max written by parquet-cpp is not recognized by parquet-tools

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1546: -- Summary: [C++] page level min / max written by parquet-cpp is not recognized by parquet-tool

[jira] [Resolved] (PARQUET-474) [C++] InputStream and RandomAccessSource classes are not threadsafe

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-474. -- Resolution: Won't Fix Assignee: (was: Wes McKinney) > [C++] InputStream and RandomAc

[jira] [Closed] (PARQUET-474) [C++] InputStream and RandomAccessSource classes are not threadsafe

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-474. > [C++] InputStream and RandomAccessSource classes are not threadsafe > --

[jira] [Updated] (PARQUET-749) [C++] Schema building ParquetFileWriter

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-749: - Summary: [C++] Schema building ParquetFileWriter (was: Schema building ParquetFileWriter) > [C

[jira] [Assigned] (PARQUET-800) [C++] Provide public API to access dictionary-encoded indices and values

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-800: Assignee: Wes McKinney (was: Uwe L. Korn) > [C++] Provide public API to access dictionar

[jira] [Updated] (PARQUET-1159) [C++] Compatibility with C++ iterators

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1159: -- Summary: [C++] Compatibility with C++ iterators (was: Compatibility with C++ iterators) > [

[jira] [Updated] (PARQUET-1169) [C++] Segment fault when using NextBatch of parquet::arrow::ColumnReader in parquet-cpp

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1169: -- Summary: [C++] Segment fault when using NextBatch of parquet::arrow::ColumnReader in parquet-

[jira] [Updated] (PARQUET-1224) [C++] Implement specification-compliant floating point comparison

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1224: -- Summary: [C++] Implement specification-compliant floating point comparison (was: [parquet-cp

[jira] [Assigned] (PARQUET-1243) [C++] Improve quality of error message for zero-length files, otherwise corrupted files

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1243: - Assignee: Wes McKinney > [C++] Improve quality of error message for zero-length files,

[jira] [Resolved] (PARQUET-1252) [C++] Pass BOOST_ROOT and Boost_NAMESPACE on to Thrift EP

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1252. --- Resolution: Fixed Assignee: Uwe L. Korn As far as I can tell this is done. Resolving,

[jira] [Assigned] (PARQUET-1352) [CPP] Trying to write an arrow table with structs to a parquet file

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1352: - Assignee: Wes McKinney > [CPP] Trying to write an arrow table with structs to a parque

[jira] [Updated] (PARQUET-1404) [C++] Add index pages to the format to support efficient page skipping to parquet-cpp

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1404: -- Summary: [C++] Add index pages to the format to support efficient page skipping to parquet-cp

[jira] [Assigned] (PARQUET-1523) [C++] Vectorize comparator interface

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1523: - Assignee: Wes McKinney (was: Deepak Majeti) > [C++] Vectorize comparator interface >

[jira] [Resolved] (PARQUET-1523) [C++] Vectorize comparator interface

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1523. --- Resolution: Fixed Issue resolved by pull request 4233 [https://github.com/apache/arrow/pull

[jira] [Commented] (PARQUET-1567) [C++][Parquet] Bad initialization in statistics computation

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16832035#comment-16832035 ] Wes McKinney commented on PARQUET-1567: --- [~fsaintjacques] I moved this to PARQUET

[jira] [Moved] (PARQUET-1567) [C++][Parquet] Bad initialization in statistics computation

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-5251 to PARQUET-1567: -- Component/s: (was: C++) parquet-cpp Workf

[jira] [Resolved] (PARQUET-1405) [C++] 'Couldn't deserialize thrift' error when reading large binary column

2019-05-02 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1405. --- Resolution: Fixed Issue resolved by pull request 4230 [https://github.com/apache/arrow/pull

[jira] [Created] (PARQUET-1569) [C++] Consolidate testing header files

2019-05-03 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1569: - Summary: [C++] Consolidate testing header files Key: PARQUET-1569 URL: https://issues.apache.org/jira/browse/PARQUET-1569 Project: Parquet Issue Type: Impr

[jira] [Updated] (PARQUET-1569) [C++] Consolidate testing header files

2019-05-06 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1569: -- Fix Version/s: (was: 1.10.1) cpp-1.6.0 > [C++] Consolidate testing hea

[jira] [Resolved] (PARQUET-1569) [C++] Consolidate testing header files

2019-05-06 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1569. --- Resolution: Fixed Fix Version/s: (was: cpp-1.6.0) 1.10.1 Issu

[jira] [Moved] (PARQUET-1571) Can't read data from parquet file in C++ library

2019-05-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-5282 to PARQUET-1571: -- Workflow: patch-available, re-open possible (was: jira) Key

[jira] [Updated] (PARQUET-1571) Can't read data from parquet file in C++ library

2019-05-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1571: -- Component/s: parquet-cpp > Can't read data from parquet file in C++ library > ---

[jira] [Commented] (PARQUET-1571) [C++] Can't read data from parquet file in C++ library

2019-05-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16835235#comment-16835235 ] Wes McKinney commented on PARQUET-1571: --- I moved this from ARROW to PARQUET. Can

[jira] [Updated] (PARQUET-1571) [C++] Can't read data from parquet file in C++ library

2019-05-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1571: -- Summary: [C++] Can't read data from parquet file in C++ library (was: Can't read data from p

[jira] [Resolved] (PARQUET-1571) [C++] Can't read data from parquet file in C++ library

2019-05-08 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1571. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 4275 [http

[jira] [Created] (PARQUET-1583) [C++] Remove parquet::Vector class

2019-05-20 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1583: - Summary: [C++] Remove parquet::Vector class Key: PARQUET-1583 URL: https://issues.apache.org/jira/browse/PARQUET-1583 Project: Parquet Issue Type: Improvem

[jira] [Assigned] (PARQUET-1583) [C++] Remove parquet::Vector class

2019-05-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1583: - Assignee: Wes McKinney > [C++] Remove parquet::Vector class >

[jira] [Resolved] (PARQUET-1582) [C++] Add ToString method ColumnDescriptor

2019-05-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1582. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 4338 [http

[jira] [Assigned] (PARQUET-1582) [C++] Add ToString method ColumnDescriptor

2019-05-20 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1582: - Assignee: Micah Kornfield > [C++] Add ToString method ColumnDescriptor > -

[jira] [Resolved] (PARQUET-1402) [C++] incorrect calculation column start offset for files created by parquet-mr 1.8.1

2019-05-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1402. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 4359 [http

[jira] [Resolved] (PARQUET-1243) [C++] Improve quality of error message for zero-length files, otherwise corrupted files

2019-05-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1243. --- Resolution: Fixed Fix Version/s: (was: cpp-1.5.0) cpp-1.6.0 I

[jira] [Resolved] (PARQUET-1411) [C++] Upgrade to use LogicalType annotations instead of ConvertedType

2019-05-27 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1411. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 4185 [http

[jira] [Commented] (PARQUET-1422) [C++] Use Arrow IO interfaces natively rather than current parquet:: wrappers

2019-05-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16850097#comment-16850097 ] Wes McKinney commented on PARQUET-1422: --- [~pitrou] I ran into a snag working on t

[jira] [Commented] (PARQUET-1422) [C++] Use Arrow IO interfaces natively rather than current parquet:: wrappers

2019-05-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16850249#comment-16850249 ] Wes McKinney commented on PARQUET-1422: --- I went ahead and ported the "expanding-p

[jira] [Resolved] (PARQUET-1517) [C++] Update cpp crypto package to match signed-off specification

2019-05-29 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1517. --- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 3520 [http

[jira] [Resolved] (PARQUET-1422) [C++] Use Arrow IO interfaces natively rather than current parquet:: wrappers

2019-05-31 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1422. --- Resolution: Fixed Issue resolved by pull request 4404 [https://github.com/apache/arrow/pull

<    9   10   11   12   13   14   15   16   17   >