[jira] [Updated] (PARQUET-2110) Fix Typos in LogicalTypes.md

2022-01-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-2110: -- Fix Version/s: format-2.10.0 > Fix Typos in LogicalTypes.md > >

[jira] [Resolved] (PARQUET-2110) Fix Typos in LogicalTypes.md

2022-01-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-2110. --- Resolution: Fixed Resolved in PR https://github.com/apache/parquet-format/pull/181 > Fix

[jira] [Commented] (PARQUET-1345) [C++] It is possible to overflow a TMemoryBuffer when serializing the file metadata

2020-10-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17205494#comment-17205494 ] Wes McKinney commented on PARQUET-1345: --- Can you make a repro? Seems like something we should see

[jira] [Assigned] (PARQUET-1878) [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-09-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1878: - Assignee: Patrick Pai > [C++] lz4 codec is not compatible with Hadoop Lz4Codec >

[jira] [Resolved] (PARQUET-1878) [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-09-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1878. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7789

[jira] [Commented] (PARQUET-1904) [C++] Export file_offset in RowGroupMetaData

2020-08-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17186124#comment-17186124 ] Wes McKinney commented on PARQUET-1904: --- Done. I also made you an administrator so you can do

[jira] [Updated] (PARQUET-1904) [C++] Export file_offset in RowGroupMetaData

2020-08-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1904: -- Fix Version/s: cpp-1.6.0 > [C++] Export file_offset in RowGroupMetaData >

[jira] [Assigned] (PARQUET-1904) [C++] Export file_offset in RowGroupMetaData

2020-08-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1904: - Assignee: Simon Bertron > [C++] Export file_offset in RowGroupMetaData >

[jira] [Resolved] (PARQUET-1845) [C++] Int96 memory images in test cases assume only little-endian

2020-08-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1845. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6981

[jira] [Assigned] (PARQUET-1845) [C++] Int96 memory images in test cases assume only little-endian

2020-08-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1845: - Assignee: Kazuaki Ishizaki > [C++] Int96 memory images in test cases assume only

[jira] [Resolved] (PARQUET-1882) [C++] Writing an all-null column and then reading it with buffered_stream aborts the process

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1882. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7718

[jira] [Updated] (PARQUET-1882) [C++] Writing an all-null column and then reading it with buffered_stream aborts the process

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1882: -- Summary: [C++] Writing an all-null column and then reading it with buffered_stream aborts

[jira] [Resolved] (PARQUET-1839) [C++] values_read not updated in ReadBatchSpaced

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1839. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7717

[jira] [Updated] (PARQUET-1839) [C++] values_read not updated in ReadBatchSpaced

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1839: -- Summary: [C++] values_read not updated in ReadBatchSpaced (was: values_read not updated in

[jira] [Commented] (PARQUET-1882) Writing an all-null column and then reading it with buffered_stream aborts the process

2020-07-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17154724#comment-17154724 ] Wes McKinney commented on PARQUET-1882: --- Can you provide a reproducible code example? > Writing

[jira] [Commented] (PARQUET-1878) [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-06-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17139954#comment-17139954 ] Wes McKinney commented on PARQUET-1878: --- [~chairmank] can you also send an e-mail to

[jira] [Updated] (PARQUET-1241) [C++] Use LZ4 frame format

2020-06-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1241: -- Fix Version/s: cpp-1.6.0 > [C++] Use LZ4 frame format > -- > >

[jira] [Resolved] (PARQUET-1877) [C++] Reconcile container size with string size for memory issues

2020-06-17 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1877. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7465

[jira] [Assigned] (PARQUET-1859) [C++] Require error message when using ParquetException::EofException

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1859: - Assignee: (was: Wes McKinney) > [C++] Require error message when using

[jira] [Assigned] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1385: - Assignee: (was: Wes McKinney) > [C++] bloom_filter-test is very slow under

[jira] [Assigned] (PARQUET-1352) [CPP] Trying to write an arrow table with structs to a parquet file

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1352: - Assignee: (was: Wes McKinney) > [CPP] Trying to write an arrow table with structs

[jira] [Assigned] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1385: - Assignee: (was: Wes McKinney) > [C++] bloom_filter-test is very slow under

[jira] [Assigned] (PARQUET-838) [CPP] Unable to read files written by parquet-cpp from parquet-tools

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-838: Assignee: (was: Wes McKinney) > [CPP] Unable to read files written by parquet-cpp

[jira] [Assigned] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1385: - Assignee: Wes McKinney > [C++] bloom_filter-test is very slow under valgrind >

[jira] [Assigned] (PARQUET-443) Schema resolution: map encoding

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-443: Assignee: (was: Wes McKinney) > Schema resolution: map encoding >

[jira] [Assigned] (PARQUET-441) Schema resolution: one, two, and three-level array encoding

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-441: Assignee: (was: Wes McKinney) > Schema resolution: one, two, and three-level array

[jira] [Commented] (PARQUET-1869) [C++] Large decimal values don't roundtrip correctly

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17123699#comment-17123699 ] Wes McKinney commented on PARQUET-1869: --- I'm pretty sure this is a problem with conversion from

[jira] [Assigned] (PARQUET-1855) [C++] Improve documentation on MetaData ownership

2020-05-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1855: - Assignee: Francois Saint-Jacques > [C++] Improve documentation on MetaData ownership

[jira] [Resolved] (PARQUET-1855) [C++] Improve documentation on MetaData ownership

2020-05-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1855. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7244

[jira] [Resolved] (PARQUET-1861) [Documentation][C++] Explain ReaderProperters.buffer_stream*

2020-05-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1861. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7221

[jira] [Resolved] (PARQUET-1865) [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc

2020-05-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1865. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7237

[jira] [Assigned] (PARQUET-1865) [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc

2020-05-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1865: - Assignee: Wes McKinney > [C++] Failure from C++17 feature used in

[jira] [Created] (PARQUET-1865) [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc

2020-05-20 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1865: - Summary: [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc Key: PARQUET-1865 URL: https://issues.apache.org/jira/browse/PARQUET-1865 Project:

[jira] [Updated] (PARQUET-1861) [Documentation][C++] Explain ReaderProperters.buffer_stream*

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1861: -- Summary: [Documentation][C++] Explain ReaderProperters.buffer_stream* (was: [Documentation]

[jira] [Resolved] (PARQUET-1857) [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1857. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7108

[jira] [Commented] (PARQUET-1858) [Python] [Rust] Parquet read file fails with batch size 1_000_000 and 41 row groups

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100758#comment-17100758 ] Wes McKinney commented on PARQUET-1858: --- Yes it looks like the file written by Rust is malformed.

[jira] [Assigned] (PARQUET-1859) [C++] Require error message when using ParquetException::EofException

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1859: - Assignee: Wes McKinney > [C++] Require error message when using

[jira] [Commented] (PARQUET-1858) [Python] [Rust] Parquet read file fails with batch size 1_000_000 and 41 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100260#comment-17100260 ] Wes McKinney commented on PARQUET-1858: --- The PLAIN encoding for the boolean type is possibly

[jira] [Created] (PARQUET-1859) [C++] Require error message when using ParquetException::EofException

2020-05-05 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1859: - Summary: [C++] Require error message when using ParquetException::EofException Key: PARQUET-1859 URL: https://issues.apache.org/jira/browse/PARQUET-1859 Project:

[jira] [Commented] (PARQUET-1857) [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100084#comment-17100084 ] Wes McKinney commented on PARQUET-1857: --- I put up a PR for the first problem you reported. If

[jira] [Moved] (PARQUET-1857) [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-8677 to PARQUET-1857: -- Component/s: (was: Rust) (was:

[jira] [Created] (PARQUET-1856) [C++] Test suite assumes that Snappy support is built

2020-05-04 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1856: - Summary: [C++] Test suite assumes that Snappy support is built Key: PARQUET-1856 URL: https://issues.apache.org/jira/browse/PARQUET-1856 Project: Parquet

[jira] [Resolved] (PARQUET-1820) [C++] Use a column filter hint to inform read prefetching in Arrow reads

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1820. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6744

[jira] [Assigned] (PARQUET-1820) [C++] Use a column filter hint to inform read prefetching in Arrow reads

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1820: - Assignee: David Li > [C++] Use a column filter hint to inform read prefetching in

[jira] [Updated] (PARQUET-1820) [C++] Use a column filter hint to inform read prefetching in Arrow reads

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1820: -- Summary: [C++] Use a column filter hint to inform read prefetching in Arrow reads (was:

[jira] [Commented] (PARQUET-1404) [C++] Add index pages to the format to support efficient page skipping to parquet-cpp

2020-04-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17090617#comment-17090617 ] Wes McKinney commented on PARQUET-1404: --- Do you want to keep the discussion in one place, i.e. on

[jira] [Updated] (PARQUET-1327) [C++] Bloom filter read/write implementation

2020-04-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1327: -- Summary: [C++] Bloom filter read/write implementation (was: [C++]Bloom filter read/write

[jira] [Updated] (PARQUET-1828) [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation

2020-04-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1828: -- Summary: [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation (was: Add a

[jira] [Created] (PARQUET-1846) [C++] Remove deprecated IO classes and related functions

2020-04-19 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1846: - Summary: [C++] Remove deprecated IO classes and related functions Key: PARQUET-1846 URL: https://issues.apache.org/jira/browse/PARQUET-1846 Project: Parquet

[jira] [Resolved] (PARQUET-1835) [C++] Fix crashes on invalid input (OSS-Fuzz)

2020-04-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1835. --- Resolution: Fixed Issue resolved by pull request 6848

[jira] [Updated] (PARQUET-1834) Add Apache 2.0 license to README.md files in parquet-testing

2020-04-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1834: -- Fix Version/s: cpp-1.6.0 > Add Apache 2.0 license to README.md files in parquet-testing >

[jira] [Resolved] (PARQUET-1834) Add Apache 2.0 license to README.md files in parquet-testing

2020-04-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1834. --- Resolution: Fixed Resolved by PR

[jira] [Resolved] (PARQUET-1829) [C++] Fix crashes on invalid input (OSS-Fuzz)

2020-03-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1829. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6728

[jira] [Resolved] (PARQUET-458) [C++] Implement support for DataPageV2

2020-03-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-458. -- Resolution: Fixed Issue resolved by pull request 6481

[jira] [Commented] (PARQUET-1786) [C++] Use simd to improve BYTE_STREAM_SPLIT decoding performance

2020-03-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17066085#comment-17066085 ] Wes McKinney commented on PARQUET-1786: --- Please leave resolved issues in "Resolved" state

[jira] [Resolved] (PARQUET-1823) [C++] Invalid RowGroup returned when reading with parquet::arrow::FileReader->RowGroup(i)->Column(j)

2020-03-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1823. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6674

[jira] [Resolved] (PARQUET-1819) [C++] Fix crashes on corrupt IPC input (OSS-Fuzz)

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1819. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6659

[jira] [Updated] (PARQUET-1814) [C++] TestInt96ParquetIO failure on Windows

2020-03-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1814: -- Fix Version/s: cpp-1.6.0 > [C++] TestInt96ParquetIO failure on Windows >

[jira] [Updated] (PARQUET-1813) [C++] Remove logging statement in unit test

2020-03-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1813: -- Summary: [C++] Remove logging statement in unit test (was: [C++] Weird error output in

[jira] [Assigned] (PARQUET-1813) [C++] Weird error output in tests

2020-03-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1813: - Assignee: Wes McKinney > [C++] Weird error output in tests >

[jira] [Commented] (PARQUET-1813) [C++] Weird error output in tests

2020-03-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17058286#comment-17058286 ] Wes McKinney commented on PARQUET-1813: --- I missed the debug output in my code review

[jira] [Resolved] (PARQUET-1663) [C++] Provide API to check the presence of complex data types

2020-03-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1663. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 5490

[jira] [Commented] (PARQUET-1300) [C++] Parquet modular encryption

2020-03-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17053455#comment-17053455 ] Wes McKinney commented on PARQUET-1300: --- Anyone interested in looking at packaging issues for

[jira] [Resolved] (PARQUET-1810) [C++] Fix undefined behaviour on invalid enum values (OSS-Fuzz)

2020-03-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1810. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6537

[jira] [Resolved] (PARQUET-1780) [C++] Set ColumnMetadata.encoding_stats field

2020-03-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1780. --- Resolution: Fixed Issue resolved by pull request 6370

[jira] [Assigned] (PARQUET-1780) [C++] Set ColumnMetadata.encoding_stats field

2020-03-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1780: - Assignee: Gamage Omega Ishendra > [C++] Set ColumnMetadata.encoding_stats field >

[jira] [Updated] (PARQUET-1797) [C++] Fix fuzzing errors

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1797: -- Summary: [C++] Fix fuzzing errors (was: Fix fuzzing errors) > [C++] Fix fuzzing errors >

[jira] [Created] (PARQUET-1798) [C++] Review logic around automatic assignment of field_id's

2020-02-14 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1798: - Summary: [C++] Review logic around automatic assignment of field_id's Key: PARQUET-1798 URL: https://issues.apache.org/jira/browse/PARQUET-1798 Project: Parquet

[jira] [Resolved] (PARQUET-1788) [C++] ColumnWriter has undefined behavior when writing arrow chunks

2020-02-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1788. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6378

[jira] [Resolved] (PARQUET-1716) [C++] Add support for BYTE_STREAM_SPLIT encoding

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1716. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6005

[jira] [Assigned] (PARQUET-1716) [C++] Add support for BYTE_STREAM_SPLIT encoding

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1716: - Assignee: Martin Radev > [C++] Add support for BYTE_STREAM_SPLIT encoding >

[jira] [Commented] (PARQUET-1783) [C++] Parquet statistics wrong for dictionary type

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030199#comment-17030199 ] Wes McKinney commented on PARQUET-1783: --- I suppose it's good at least that the min/max are not

[jira] [Commented] (PARQUET-1783) [C++] Parquet statistics wrong for dictionary type

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030200#comment-17030200 ] Wes McKinney commented on PARQUET-1783: --- Do we need to create a corresponding Arrow issue so this

[jira] [Created] (PARQUET-1780) [C++] Set ColumnMetadata.encoding_stats field

2020-01-28 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1780: - Summary: [C++] Set ColumnMetadata.encoding_stats field Key: PARQUET-1780 URL: https://issues.apache.org/jira/browse/PARQUET-1780 Project: Parquet Issue

[jira] [Resolved] (PARQUET-1747) [C++] Access to ColumnChunkMetaData fails when encryption is on

2020-01-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1747. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6150

[jira] [Updated] (PARQUET-1772) [C++] ParquetFileWriter: Data overwritten when output stream opened in append mode

2020-01-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1772: -- Component/s: parquet-cpp > [C++] ParquetFileWriter: Data overwritten when output stream

[jira] [Resolved] (PARQUET-1766) [C++] parquet NaN/null double statistics can result in endless loop

2020-01-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1766. --- Resolution: Fixed Issue resolved by pull request 6167

[jira] [Resolved] (PARQUET-1769) [C++] Update to parquet-format 2.8.0

2020-01-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1769. --- Resolution: Fixed Issue resolved by pull request 6200

[jira] [Created] (PARQUET-1769) [C++] Update to parquet-format 2.8.0

2020-01-14 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1769: - Summary: [C++] Update to parquet-format 2.8.0 Key: PARQUET-1769 URL: https://issues.apache.org/jira/browse/PARQUET-1769 Project: Parquet Issue Type:

[jira] [Resolved] (PARQUET-1701) [C++] Stream API: Add support for optional fields

2020-01-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1701. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 5928

[jira] [Updated] (PARQUET-1701) [C++] Stream API: Add support for optional fields

2020-01-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1701: -- Component/s: parquet-cpp > [C++] Stream API: Add support for optional fields >

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014781#comment-17014781 ] Wes McKinney commented on PARQUET-1698: --- Currently in the C++ library, IO calls are issued

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014744#comment-17014744 ] Wes McKinney commented on PARQUET-1698: --- I think the pre-buffering should probably be implemented

[jira] [Moved] (PARQUET-1766) [C++] parquet NaN/null double statistics can result in endless loop

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-7376 to PARQUET-1766: -- Component/s: (was: C++) parquet-cpp

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014637#comment-17014637 ] Wes McKinney commented on PARQUET-1698: --- [~lidavidm] I missed the part about "wide datasets". I

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17013982#comment-17013982 ] Wes McKinney commented on PARQUET-1698: --- [~lidavidm] I'm quite interested to compare the rather

[jira] [Commented] (PARQUET-1720) [C++] Parquet JSONPrint not showing version correctly

2019-12-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17000192#comment-17000192 ] Wes McKinney commented on PARQUET-1720: --- Assuming this is a C++ issue. Can you provide detail? >

[jira] [Updated] (PARQUET-1720) [C++] Parquet JSONPrint not showing version correctly

2019-12-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1720: -- Summary: [C++] Parquet JSONPrint not showing version correctly (was: Parquet JSONPrint not

[jira] [Updated] (PARQUET-1720) Parquet JSONPrint not showing version correctly

2019-12-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1720: -- Component/s: parquet-cpp > Parquet JSONPrint not showing version correctly >

[jira] [Commented] (PARQUET-1715) [C++] Add the Parquet code samples to CI + Refactor Parquet Encryption Samples

2019-12-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16995175#comment-16995175 ] Wes McKinney commented on PARQUET-1715: --- Done > [C++] Add the Parquet code samples to CI +

[jira] [Assigned] (PARQUET-1715) [C++] Add the Parquet code samples to CI + Refactor Parquet Encryption Samples

2019-12-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1715: - Assignee: Gal Lushi > [C++] Add the Parquet code samples to CI + Refactor Parquet

[jira] [Closed] (PARQUET-1718) Store int16 as int16

2019-12-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-1718. - > Store int16 as int16 > > > Key: PARQUET-1718 >

[jira] [Updated] (PARQUET-1716) [C++] Add support for BYTE_STREAM_SPLIT encoding

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1716: -- Summary: [C++] Add support for BYTE_STREAM_SPLIT encoding (was: [C++][Parquet] Add support

[jira] [Moved] (PARQUET-1716) [C++][Parquet] Add support for Parquet's BYTE_STREAM_SPLIT encoding

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-5913 to PARQUET-1716: -- Component/s: (was: C++) parquet-cpp

[jira] [Commented] (PARQUET-1712) [C++] Stop using deprecated APIs in examples

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16992361#comment-16992361 ] Wes McKinney commented on PARQUET-1712: --- Done > [C++] Stop using deprecated APIs in examples >

[jira] [Assigned] (PARQUET-1712) [C++] Stop using deprecated APIs in examples

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1712: - Assignee: Kenta Murata > [C++] Stop using deprecated APIs in examples >

[jira] [Closed] (PARQUET-1713) [C++] Refactor Parquet Code Samples to use Result APIs

2019-12-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-1713. - > [C++] Refactor Parquet Code Samples to use Result APIs >

[jira] [Resolved] (PARQUET-1709) [C++] Avoid unnecessary temporary std::shared_ptr copies

2019-12-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1709. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 5949

[jira] [Updated] (PARQUET-1709) [C++] Avoid unnecessary temporary std::shared_ptr copies

2019-12-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1709: -- Component/s: parquet-cpp > [C++] Avoid unnecessary temporary std::shared_ptr copies >

[jira] [Resolved] (PARQUET-1702) [C++] Make BufferedRowGroupWriter compatible with parquet encryption

2019-12-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1702. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 5903

  1   2   3   4   5   6   7   8   9   10   >