[jira] [Comment Edited] (PARQUET-2241) ByteStreamSplitDecoder broken in presence of nulls

2023-02-14 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17688363#comment-17688363 ] Gabor Szadovszky edited comment on PARQUET-2241 at 2/14/23 8:37 AM: -

[jira] [Commented] (PARQUET-2241) ByteStreamSplitDecoder broken in presence of nulls

2023-02-14 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17688363#comment-17688363 ] Gabor Szadovszky commented on PARQUET-2241: --- [~wgtmac], related to your quest

[jira] [Created] (PARQUET-2243) Support zstd-jni in DirectCodecFactory

2023-02-14 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-2243: - Summary: Support zstd-jni in DirectCodecFactory Key: PARQUET-2243 URL: https://issues.apache.org/jira/browse/PARQUET-2243 Project: Parquet Issue Ty

[jira] [Resolved] (PARQUET-2244) Dictionary filter may skip row-groups incorrectly when evaluating notIn

2023-02-15 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-2244. --- Resolution: Fixed > Dictionary filter may skip row-groups incorrectly when evaluati

[jira] [Assigned] (PARQUET-2244) Dictionary filter may skip row-groups incorrectly when evaluating notIn

2023-02-15 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-2244: - Assignee: Yujiang Zhong > Dictionary filter may skip row-groups incorrectly wh

[jira] [Resolved] (PARQUET-2228) ParquetRewriter supports more than one input file

2023-02-21 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-2228. --- Resolution: Fixed > ParquetRewriter supports more than one input file > ---

[jira] [Resolved] (PARQUET-2241) ByteStreamSplitDecoder broken in presence of nulls

2023-02-21 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-2241. --- Resolution: Fixed > ByteStreamSplitDecoder broken in presence of nulls > --

[jira] [Assigned] (PARQUET-2247) Fail-fast if CapacityByteArrayOutputStream write overflow

2023-02-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-2247: - Assignee: Gabor Szadovszky > Fail-fast if CapacityByteArrayOutputStream write

[jira] [Resolved] (PARQUET-2247) Fail-fast if CapacityByteArrayOutputStream write overflow

2023-02-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-2247. --- Resolution: Fixed > Fail-fast if CapacityByteArrayOutputStream write overflow > ---

[jira] [Assigned] (PARQUET-2247) Fail-fast if CapacityByteArrayOutputStream write overflow

2023-02-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-2247: - Assignee: dzcxzl (was: Gabor Szadovszky) > Fail-fast if CapacityByteArrayOutp

[jira] [Resolved] (PARQUET-2243) Support zstd-jni in DirectCodecFactory

2023-02-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-2243. --- Resolution: Fixed > Support zstd-jni in DirectCodecFactory > --

[jira] [Assigned] (PARQUET-2246) Add short circuit logic to column index filter

2023-02-23 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-2246: - Assignee: Yujiang Zhong > Add short circuit logic to column index filter > ---

[jira] [Resolved] (PARQUET-2246) Add short circuit logic to column index filter

2023-02-23 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-2246. --- Resolution: Fixed > Add short circuit logic to column index filter > --

[jira] [Assigned] (PARQUET-2254) Build a BloomFilter with a more precise size

2023-03-07 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-2254: - Assignee: Mars > Build a BloomFilter with a more precise size > --

[jira] [Commented] (PARQUET-2254) Build a BloomFilter with a more precise size

2023-03-07 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17697301#comment-17697301 ] Gabor Szadovszky commented on PARQUET-2254: --- I think this is a good idea. Mea

[jira] [Commented] (PARQUET-2254) Build a BloomFilter with a more precise size

2023-03-07 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17697510#comment-17697510 ] Gabor Szadovszky commented on PARQUET-2254: --- 1) I think, for creating bloom f

[jira] [Commented] (PARQUET-2255) BloomFilter and float point is ambiguous

2023-03-13 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17699712#comment-17699712 ] Gabor Szadovszky commented on PARQUET-2255: --- Bloom filters are for searching

[jira] [Commented] (PARQUET-2255) BloomFilter and float point is ambiguous

2023-03-13 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17699732#comment-17699732 ] Gabor Szadovszky commented on PARQUET-2255: --- But we don't build the dictionar

[jira] [Commented] (PARQUET-1690) Integer Overflow of BinaryStatistics#isSmallerThan()

2023-03-17 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17701561#comment-17701561 ] Gabor Szadovszky commented on PARQUET-1690: --- [~humanoid], I don't know/rememb

[jira] [Commented] (PARQUET-2258) Storing toString fields in FilterPredicate instances can lead to memory pressure

2023-03-17 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17701568#comment-17701568 ] Gabor Szadovszky commented on PARQUET-2258: --- Thanks for fixing this, [~abstra

[jira] [Assigned] (PARQUET-2256) Adding Compression for BloomFilter

2023-03-17 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-2256: - Assignee: Xuwei Fu > Adding Compression for BloomFilter >

[jira] [Commented] (PARQUET-2256) Adding Compression for BloomFilter

2023-03-17 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17701575#comment-17701575 ] Gabor Szadovszky commented on PARQUET-2256: --- [~mwish], would you mind to do s

[jira] [Commented] (PARQUET-2276) ParquetReader reads do not work with Hadoop version 2.8.5

2023-04-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17713635#comment-17713635 ] Gabor Szadovszky commented on PARQUET-2276: --- I think it is fine to drop suppo

[jira] [Commented] (PARQUET-1152) Parquet-thrift doesn't compile with Thrift 0.9.3

2017-10-31 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16226741#comment-16226741 ] Gabor Szadovszky commented on PARQUET-1152: --- The new parquet-format release {{

[jira] [Assigned] (PARQUET-1025) Support new min-max statistics in parquet-mr

2017-11-07 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1025: - Assignee: Gabor Szadovszky > Support new min-max statistics in parquet-mr > ---

[jira] [Commented] (PARQUET-1025) Support new min-max statistics in parquet-mr

2017-11-20 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16259224#comment-16259224 ] Gabor Szadovszky commented on PARQUET-1025: --- To implement the new statistics w

[jira] [Commented] (PARQUET-1025) Support new min-max statistics in parquet-mr

2017-11-24 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16265132#comment-16265132 ] Gabor Szadovszky commented on PARQUET-1025: --- On the 11/22 Parquet Sync meeting

[jira] [Created] (PARQUET-1170) Implement toString based on logical type so values will be represented properly in tools/logs etc.

2017-12-06 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1170: - Summary: Implement toString based on logical type so values will be represented properly in tools/logs etc. Key: PARQUET-1170 URL: https://issues.apache.org/jira/browse/

[jira] [Assigned] (PARQUET-386) Printing out the statistics of metadata in parquet-tools

2017-12-29 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-386: Assignee: Gabor Szadovszky > Printing out the statistics of metadata in parquet-to

[jira] [Created] (PARQUET-1198) Bump java source and target to java8

2018-01-18 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1198: - Summary: Bump java source and target to java8 Key: PARQUET-1198 URL: https://issues.apache.org/jira/browse/PARQUET-1198 Project: Parquet Issue Type

[jira] [Created] (PARQUET-1201) Implement index pages

2018-01-22 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1201: - Summary: Implement index pages Key: PARQUET-1201 URL: https://issues.apache.org/jira/browse/PARQUET-1201 Project: Parquet Issue Type: New Feature

[jira] [Commented] (PARQUET-1201) Implement page indexes

2018-02-01 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16348720#comment-16348720 ] Gabor Szadovszky commented on PARQUET-1201: --- [~rdblue], I've added a pull requ

[jira] [Commented] (PARQUET-922) Add index pages to the format to support efficient page skipping

2018-02-07 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16356582#comment-16356582 ] Gabor Szadovszky commented on PARQUET-922: -- Hi [~legend], I am working on it. A

[jira] [Resolved] (PARQUET-1206) Parquet properties are ignored at table creation time

2018-02-08 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1206. --- Resolution: Invalid There is a huge difference between Hive and Impala. Hive uses pa

[jira] [Updated] (PARQUET-1207) Write index page in parquet file

2018-02-11 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1207: -- Description: PARQUET-922 has been resolved, parquet-format 2.4.0 supported index page.

[jira] [Commented] (PARQUET-1207) Write index page in parquet file

2018-02-11 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16360019#comment-16360019 ] Gabor Szadovszky commented on PARQUET-1207: --- Hi [~legend], When I've created

[jira] [Commented] (PARQUET-1207) Write index page in parquet file

2018-02-12 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16360407#comment-16360407 ] Gabor Szadovszky commented on PARQUET-1207: --- Hi [~legend], The first phase I'

[jira] [Updated] (PARQUET-1201) Write column indexes

2018-02-13 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1201: -- Summary: Write column indexes (was: Implement page indexes) > Write column indexes >

[jira] [Updated] (PARQUET-1201) Write column indexes

2018-02-13 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1201: -- Description: Write the column indexes described in PARQUET-922. This is the first phas

[jira] [Updated] (PARQUET-1201) Write column indexes

2018-02-13 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1201: -- Description: Write the column indexes described in PARQUET-922. This is the first phas

[jira] [Created] (PARQUET-1211) Write column indexes: read/write API

2018-02-13 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1211: - Summary: Write column indexes: read/write API Key: PARQUET-1211 URL: https://issues.apache.org/jira/browse/PARQUET-1211 Project: Parquet Issue Type

[jira] [Created] (PARQUET-1212) Write column indexes: Show indexes in tools

2018-02-13 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1212: - Summary: Write column indexes: Show indexes in tools Key: PARQUET-1212 URL: https://issues.apache.org/jira/browse/PARQUET-1212 Project: Parquet Iss

[jira] [Created] (PARQUET-1213) Write column indexes: Limit index size

2018-02-13 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1213: - Summary: Write column indexes: Limit index size Key: PARQUET-1213 URL: https://issues.apache.org/jira/browse/PARQUET-1213 Project: Parquet Issue Ty

[jira] [Created] (PARQUET-1214) Write column indexes: Truncate min/max values

2018-02-13 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1214: - Summary: Write column indexes: Truncate min/max values Key: PARQUET-1214 URL: https://issues.apache.org/jira/browse/PARQUET-1214 Project: Parquet I

[jira] [Assigned] (PARQUET-1213) Write column indexes: Limit index size

2018-02-14 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1213: - Assignee: Gabor Szadovszky > Write column indexes: Limit index size > -

[jira] [Assigned] (PARQUET-1214) Write column indexes: Truncate min/max values

2018-02-14 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1214: - Assignee: Gabor Szadovszky > Write column indexes: Truncate min/max values > --

[jira] [Created] (PARQUET-1217) Incorrect check for null min/max values in StatisticsFilter

2018-02-15 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1217: - Summary: Incorrect check for null min/max values in StatisticsFilter Key: PARQUET-1217 URL: https://issues.apache.org/jira/browse/PARQUET-1217 Project: Parq

[jira] [Updated] (PARQUET-1217) Missing check for null min/max values in StatisticsFilter

2018-02-15 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1217: -- Summary: Missing check for null min/max values in StatisticsFilter (was: Incorrect ch

[jira] [Updated] (PARQUET-1217) Incorrect handling of missing values in Statistics

2018-02-16 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1217: -- Summary: Incorrect handling of missing values in Statistics (was: Missing check for n

[jira] [Updated] (PARQUET-1217) Incorrect handling of missing values in Statistics

2018-02-16 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1217: -- Description: As per the parquet-format specs the min/max values in statistics are opti

[jira] [Created] (PARQUET-1234) Release Parquet format 2.5.0

2018-02-21 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1234: - Summary: Release Parquet format 2.5.0 Key: PARQUET-1234 URL: https://issues.apache.org/jira/browse/PARQUET-1234 Project: Parquet Issue Type: Task

[jira] [Updated] (PARQUET-1234) Release Parquet format 2.5.0

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1234: -- Fix Version/s: format-2.5.0 > Release Parquet format 2.5.0 > -

[jira] [Updated] (PARQUET-1234) Release Parquet format 2.5.0

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1234: -- Affects Version/s: format-2.5.0 > Release Parquet format 2.5.0 > -

[jira] [Updated] (PARQUET-1145) Add license to .gitignore and .travis.yml

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1145: -- Fix Version/s: format-2.5.0 > Add license to .gitignore and .travis.yml >

[jira] [Updated] (PARQUET-1064) Deprecate type-defined sort ordering for INTERVAL type

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1064: -- Fix Version/s: format-2.5.0 > Deprecate type-defined sort ordering for INTERVAL type >

[jira] [Updated] (PARQUET-1156) dev/merge_parquet_pr.py problems

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1156: -- Fix Version/s: format-2.5.0 > dev/merge_parquet_pr.py problems > -

[jira] [Updated] (PARQUET-1065) Deprecate type-defined sort ordering for INT96 type

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1065: -- Fix Version/s: format-2.5.0 > Deprecate type-defined sort ordering for INT96 type > --

[jira] [Updated] (PARQUET-1171) [C++] Clarify valid uses for RLE, BIT_PACKED encodings

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1171: -- Fix Version/s: (was: format-2.4.0) format-2.5.0 > [C++] Clarify

[jira] [Updated] (PARQUET-1197) Log rat failures

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1197: -- Fix Version/s: format-2.5.0 > Log rat failures > > >

[jira] [Updated] (PARQUET-1201) Write column indexes

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1201: -- Fix Version/s: format-2.5.0 > Write column indexes > > >

[jira] [Updated] (PARQUET-1222) Definition of float and double sort order is ambigious

2018-02-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1222: -- Fix Version/s: format-2.5.0 > Definition of float and double sort order is ambigious >

[jira] [Created] (PARQUET-1246) Ignore float/double statistics in case of NaN

2018-03-13 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1246: - Summary: Ignore float/double statistics in case of NaN Key: PARQUET-1246 URL: https://issues.apache.org/jira/browse/PARQUET-1246 Project: Parquet I

[jira] [Assigned] (PARQUET-1212) Write column indexes: Show indexes in tools

2018-03-14 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1212: - Assignee: Gabor Szadovszky > Write column indexes: Show indexes in tools >

[jira] [Created] (PARQUET-1251) Describe handling of the ambigous min/max statistics for FLOAT/DOUBLE

2018-03-20 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1251: - Summary: Describe handling of the ambigous min/max statistics for FLOAT/DOUBLE Key: PARQUET-1251 URL: https://issues.apache.org/jira/browse/PARQUET-1251 Pro

[jira] [Updated] (PARQUET-1222) Definition of float and double sort order is ambiguous

2018-03-20 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1222: -- Fix Version/s: (was: format-2.5.0) > Definition of float and double sort order is

[jira] [Updated] (PARQUET-1251) Clarify ambiguous min/max stats for FLOAT/DOUBLE

2018-03-20 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1251: -- Summary: Clarify ambiguous min/max stats for FLOAT/DOUBLE (was: Describe handling of

[jira] [Updated] (PARQUET-1251) Clarify ambiguous min/max stats for FLOAT/DOUBLE

2018-03-20 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1251: -- Fix Version/s: format-2.5.0 > Clarify ambiguous min/max stats for FLOAT/DOUBLE > -

[jira] [Assigned] (PARQUET-1236) Upgrade org.slf4j:slf4j-api:1.7.2 to 1.7.12

2018-03-22 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1236: - Assignee: PandaMonkey > Upgrade org.slf4j:slf4j-api:1.7.2 to 1.7.12 > -

[jira] [Commented] (PARQUET-1173) com.fasterxml.jackson.core.jackson dependency harmonization

2018-03-22 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16409429#comment-16409429 ] Gabor Szadovszky commented on PARQUET-1173: --- I don't think we should synchroni

[jira] [Created] (PARQUET-1258) Update scm developer connection to github

2018-03-28 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1258: - Summary: Update scm developer connection to github Key: PARQUET-1258 URL: https://issues.apache.org/jira/browse/PARQUET-1258 Project: Parquet Issue

[jira] [Assigned] (PARQUET-1261) Parquet-format interns strings when reading filemetadata

2018-04-03 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1261: - Assignee: Robert Kruszewski > Parquet-format interns strings when reading filem

[jira] [Assigned] (PARQUET-1234) Release Parquet format 2.5.0

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1234: - Assignee: Gabor Szadovszky > Release Parquet format 2.5.0 > ---

[jira] [Resolved] (PARQUET-1234) Release Parquet format 2.5.0

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1234. --- Resolution: Fixed > Release Parquet format 2.5.0 > > >

[jira] [Closed] (PARQUET-1156) dev/merge_parquet_pr.py problems

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1156. - > dev/merge_parquet_pr.py problems > > > Ke

[jira] [Closed] (PARQUET-1197) Log rat failures

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1197. - > Log rat failures > > > Key: PARQUET-1197 >

[jira] [Closed] (PARQUET-1064) Deprecate type-defined sort ordering for INTERVAL type

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1064. - > Deprecate type-defined sort ordering for INTERVAL type > -

[jira] [Closed] (PARQUET-323) INT96 should be marked as deprecated

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-323. > INT96 should be marked as deprecated > > >

[jira] [Closed] (PARQUET-1242) parquet.thrift refers to wrong releases for the new compressions

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1242. - > parquet.thrift refers to wrong releases for the new compressions > ---

[jira] [Closed] (PARQUET-1234) Release Parquet format 2.5.0

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1234. - > Release Parquet format 2.5.0 > > > Key: PARQU

[jira] [Closed] (PARQUET-1065) Deprecate type-defined sort ordering for INT96 type

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1065. - > Deprecate type-defined sort ordering for INT96 type >

[jira] [Closed] (PARQUET-1145) Add license to .gitignore and .travis.yml

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1145. - > Add license to .gitignore and .travis.yml > - > >

[jira] [Closed] (PARQUET-1258) Update scm developer connection to github

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1258. - > Update scm developer connection to github > - > >

[jira] [Closed] (PARQUET-1236) Upgrade org.slf4j:slf4j-api:1.7.2 to 1.7.12

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1236. - > Upgrade org.slf4j:slf4j-api:1.7.2 to 1.7.12 > ---

[jira] [Closed] (PARQUET-1251) Clarify ambiguous min/max stats for FLOAT/DOUBLE

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1251. - > Clarify ambiguous min/max stats for FLOAT/DOUBLE > ---

[jira] [Closed] (PARQUET-1171) [C++] Clarify valid uses for RLE, BIT_PACKED encodings

2018-04-18 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky closed PARQUET-1171. - > [C++] Clarify valid uses for RLE, BIT_PACKED encodings > -

[jira] [Created] (PARQUET-1275) Travis fails with missing protobuf tar on branch 1.8.x

2018-04-19 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1275: - Summary: Travis fails with missing protobuf tar on branch 1.8.x Key: PARQUET-1275 URL: https://issues.apache.org/jira/browse/PARQUET-1275 Project: Parquet

[jira] [Updated] (PARQUET-1275) Travis fails with missing protobuf tar on branch 1.8.x

2018-04-19 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1275: -- Fix Version/s: 1.8.3 > Travis fails with missing protobuf tar on branch 1.8.x > --

[jira] [Updated] (PARQUET-1275) Travis fails on branch 1.8.x

2018-04-19 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1275: -- Summary: Travis fails on branch 1.8.x (was: Travis fails with missing protobuf tar on

[jira] [Created] (PARQUET-1277) Release Parquet-mr 1.8.3

2018-04-19 Thread Gabor Szadovszky (JIRA)
Gabor Szadovszky created PARQUET-1277: - Summary: Release Parquet-mr 1.8.3 Key: PARQUET-1277 URL: https://issues.apache.org/jira/browse/PARQUET-1277 Project: Parquet Issue Type: Task

[jira] [Assigned] (PARQUET-1277) Release Parquet-mr 1.8.3

2018-04-19 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1277: - Assignee: Gabor Szadovszky > Release Parquet-mr 1.8.3 > ---

[jira] [Updated] (PARQUET-1277) Release Parquet-mr 1.8.3

2018-04-19 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1277: -- Component/s: parquet-mr > Release Parquet-mr 1.8.3 > > >

[jira] [Updated] (PARQUET-580) Potentially unnecessary creation of large int[] in IntList for columns that aren't used

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-580: - Fix Version/s: 1.8.2 > Potentially unnecessary creation of large int[] in IntList for col

[jira] [Updated] (PARQUET-372) Parquet stats can have awkwardly large values

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-372: - Fix Version/s: 1.8.2 > Parquet stats can have awkwardly large values > --

[jira] [Updated] (PARQUET-413) Test failures for Java 8

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-413: - Fix Version/s: 1.8.2 > Test failures for Java 8 > > >

[jira] [Updated] (PARQUET-669) Allow reading file footers from input streams when writing metadata files

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-669: - Fix Version/s: 1.8.2 > Allow reading file footers from input streams when writing metadat

[jira] [Updated] (PARQUET-342) Can't build Parquet on Java 6

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-342: - Fix Version/s: 1.8.2 > Can't build Parquet on Java 6 > - > >

[jira] [Updated] (PARQUET-642) Improve performance of ByteBuffer based read / write paths

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-642: - Fix Version/s: 1.8.2 > Improve performance of ByteBuffer based read / write paths > -

[jira] [Updated] (PARQUET-529) Avoid evoking job.toString() in ParquetLoader

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-529: - Fix Version/s: 1.8.2 > Avoid evoking job.toString() in ParquetLoader > --

[jira] [Updated] (PARQUET-348) shouldIgnoreStatistics too noisy

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-348: - Fix Version/s: 1.8.2 > shouldIgnoreStatistics too noisy > ---

[jira] [Updated] (PARQUET-422) Fix a potential bug in MessageTypeParser where we ignore and overwrite the initial value of a method parameter

2018-04-21 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-422: - Fix Version/s: 1.8.2 > Fix a potential bug in MessageTypeParser where we ignore and overw
