[jira] [Resolved] (PARQUET-1791) Add 'prune' command to parquet-tools

2020-02-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1791. --- Resolution: Fixed > Add 'prune' command to parquet-tools >

[jira] [Commented] (PARQUET-1381) Add merge blocks command to parquet-tools

2020-02-24 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17043505#comment-17043505 ] Gabor Szadovszky commented on PARQUET-1381: --- I don't think anyone is working on it. Feel free

[jira] [Resolved] (PARQUET-1802) CompressionCodec class not found if the codec class is not in the same defining classloader as the CodecFactory class

2020-02-24 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1802. --- Resolution: Fixed > CompressionCodec class not found if the codec class is not in

[jira] [Commented] (PARQUET-1808) SimpleGroup.toString() uses String += and so has poor performance

2020-03-03 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17050013#comment-17050013 ] Gabor Szadovszky commented on PARQUET-1808: --- [~tiddman], Thanks for filing this issue.

[jira] [Commented] (PARQUET-1784) Column-wise configuration

2020-02-06 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17031377#comment-17031377 ] Gabor Szadovszky commented on PARQUET-1784: --- [~garawalid], The idea is to use a "root" key

[jira] [Commented] (PARQUET-1787) Expected distinct numbers is not parsed correctly

2020-02-06 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17031399#comment-17031399 ] Gabor Szadovszky commented on PARQUET-1787: --- I'm working on a general concept of allowing

[jira] [Comment Edited] (PARQUET-1787) Expected distinct numbers is not parsed correctly

2020-02-06 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17031399#comment-17031399 ] Gabor Szadovszky edited comment on PARQUET-1787 at 2/6/20 9:26 AM: ---

[jira] [Commented] (PARQUET-1784) Column-wise configuration

2020-02-07 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17032196#comment-17032196 ] Gabor Szadovszky commented on PARQUET-1784: --- [~garawalid], Thanks for the research and the

[jira] [Created] (PARQUET-1784) Column-wise configuration

2020-02-05 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1784: - Summary: Column-wise configuration Key: PARQUET-1784 URL: https://issues.apache.org/jira/browse/PARQUET-1784 Project: Parquet Issue Type: New

[jira] [Updated] (PARQUET-1784) Column-wise configuration

2020-02-05 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1784: -- Description: After adding some new statistics and encodings into Parquet it is

[jira] [Updated] (PARQUET-1784) Column-wise configuration

2020-02-05 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1784: -- Description: After adding some new statistics and encodings into Parquet it is

[jira] [Updated] (PARQUET-1784) Column-wise configuration

2020-02-05 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1784: -- Description: After adding some new statistics and encodings into Parquet it is

[jira] [Commented] (PARQUET-1792) Add 'mask' command to parquet-tools/parquet-cli

2020-02-11 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17034294#comment-17034294 ] Gabor Szadovszky commented on PARQUET-1792: --- If you are talking about one file at a time you

[jira] [Resolved] (PARQUET-1794) Random data generation may cause flaky tests

2020-02-17 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1794. --- Resolution: Fixed > Random data generation may cause flaky tests >

[jira] [Commented] (PARQUET-1801) Add column index support for 'prune' command in Parquet-tools/cli

2020-02-17 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038172#comment-17038172 ] Gabor Szadovszky commented on PARQUET-1801: --- Currently, only column indexes are the special

[jira] [Resolved] (PARQUET-1796) Bump Apache Avro to 1.9.2

2020-02-14 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1796. --- Resolution: Fixed > Bump Apache Avro to 1.9.2 > - > >

[jira] [Assigned] (PARQUET-1802) CompressionCodec class not found if the codec class is not in the same defining classloader as the CodecFactory class

2020-02-20 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1802: - Assignee: Terence Yim > CompressionCodec class not found if the codec class

[jira] [Commented] (PARQUET-1774) Release parquet 1.11.1

2020-02-19 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039845#comment-17039845 ] Gabor Szadovszky commented on PARQUET-1774: --- Waiting for Spark to confirm that

[jira] [Updated] (PARQUET-1796) Bump Apache Avro to 1.9.2

2020-02-19 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1796: -- Fix Version/s: 1.11.1 > Bump Apache Avro to 1.9.2 > - > >

[jira] [Commented] (PARQUET-1744) Some filters throws ArrayIndexOutOfBoundsException

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011792#comment-17011792 ] Gabor Szadovszky commented on PARQUET-1744: --- Thanks for creating this issue. The problem is

[jira] [Updated] (PARQUET-1744) Some filters throws ArrayIndexOutOfBoundsException

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1744: -- Fix Version/s: 1.11.1 > Some filters throws ArrayIndexOutOfBoundsException >

[jira] [Updated] (PARQUET-1740) Make ParquetFileReader.getFilteredRecordCount public

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1740: -- Fix Version/s: 1.11.1 > Make ParquetFileReader.getFilteredRecordCount public >

[jira] [Resolved] (PARQUET-1740) Make ParquetFileReader.getFilteredRecordCount public

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1740. --- Resolution: Fixed > Make ParquetFileReader.getFilteredRecordCount public >

[jira] [Assigned] (PARQUET-1740) Make ParquetFileReader.getFilteredRecordCount public

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1740: - Assignee: Yuming Wang > Make ParquetFileReader.getFilteredRecordCount public

[jira] [Commented] (PARQUET-1746) Changed the data order after DataFrame reuse

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011872#comment-17011872 ] Gabor Szadovszky commented on PARQUET-1746: --- What exactly is reordered here? If it is a list

[jira] [Commented] (PARQUET-1745) No result for partition key included in Parquet file

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011868#comment-17011868 ] Gabor Szadovszky commented on PARQUET-1745: --- Unfortunately, I don't understand what exactly

[jira] [Assigned] (PARQUET-1744) Some filters throws ArrayIndexOutOfBoundsException

2020-01-09 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1744: - Assignee: Gabor Szadovszky > Some filters throws

[jira] [Commented] (PARQUET-1746) Changed the data order after DataFrame reuse

2020-01-15 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17015736#comment-17015736 ] Gabor Szadovszky commented on PARQUET-1746: --- For me the issue is reproducible with the

[jira] [Resolved] (PARQUET-1765) Invalid filteredRowCount in InternalParquetRecordReader

2020-01-16 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1765. --- Resolution: Fixed > Invalid filteredRowCount in InternalParquetRecordReader >

[jira] [Created] (PARQUET-1765) Invalid filteredRowCount in InternalParquetRecordReader

2020-01-13 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1765: - Summary: Invalid filteredRowCount in InternalParquetRecordReader Key: PARQUET-1765 URL: https://issues.apache.org/jira/browse/PARQUET-1765 Project: Parquet

[jira] [Commented] (PARQUET-1745) No result for partition key included in Parquet file

2020-01-13 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014309#comment-17014309 ] Gabor Szadovszky commented on PARQUET-1745: --- The problem here is Spark sets a projection to

[jira] [Created] (PARQUET-1774) Release parquet 1.11.1

2020-01-22 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1774: - Summary: Release parquet 1.11.1 Key: PARQUET-1774 URL: https://issues.apache.org/jira/browse/PARQUET-1774 Project: Parquet Issue Type: Task

[jira] [Resolved] (PARQUET-1745) No result for partition key included in Parquet file

2020-01-20 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1745. --- Resolution: Not A Bug Closing this issue as "Not a Bug". See my previous comment

[jira] [Resolved] (PARQUET-1746) Changed the data order after DataFrame reuse

2020-01-20 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1746. --- Resolution: Not A Problem The related Spark test generates 22 parquet files. The

[jira] [Resolved] (PARQUET-1703) Update API compatibility check

2020-01-07 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1703. --- Resolution: Fixed > Update API compatibility check >

[jira] [Updated] (PARQUET-1739) Make Spark SQL support Column indexes

2020-01-08 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1739: -- Fix Version/s: 1.11.1 > Make Spark SQL support Column indexes >

[jira] [Assigned] (PARQUET-1699) Could not resolve org.apache.yetus:audience-annotations:0.11.0

2020-04-20 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1699: - Assignee: Priyank Bagrecha > Could not resolve

[jira] [Resolved] (PARQUET-1699) Could not resolve org.apache.yetus:audience-annotations:0.11.0

2020-04-20 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1699. --- Resolution: Fixed > Could not resolve org.apache.yetus:audience-annotations:0.11.0

[jira] [Commented] (PARQUET-1739) Make Spark SQL support Column indexes

2020-04-15 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17084075#comment-17084075 ] Gabor Szadovszky commented on PARQUET-1739: --- [~yumwang], Have you succeeded to implement the

[jira] [Resolved] (PARQUET-1832) Travis fails with too long output

2020-04-15 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1832. --- Resolution: Fixed > Travis fails with too long output >

[jira] [Created] (PARQUET-1844) Removed Hadoop transitive dependency on commons-lang

2020-04-17 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1844: - Summary: Removed Hadoop transitive dependency on commons-lang Key: PARQUET-1844 URL: https://issues.apache.org/jira/browse/PARQUET-1844 Project: Parquet

[jira] [Commented] (PARQUET-1830) Vectorized API to support Column Index in Apache Spark

2020-03-27 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17068431#comment-17068431 ] Gabor Szadovszky commented on PARQUET-1830: --- [~FelixKJose], the feature of having a

[jira] [Commented] (PARQUET-1826) Document hadoop configuration options

2020-04-01 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17072587#comment-17072587 ] Gabor Szadovszky commented on PARQUET-1826: --- I was not able to find any proper documentation

[jira] [Created] (PARQUET-1832) Travis fails with too long output

2020-04-01 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1832: - Summary: Travis fails with too long output Key: PARQUET-1832 URL: https://issues.apache.org/jira/browse/PARQUET-1832 Project: Parquet Issue Type:

[jira] [Commented] (PARQUET-1830) Vectorized API to support Column Index in Apache Spark

2020-03-30 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17070830#comment-17070830 ] Gabor Szadovszky commented on PARQUET-1830: --- [~FelixKJose], agreed. So this jira is to track

[jira] [Assigned] (PARQUET-1827) UUID type currently not supported by parquet-mr

2020-03-30 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1827: - Assignee: Gabor Szadovszky > UUID type currently not supported by parquet-mr

[jira] [Resolved] (PARQUET-1805) Refactor the configuration for bloom filters

2020-03-30 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1805. --- Resolution: Fixed > Refactor the configuration for bloom filters >

[jira] [Resolved] (PARQUET-1817) Crypto Properties Factory

2020-03-30 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1817. --- Resolution: Fixed > Crypto Properties Factory > - > >

[jira] [Commented] (PARQUET-1830) Vectorized API to support Column Index in Apache Spark

2020-03-30 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17070971#comment-17070971 ] Gabor Szadovszky commented on PARQUET-1830: --- [~FelixKJose], you said you would prefer option

[jira] [Commented] (PARQUET-1830) Vectorized API to support Column Index in Apache Spark

2020-03-30 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17070993#comment-17070993 ] Gabor Szadovszky commented on PARQUET-1830: --- Agreed. That's what I wanted to say some

[jira] [Created] (PARQUET-1833) InternalParquetRecordWriter - Too much memory used

2020-04-02 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1833: - Summary: InternalParquetRecordWriter - Too much memory used Key: PARQUET-1833 URL: https://issues.apache.org/jira/browse/PARQUET-1833 Project: Parquet

[jira] [Updated] (PARQUET-1828) Add a SSE2 path for the ByteStreamSplit encoder implementation

2020-03-26 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1828: -- Component/s: parquet-cpp > Add a SSE2 path for the ByteStreamSplit encoder

[jira] [Assigned] (PARQUET-1816) Add intersection API to BloomFilter interface

2020-03-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1816: - Assignee: Walid Gara > Add intersection API to BloomFilter interface >

[jira] [Assigned] (PARQUET-1743) Add equals to BlockSplitBloomFilter

2020-03-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1743: - Assignee: Walid Gara > Add equals to BlockSplitBloomFilter >

[jira] [Assigned] (PARQUET-1826) Document hadoop configuration options

2020-03-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1826: - Assignee: Walid Gara Based on our discussion in the Parquet sync I'm

[jira] [Assigned] (PARQUET-1815) Add union API to BloomFilter interface

2020-03-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1815: - Assignee: Walid Gara > Add union API to BloomFilter interface >

[jira] [Assigned] (PARQUET-1787) Expected distinct numbers is not parsed correctly

2020-03-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1787: - Assignee: Walid Gara > Expected distinct numbers is not parsed correctly >

[jira] [Created] (PARQUET-1826) Document hadoop configuration options

2020-03-25 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1826: - Summary: Document hadoop configuration options Key: PARQUET-1826 URL: https://issues.apache.org/jira/browse/PARQUET-1826 Project: Parquet Issue

[jira] [Resolved] (PARQUET-1844) Removed Hadoop transitive dependency on commons-lang

2020-04-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1844. --- Resolution: Fixed > Removed Hadoop transitive dependency on commons-lang >

[jira] [Updated] (PARQUET-1844) Removed Hadoop transitive dependency on commons-lang

2020-04-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky updated PARQUET-1844: -- Affects Version/s: 1.11.0 > Removed Hadoop transitive dependency on commons-lang >

[jira] [Assigned] (PARQUET-1844) Removed Hadoop transitive dependency on commons-lang

2020-04-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1844: - Assignee: Gabor Szadovszky > Removed Hadoop transitive dependency on

[jira] [Resolved] (PARQUET-1826) Document hadoop configuration options

2020-04-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1826. --- Resolution: Fixed > Document hadoop configuration options >

[jira] [Resolved] (PARQUET-1863) Remove use of add-test-source mojo in parquet-protobuf

2020-05-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1863. --- Resolution: Fixed > Remove use of add-test-source mojo in parquet-protobuf >

[jira] [Assigned] (PARQUET-1863) Remove use of add-test-source mojo in parquet-protobuf

2020-05-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1863: - Assignee: Laurent Goujon > Remove use of add-test-source mojo in

[jira] [Resolved] (PARQUET-1862) A mistake of Parquet Format Thrift definition file's comment

2020-05-14 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1862. --- Resolution: Fixed > A mistake of Parquet Format Thrift definition file's comment >

[jira] [Assigned] (PARQUET-1862) A mistake of Parquet Format Thrift definition file's comment

2020-05-14 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1862: - Assignee: Liam Su > A mistake of Parquet Format Thrift definition file's

[jira] [Resolved] (PARQUET-1808) SimpleGroup.toString() uses String += and so has poor performance

2020-05-07 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1808. --- Resolution: Fixed > SimpleGroup.toString() uses String += and so has poor

[jira] [Commented] (PARQUET-1815) Add union API to BloomFilter interface

2020-03-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17061641#comment-17061641 ] Gabor Szadovszky commented on PARQUET-1815: --- If one would like to use bloom filters out of

[jira] [Resolved] (PARQUET-1811) Update download links

2020-03-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1811. --- Resolution: Fixed > Update download links > - > >

[jira] [Commented] (PARQUET-1815) Add union API to BloomFilter interface

2020-03-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17061545#comment-17061545 ] Gabor Szadovszky commented on PARQUET-1815: --- The currently implemented filters in parquet-mr

[jira] [Commented] (PARQUET-41) Add bloom filters to parquet statistics

2020-03-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17061558#comment-17061558 ] Gabor Szadovszky commented on PARQUET-41: - [~junma], the target release for this feature is

[jira] [Commented] (PARQUET-1816) Add intersection API to BloomFilter interface

2020-03-18 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17061581#comment-17061581 ] Gabor Szadovszky commented on PARQUET-1816: --- Please, find my comment at PARQUET-1815. > Add

[jira] [Commented] (PARQUET-1822) Parquet without Hadoop dependencies

2020-09-02 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17189036#comment-17189036 ] Gabor Szadovszky commented on PARQUET-1822: --- [~belugabehr], based on the current community

[jira] [Commented] (PARQUET-1923) parquet-tools 1.11.0: TestSimpleRecordConverter fails with ExceptionInInitializerError on openjdk 15

2020-10-14 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17213662#comment-17213662 ] Gabor Szadovszky commented on PARQUET-1923: --- [~bayandin], please, double check on master if

[jira] [Assigned] (PARQUET-1923) parquet-tools 1.11.0: TestSimpleRecordConverter fails with ExceptionInInitializerError on openjdk 15

2020-10-14 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1923: - Assignee: Alexander Bayandin > parquet-tools 1.11.0:

[jira] [Resolved] (PARQUET-1920) Fix issue with reading parquet files with too large column chunks

2020-10-12 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1920. --- Resolution: Fixed > Fix issue with reading parquet files with too large column

[jira] [Resolved] (PARQUET-1895) Update jackson-databind

2020-10-13 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1895. --- Resolution: Fixed > Update jackson-databind > --- > >

[jira] [Commented] (PARQUET-1927) ColumnIndex should provide number of records skipped

2020-10-19 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216534#comment-17216534 ] Gabor Szadovszky commented on PARQUET-1927: --- [~shangxinli], I am not sure I get the problem.

[jira] [Commented] (PARQUET-1927) ColumnIndex should provide number of records skipped

2020-10-20 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217525#comment-17217525 ] Gabor Szadovszky commented on PARQUET-1927: --- I get it now. Thanks for explaining. I guess

[jira] [Resolved] (PARQUET-1455) [parquet-protobuf] Handle "unknown" enum values for parquet-protobuf

2020-08-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1455. --- Resolution: Fixed > [parquet-protobuf] Handle "unknown" enum values for

[jira] [Resolved] (PARQUET-1897) Failing individual module build/test

2020-08-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1897. --- Assignee: Gabor Szadovszky Resolution: Duplicate > Failing individual module

[jira] [Resolved] (PARQUET-1774) Release parquet 1.11.1

2020-08-25 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1774. --- Resolution: Fixed I forgot to close this one after the release. It is done now. >

[jira] [Created] (PARQUET-1902) Invoke mvn clean in Travis

2020-08-24 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1902: - Summary: Invoke mvn clean in Travis Key: PARQUET-1902 URL: https://issues.apache.org/jira/browse/PARQUET-1902 Project: Parquet Issue Type: Bug

[jira] [Resolved] (PARQUET-1896) [Maven] parquet-tools build is broken

2020-08-24 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1896. --- Resolution: Fixed > [Maven] parquet-tools build is broken >

[jira] [Resolved] (PARQUET-1902) Invoke mvn clean in Travis

2020-08-24 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1902. --- Resolution: Duplicate > Invoke mvn clean in Travis > -- >

[jira] [Commented] (PARQUET-1901) Add filter null check for ColumnIndex

2020-08-24 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183312#comment-17183312 ] Gabor Szadovszky commented on PARQUET-1901: --- It is clear we shall handle this case properly.

[jira] [Commented] (PARQUET-1178) Parquet modular encryption

2020-09-28 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17203214#comment-17203214 ] Gabor Szadovszky commented on PARQUET-1178: --- I hope, we can do a release candidate next

[jira] [Commented] (PARQUET-1178) Parquet modular encryption

2020-09-28 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17203178#comment-17203178 ] Gabor Szadovszky commented on PARQUET-1178: --- [~mike_dias], the Spark community is still

[jira] [Assigned] (PARQUET-1868) Parquet reader options toggle for bloom filter toggles dictionary filtering

2020-06-02 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1868: - Assignee: Ryan Rupp > Parquet reader options toggle for bloom filter toggles

[jira] [Resolved] (PARQUET-1868) Parquet reader options toggle for bloom filter toggles dictionary filtering

2020-06-02 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1868. --- Resolution: Fixed > Parquet reader options toggle for bloom filter toggles

[jira] [Commented] (PARQUET-1864) How to generate a file with UUID as a Logical type

2020-05-20 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112041#comment-17112041 ] Gabor Szadovszky commented on PARQUET-1864: --- UUID is not yet implemented in parquet-mr. See

[jira] [Commented] (PARQUET-1842) Update Jackson Databind version to address CVE

2020-06-02 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17123800#comment-17123800 ] Gabor Szadovszky commented on PARQUET-1842: --- [~pofriel], unfortunately I do not have too much

[jira] [Commented] (PARQUET-1927) ColumnIndex should provide number of records skipped

2020-10-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17218845#comment-17218845 ] Gabor Szadovszky commented on PARQUET-1927: --- Rechecked the code again and found that 

[jira] [Resolved] (PARQUET-1528) Add JSON support to `parquet-tools head`

2020-10-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky resolved PARQUET-1528. --- Resolution: Fixed > Add JSON support to `parquet-tools head` >

[jira] [Assigned] (PARQUET-1528) Add JSON support to `parquet-tools head`

2020-10-22 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1528: - Assignee: Raphaël Afanyan > Add JSON support to `parquet-tools head` >

[jira] [Commented] (PARQUET-1927) ColumnIndex should provide number of records skipped

2020-10-27 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17221260#comment-17221260 ] Gabor Szadovszky commented on PARQUET-1927: ---

[jira] [Commented] (PARQUET-1927) ColumnIndex should provide number of records skipped

2020-10-26 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17220590#comment-17220590 ] Gabor Szadovszky commented on PARQUET-1927: --- [~sha...@uber.com], sorry for keep bothering

[jira] [Created] (PARQUET-1897) Failing individual module build/test

2020-08-12 Thread Gabor Szadovszky (Jira)
Gabor Szadovszky created PARQUET-1897: - Summary: Failing individual module build/test Key: PARQUET-1897 URL: https://issues.apache.org/jira/browse/PARQUET-1897 Project: Parquet Issue

[jira] [Commented] (PARQUET-1676) Remove hive modules

2020-08-13 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176889#comment-17176889 ] Gabor Szadovszky commented on PARQUET-1676: --- [~fokko], do you want to pick this up for

[jira] [Commented] (PARQUET-1666) Remove Unused Modules

2020-08-13 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176888#comment-17176888 ] Gabor Szadovszky commented on PARQUET-1666: --- I think, we are good to remove the Hive modules
