[jira] [Commented] (PARQUET-1300) [C++] Parquet modular encryption

2018-06-25 Thread Gidon Gershinsky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523308#comment-16523308 ] Gidon Gershinsky commented on PARQUET-1300: --- [~thamha] Sounds good, I'll upda

[jira] [Commented] (PARQUET-1300) [C++] Parquet modular encryption

2018-06-25 Thread Tham (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523291#comment-16523291 ] Tham commented on PARQUET-1300: --- [~gershinsky] My code is based on 2 pull requests you cr

[jira] [Commented] (PARQUET-1300) [C++] Parquet modular encryption

2018-06-25 Thread Gidon Gershinsky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523271#comment-16523271 ] Gidon Gershinsky commented on PARQUET-1300: --- The Java version > [C++] Parque

[jira] [Commented] (PARQUET-1300) [C++] Parquet modular encryption

2018-06-25 Thread Gidon Gershinsky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523267#comment-16523267 ] Gidon Gershinsky commented on PARQUET-1300: --- [~thamha] this is excellent news

How to set default maximum size of bloom filter?

2018-06-25 Thread 俊杰陈
Hi devs I'm now implementing bloom filter feature and need to set a default maximum value for bloom filter size for a block. According to calculation here , I plan to set maximum size to 1/8 of parquet

[jira] [Commented] (PARQUET-1300) [C++] Parquet modular encryption

2018-06-25 Thread Tham (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523113#comment-16523113 ] Tham commented on PARQUET-1300: --- [~mdeepak] [~gershinsky] I've just discovered this ticke

[jira] [Updated] (PARQUET-1334) [C++] memory_map parameter seems missleading in parquet file opener

2018-06-25 Thread Philipp Hoch (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Philipp Hoch updated PARQUET-1334: -- Affects Version/s: (was: 1.9.0) cpp-1.4.0 > [C++] memory_map param

[jira] [Updated] (PARQUET-1333) [C++] Reading of files with dictionary size 0 fails on Windows with bad_alloc

2018-06-25 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn updated PARQUET-1333: - Fix Version/s: cpp-1.5.0 > [C++] Reading of files with dictionary size 0 fails on Windows with

[jira] [Updated] (PARQUET-1333) [C++] Reading of files with dictionary size 0 fails on Windows with bad_alloc

2018-06-25 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn updated PARQUET-1333: - Affects Version/s: (was: 1.9.0) cpp-1.4.0 > [C++] Reading of files w

[jira] [Commented] (PARQUET-1338) PrimitiveType.equals throw NPE

2018-06-25 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16522204#comment-16522204 ] ASF GitHub Bot commented on PARQUET-1338: - wangyum opened a new pull request #4

[jira] [Updated] (PARQUET-1338) PrimitiveType.equals throw NPE

2018-06-25 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1338: Labels: pull-request-available (was: ) > PrimitiveType.equals throw NPE > --

[jira] [Created] (PARQUET-1338) PrimitiveType.equals throw NPE

2018-06-25 Thread Yuming Wang (JIRA)
Yuming Wang created PARQUET-1338: Summary: PrimitiveType.equals throw NPE Key: PARQUET-1338 URL: https://issues.apache.org/jira/browse/PARQUET-1338 Project: Parquet Issue Type: Bug

[jira] [Assigned] (PARQUET-1338) PrimitiveType.equals throw NPE

2018-06-25 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned PARQUET-1338: Assignee: Yuming Wang > PrimitiveType.equals throw NPE > --

[jira] [Updated] (PARQUET-1336) PrimitiveComparator should implements Serializable

2018-06-25 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated PARQUET-1336: - Summary: PrimitiveComparator should implements Serializable (was: BinaryComparator should imp

Re: Estimated row-group size is significantly higher than the written one

2018-06-25 Thread Gabor Szadovszky
Thanks a lot, Ryan. Created the JIRA PARQUET-1337 to track it. On Sat, Jun 23, 2018 at 1:29 AM Ryan Blue wrote: > I think you're right about the cause. The current estimate is what is > buffered in memory, so it includes all of the intermediate data for the > last page before it is finalized and