[jira] [Created] (PARQUET-1595) Parquet proto writer de-nest Protobuf wrapper classes

2019-06-11 Thread Ying Xu (JIRA)
Ying Xu created PARQUET-1595: Summary: Parquet proto writer de-nest Protobuf wrapper classes Key: PARQUET-1595 URL: https://issues.apache.org/jira/browse/PARQUET-1595 Project: Parquet Issue Type:

[jira] [Created] (PARQUET-1594) Parquet File is not able to read from Spark and Hive

2019-06-11 Thread Prashanth pampanna desai (JIRA)
Prashanth pampanna desai created PARQUET-1594: - Summary: Parquet File is not able to read from Spark and Hive Key: PARQUET-1594 URL: https://issues.apache.org/jira/browse/PARQUET-1594 Proje

[jira] [Updated] (PARQUET-1580) Page-level CRC checksum verification for DataPageV1

2019-06-11 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1580: Labels: pull-request-available (was: ) > Page-level CRC checksum verification for DataPa

[jira] [Commented] (PARQUET-1580) Page-level CRC checksum verification for DataPageV1

2019-06-11 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861280#comment-16861280 ] ASF GitHub Bot commented on PARQUET-1580: - bbraams commented on pull request #6

[jira] [Updated] (PARQUET-1580) Page-level CRC checksum verification for DataPageV1

2019-06-11 Thread Boudewijn Braams (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boudewijn Braams updated PARQUET-1580: -- Summary: Page-level CRC checksum verification for DataPageV1 (was: Page-level checks

[jira] [Updated] (PARQUET-1580) Page-level checksum verification for DataPageV1

2019-06-11 Thread Boudewijn Braams (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boudewijn Braams updated PARQUET-1580: -- Summary: Page-level checksum verification for DataPageV1 (was: Implement page-level

[jira] [Updated] (PARQUET-1593) Replace the example usage in parquet-cli's help message with an actually existent subcommand

2019-06-11 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1593: Labels: pull-request-available (was: ) > Replace the example usage in parquet-cli's help

[jira] [Commented] (PARQUET-1593) Replace the example usage in parquet-cli's help message with an actually existent subcommand

2019-06-11 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861160#comment-16861160 ] ASF GitHub Bot commented on PARQUET-1593: - sekikn commented on pull request #64

[jira] [Created] (PARQUET-1593) Replace the example usage in parquet-cli's help message with an actually existent subcommand

2019-06-11 Thread Kengo Seki (JIRA)
Kengo Seki created PARQUET-1593: --- Summary: Replace the example usage in parquet-cli's help message with an actually existent subcommand Key: PARQUET-1593 URL: https://issues.apache.org/jira/browse/PARQUET-1593

[jira] [Commented] (PARQUET-1589) Bump Java to 8

2019-06-11 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861053#comment-16861053 ] ASF GitHub Bot commented on PARQUET-1589: - zivanfi commented on pull request #1

[jira] [Updated] (PARQUET-1590) [parquet-format] Add Java 11 to Travis

2019-06-11 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1590: --- Summary: [parquet-format] Add Java 11 to Travis (was: Build against Java 11) > [parquet-f

[jira] [Reopened] (PARQUET-1590) Build against Java 11

2019-06-11 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi reopened PARQUET-1590: > Build against Java 11 > - > > Key: PARQUET-1590 >

[jira] [Updated] (PARQUET-1499) [parquet-mr] Add Java 11 to Travis

2019-06-11 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1499: --- Summary: [parquet-mr] Add Java 11 to Travis (was: Add Java 11 build to the repository) >

[jira] [Updated] (PARQUET-1592) update hash naming of bloom filter

2019-06-11 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-1592: Labels: pull-request-available (was: ) > update hash naming of bloom filter > --

[jira] [Commented] (PARQUET-1592) update hash naming of bloom filter

2019-06-11 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16860996#comment-16860996 ] ASF GitHub Bot commented on PARQUET-1592: - chenjunjiedada commented on pull req

bloomfilter and tokenisation

2019-06-11 Thread Manik Singla
Hey Team I have started using parquet recently. Kind of data I save is something like *raw hostname cluster serviceName * where raw is actual log lines. For raw, dictionary doesn't work as we no 2 log lines are same. But if we tokenise terms in dictionary, then dictionary can help here to f

Re: [vote] Merge bloom-filter branch to master

2019-06-11 Thread 俊杰陈
Thanks Zoltan I planed to update naming issue when we have another update. Let me open a jira to do this. For the hash choice from Todd, both of xxh3 and murmur3 can be coexist at same time, so I planed to add xxh3 later since it needs some effort to implement and benchmark. On Tue, Jun 11, 20

[jira] [Resolved] (PARQUET-1588) Bump Apache Thrift to 0.12.0

2019-06-11 Thread Fokko Driesprong (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fokko Driesprong resolved PARQUET-1588. --- Resolution: Duplicate > Bump Apache Thrift to 0.12.0 >

[jira] [Resolved] (PARQUET-1590) Build against Java 11

2019-06-11 Thread Fokko Driesprong (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fokko Driesprong resolved PARQUET-1590. --- Resolution: Duplicate > Build against Java 11 > - > >

[jira] [Created] (PARQUET-1592) update hash naming of bloom filter

2019-06-11 Thread Junjie Chen (JIRA)
Junjie Chen created PARQUET-1592: Summary: update hash naming of bloom filter Key: PARQUET-1592 URL: https://issues.apache.org/jira/browse/PARQUET-1592 Project: Parquet Issue Type: Sub-task

Re: Add support for Java 11

2019-06-11 Thread Zoltan Ivanfi
Sound great, thanks! Zoltan On Tue, Jun 11, 2019 at 1:41 PM Driesprong, Fokko wrote: > > I've missed that one. Thanks Zoltan. > > It is quite interesting. For example, we need to update the parquet-format > first, in order to update the Scala version of parquet-mr >

Re: Add support for Java 11

2019-06-11 Thread Driesprong, Fokko
I've missed that one. Thanks Zoltan. It is quite interesting. For example, we need to update the parquet-format first, in order to update the Scala version of parquet-mr . Would be great to fix all the remaining issues that are blocking for parqu

Re: Add support for Java 11

2019-06-11 Thread Zoltan Ivanfi
Hi Fokko, Have you seen https://issues.apache.org/jira/browse/PARQUET-1551 and its children? There are some more blocking issues mentioned there. Br, Zoltan On Mon, Jun 10, 2019 at 9:19 PM Driesprong, Fokko wrote: > > Hi all, > > I'm working towards making Parquet compatible with Java 11. Woul

Re: [vote] Merge bloom-filter branch to master

2019-06-11 Thread Zoltan Ivanfi
Hi, It has been merged into master but has not been released yet. In fact, I asked for a minor change before releasing it: https://github.com/apache/parquet-format/commit/54839ad5e04314c944fed8aa4bc6cf15e4a58698#r31084264 It may seem like a nit, but I think the naming of the parquet structures is