[Announce] new Parquet committer Constantin Muraru

2018-05-21 Thread Julien Le Dem
We are happy to announce that Constantin has accepted the invitation to become a Parquet committer. Welcome Constantin!

Re: Parquet Data Help

2018-05-21 Thread Julien Le Dem
This sounds like a Hive question rather than a Parquet question. Did you try posting on the Hive mailing list? On Mon, May 21, 2018 at 12:59 AM, Shubham gurav wrote: > Hey Dev, > > Currently using Hive 0.13 and our database is in parquet format. When I > extract the data the output contains unic

[jira] [Commented] (PARQUET-1295) Parquet libraries do not follow proper semantic versioning

2018-05-21 Thread Vlad Rozov (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16483216#comment-16483216 ] Vlad Rozov commented on PARQUET-1295: Parquet libraries do not follow proper semant

[jira] [Updated] (PARQUET-1306) [C++] Improve code reuse and reduce redundancy between Arrow and Parquet C++ build systems

2018-05-21 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1306: Description: I would like to see if it's possible to modularize the build system in Apache Ar

[jira] [Created] (PARQUET-1306) [C++] Improve code reuse and reduce redundancy between Arrow and Parquet C++ build systems

2018-05-21 Thread Wes McKinney (JIRA)
Wes McKinney created PARQUET-1306: Summary: [C++] Improve code reuse and reduce redundancy between Arrow and Parquet C++ build systems Key: PARQUET-1306 URL: https://issues.apache.org/jira/browse/PARQUET-1306

Re: More breaking changes in the Java API and how to deal with them

2018-05-21 Thread Ryan Blue
This is why I think we should define a parquet-api module and actually make a proper API. Until we do that, we'll still have confusion over what is public or not. On Fri, May 18, 2018 at 9:25 AM, Zoltan Ivanfi wrote: > Hi, > > In recent weeks several breaking changes have been discovered in mino

[jira] [Commented] (PARQUET-1295) Parquet libraries do not follow proper semantic versioning

2018-05-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16483153#comment-16483153 ] Ryan Blue commented on PARQUET-1295: Since there is not a well-defined public API, I

[jira] [Commented] (PARQUET-951) Missing field id support in parquet metadata

2018-05-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16482831#comment-16482831 ] ASF GitHub Bot commented on PARQUET-951: qinghui-xu commented on issue #410: [PAR

Parquet Data Help

2018-05-21 Thread Shubham gurav
Hey Dev, We are currently using Hive 0.13 and our database is in Parquet format. When I extract the data, the output contains Unicode characters such as thorn delimiters (þ) or replacement characters. Do we have to migrate to the latest version, or does Hive 0.13.1 support Parquet data?