[GitHub] [orc] stiga-huang opened a new pull request #1020: ORC-1098: [C++] Support specifying type ids or column names in cpp tools
stiga-huang opened a new pull request #1020:
URL: https://github.com/apache/orc/pull/1020

### What changes were proposed in this pull request?
This is a follow-up task of #921. Currently the tools have options to work on specified top-level column fields. However, ACID ORC files usually have a nested structure, so type ids are needed to specify nested columns. As an extension, support for column names is also helpful, so that users don't need to convert column names to type ids manually.

This PR extracts the option-parsing code into ToolsHelper so that similar cpp tools can share the same option set.

### Why are the changes needed?
It makes the tools more useful in practice.

### How was this patch tested?
Added unit tests for the new options.

--
This is an automated message from the Apache Git Service.
[jira] [Created] (ORC-1098) [C++] Support specifying column names and type ids in tools
Quanlong Huang created ORC-1098:
-----------------------------------

    Summary: [C++] Support specifying column names and type ids in tools
    Key: ORC-1098
    URL: https://issues.apache.org/jira/browse/ORC-1098
    Project: ORC
    Issue Type: Improvement
    Components: C++, tools
    Reporter: Quanlong Huang
    Assignee: Quanlong Huang

This is a follow-up task of ORC-1012. Currently the tools have options to work on specified top-level column fields. However, ACID ORC files usually have a nested structure, so type ids are needed to specify nested columns. As an extension, support for column names is also helpful, so that users don't need to convert column names to type ids manually.

--
This message was sent by Atlassian Jira
(v8.20.1#820001)
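For readers unfamiliar with type ids: ORC numbers the nodes of a schema tree in pre-order, with the root struct as id 0, which is why a nested field has a type id but no top-level column index. The sketch below illustrates that numbering and the name-to-id conversion the issue wants to automate. It is a standalone toy, not the ToolsHelper code from the PR: `SchemaNode`, `leaf`, `buildNameToTypeId`, and `makeAcidLikeSchema` are hypothetical names, the dotted-path convention is an assumption, and the fields under `row` are made up for illustration.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Toy schema node. This is NOT orc::Type from the ORC C++ library.
struct SchemaNode {
  std::string name;                  // field name ("" for the root struct)
  std::vector<SchemaNode> children;  // empty for primitive types
};

// Convenience helper for a primitive (childless) field.
SchemaNode leaf(const std::string& name) { return SchemaNode{name, {}}; }

// Pre-order walk: a node takes the next free id before any of its children.
void assignTypeIds(const SchemaNode& node, const std::string& path,
                   uint64_t& nextId, std::map<std::string, uint64_t>& ids) {
  ids[path] = nextId++;
  for (const SchemaNode& child : node.children) {
    // Assumed convention: nested fields are addressed by a dotted path.
    std::string childPath = path.empty() ? child.name : path + "." + child.name;
    assignTypeIds(child, childPath, nextId, ids);
  }
}

// Map every dotted field path to its type id; the root ("") gets id 0.
std::map<std::string, uint64_t> buildNameToTypeId(const SchemaNode& root) {
  std::map<std::string, uint64_t> ids;
  uint64_t nextId = 0;
  assignTypeIds(root, "", nextId, ids);
  return ids;
}

// ACID-style layout: five metadata fields plus a nested "row" struct.
// The two fields inside "row" are illustrative only.
SchemaNode makeAcidLikeSchema() {
  SchemaNode row{"row", {leaf("id"), leaf("name")}};
  return SchemaNode{"",
                    {leaf("operation"), leaf("originalTransaction"),
                     leaf("bucket"), leaf("rowId"),
                     leaf("currentTransaction"), row}};
}
```

With this layout, `buildNameToTypeId(makeAcidLikeSchema())` maps `row` to 6 and `row.name` to 8. The user data sits entirely under one top-level field, so selecting by top-level column alone cannot pick out `row.name`.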
[jira] [Created] (ORC-1097) ORC c++ library should provide version of memory managed outside
agoodboy created ORC-1097:
-----------------------------

    Summary: ORC c++ library should provide version of memory managed outside
    Key: ORC-1097
    URL: https://issues.apache.org/jira/browse/ORC-1097
    Project: ORC
    Issue Type: New Feature
    Components: C++
    Reporter: agoodboy
Re: [VOTE] Release Apache ORC 1.6.13 (RC0)
+1

- C++ build and test on Mac
- Java build and test on Mac env with JDK11
- Checksum and signature checked

Cheers,
Yiqun

Dongjoon Hyun wrote on Tue, Jan 18, 2022 at 12:38:

> I'll start with my +1.
>
> RC1 is tested like the following.
>
> - Checksum and signature checked.
> - All C++/Java build and UT passed.
> - Docker test passed like Apache ORC 1.6.12
>   (ubuntu18, ubuntu20, centos7, centos8, debian9, debian10)
> - Apache Spark Integration Test passed.
>   https://github.com/dongjoon-hyun/spark/pull/79 (with Apache Spark branch-3.2)
>
> Please note the following.
>
> 1. Like the other ORC 1.6.x and older versions,
>    there is a known C++ compiler issue on Debian 11, ORC-1041.
> 2. Unfortunately, we missed the timing for Apache Spark 3.2.1 RC2.
>    This might be a part of Apache Spark 3.2.2.
>
> Thanks,
> Dongjoon.
>
> On 2022/01/18 03:03:08 Dongjoon Hyun wrote:
> > Please vote on releasing the following candidate as Apache ORC version 1.6.13.
> >
> > [ ] +1 Release this package as Apache ORC 1.6.13
> > [ ] -1 Do not release this package because ...
> >
> > TAG:
> > https://github.com/apache/orc/releases/tag/release-1.6.13rc0
> >
> > RELEASE FILES:
> > https://dist.apache.org/repos/dist/dev/orc/orc-1.6.13-rc0
> >
> > STAGING REPOSITORY:
> > https://repository.apache.org/content/repositories/orgapacheorc-1054/
> >
> > LIST OF JIRA ISSUES:
> > https://issues.apache.org/jira/projects/ORC/versions/12351163
> > https://github.com/apache/orc/milestone/5?closed=1
> >
> > This vote will be open for 72 hours.
> >
> > Thanks,
> > Dongjoon.