[GitHub] [orc] stiga-huang opened a new pull request #1020: ORC-1098: [C++] Support specifying type ids or column names in cpp tools

2022-01-18 Thread GitBox


stiga-huang opened a new pull request #1020:
URL: https://github.com/apache/orc/pull/1020


   
   
   ### What changes were proposed in this pull request?
   
   This is a follow-up to #921. Currently the tools have options for working 
on specified top-level column fields. However, ACID ORC files usually have a 
nested structure, so type ids are needed to specify nested columns. As an 
extension, supporting column names is also helpful, so that users don't need 
to convert column names to type ids manually.
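
For illustration only, here is a minimal sketch of how dotted column names could be mapped to type ids. It relies on ORC numbering the types of a schema in pre-order, with the root struct as 0. `SchemaNode`, `buildNameToTypeId`, `typeIdOf` and `makeAcidExample` are hypothetical names that are not part of the ORC library or of this PR, and the user columns `id` and `name` under `row` are made up.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a schema node; NOT the ORC library's Type class.
// Requires C++17 (std::vector of a type that is still incomplete here).
struct SchemaNode {
  std::string name;                  // field name ("" for the root struct)
  std::vector<SchemaNode> children;  // sub-fields of a struct type
};

// Pre-order walk: a node takes the next id before any of its children,
// which matches how ORC assigns type ids (root = 0).
void assignTypeIds(const SchemaNode& node, const std::string& prefix,
                   uint64_t& nextId, std::map<std::string, uint64_t>& ids) {
  const uint64_t id = nextId++;
  const std::string path =
      prefix.empty() ? node.name : prefix + "." + node.name;
  if (!path.empty()) {
    ids[path] = id;  // the unnamed root is numbered but not recorded
  }
  for (const SchemaNode& child : node.children) {
    assignTypeIds(child, path, nextId, ids);
  }
}

// Builds a map from dotted column name to type id for a whole schema.
std::map<std::string, uint64_t> buildNameToTypeId(const SchemaNode& root) {
  std::map<std::string, uint64_t> ids;
  uint64_t nextId = 0;
  assignTypeIds(root, "", nextId, ids);
  return ids;
}

// Looks up one dotted column name; the name must exist in the map.
uint64_t typeIdOf(const std::map<std::string, uint64_t>& ids,
                  const std::string& path) {
  auto it = ids.find(path);
  assert(it != ids.end() && "unknown column path");
  return it->second;
}

// ACID-style layout: five metadata fields followed by a nested `row` struct.
// The fields inside `row` (`id`, `name`) are invented for this example.
SchemaNode makeAcidExample() {
  SchemaNode row{"row", {SchemaNode{"id", {}}, SchemaNode{"name", {}}}};
  return SchemaNode{"",
                    {SchemaNode{"operation", {}},
                     SchemaNode{"originalTransaction", {}},
                     SchemaNode{"bucket", {}},
                     SchemaNode{"rowId", {}},
                     SchemaNode{"currentTransaction", {}},
                     row}};
}
```

With this layout, `typeIdOf(buildNameToTypeId(makeAcidExample()), "row.id")` resolves a nested column that a top-level field index alone cannot address.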
   
   This PR extracts the option-parsing code into ToolsHelper so that similar 
C++ tools can share the same option set.
   
   ### Why are the changes needed?
   
   It makes the tools more useful in practice.
   
   ### How was this patch tested?
   
   Added unit tests for the new options.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@orc.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Created] (ORC-1098) [C++] Support specifying column names and type ids in tools

2022-01-18 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1098:
---

 Summary: [C++] Support specifying column names and type ids in tools
 Key: ORC-1098
 URL: https://issues.apache.org/jira/browse/ORC-1098
 Project: ORC
  Issue Type: Improvement
  Components: C++, tools
Reporter: Quanlong Huang
Assignee: Quanlong Huang


This is a follow-up to ORC-1012. Currently the tools have options for working 
on specified top-level column fields. However, ACID ORC files usually have a 
nested structure, so type ids are needed to specify nested columns. As an 
extension, supporting column names is also helpful, so that users don't need 
to convert column names to type ids manually.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ORC-1097) ORC c++ library should provide version of memory managed outside

2022-01-18 Thread agoodboy (Jira)
agoodboy created ORC-1097:
-

 Summary: ORC c++ library should provide version of memory managed outside
 Key: ORC-1097
 URL: https://issues.apache.org/jira/browse/ORC-1097
 Project: ORC
  Issue Type: New Feature
  Components: C++
Reporter: agoodboy








Re: [VOTE] Release Apache ORC 1.6.13 (RC0)

2022-01-18 Thread Yiqun Zhang
+1

- C++ build and test on Mac
- Java build and test on Mac env with JDK11
- Checksum and signature checked

Cheers,
Yiqun

Dongjoon Hyun wrote on Tue, Jan 18, 2022 at 12:38:

> I'll start with my +1.
>
> RC0 was tested as follows.
>
> - Checksum and signature checked.
> - All C++/Java build and UT passed.
> - Docker test passed like Apache ORC 1.6.12
>   (ubuntu18, ubuntu20, centos7, centos8, debian9, debian10)
> - Apache Spark Integration Test passed.
>   https://github.com/dongjoon-hyun/spark/pull/79 (with Apache Spark
> branch-3.2)
>
> Please note the following.
>
> 1. Like the other ORC 1.6.x and older versions,
> there is a known C++ compiler issue on Debian 11, ORC-1041.
> 2. Unfortunately, we missed the timing for Apache Spark 3.2.1 RC2.
> This might be a part of Apache Spark 3.2.2.
>
> Thanks,
> Dongjoon.
>
> On 2022/01/18 03:03:08 Dongjoon Hyun wrote:
> > Please vote on releasing the following candidate as Apache ORC version
> > 1.6.13.
> >
> > [ ] +1 Release this package as Apache ORC 1.6.13
> > [ ] -1 Do not release this package because ...
> >
> > TAG:
> > https://github.com/apache/orc/releases/tag/release-1.6.13rc0
> >
> > RELEASE FILES:
> > https://dist.apache.org/repos/dist/dev/orc/orc-1.6.13-rc0
> >
> > STAGING REPOSITORY:
> > https://repository.apache.org/content/repositories/orgapacheorc-1054/
> >
> > LIST OF JIRA ISSUES:
> > https://issues.apache.org/jira/projects/ORC/versions/12351163
> > https://github.com/apache/orc/milestone/5?closed=1
> >
> > This vote will be open for 72 hours.
> >
> > Thanks,
> > Dongjoon.
> >
>