Chunyang Wen created ORC-97:
-------------------------------

             Summary: Support column name selection in ReaderOptions
                 Key: ORC-97
                 URL: https://issues.apache.org/jira/browse/ORC-97
             Project: Orc
          Issue Type: Improvement
          Components: C++
    Affects Versions: 1.2.0
            Reporter: Chunyang Wen


After orc-92 patch, column id selection is supported. But actually select 
sub-type by name is more useful.

In my project, we use period(.) to separate nested field names.

<s1:struct<s2:struct<int1: int>>>
we choose int1 by s1.s2.int1 which will be passed 
include(std::list<std:string>).

In my implementation: first I build a map for name and column id, and then   
direct the function call to includeTypes. If this is a candidate solution, I 
will provide a patch for review soon.

When a sub-type is selected, all his child types should be selected also, as 
O'Malley pointed out in orc-92.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to