[jira] [Commented] (ARROW-10219) [C++] csv::TableReader column names, Read() arguments

Weston Pace (Jira) Wed, 23 Jun 2021 10:41:04 -0700


    [ https://issues.apache.org/jira/browse/ARROW-10219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368379#comment-17368379 ]


Weston Pace commented on ARROW-10219:
-------------------------------------

It would probably be column_names and not schema.  The table reader can do late 
inference, so it may not know the final schema until the entire table has been 
read.  Adding column_names, however, should be pretty straightforward.
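
To illustrate the distinction, here is a minimal standard-library sketch. It does not use the Arrow API: SplitLine, ReadColumnNames, InferTypes and InferredType are hypothetical names invented for this example, and the parser ignores quoting, escaping and trailing empty fields. It only shows that the names are known after the header line, while an inferred type can still change on the last row.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper (not Arrow): split one CSV line on commas.
// No support for quoting, escaping, or trailing empty fields.
std::vector<std::string> SplitLine(const std::string& line) {
  std::vector<std::string> fields;
  std::string field;
  std::istringstream stream(line);
  while (std::getline(stream, field, ',')) fields.push_back(field);
  return fields;
}

// Column names only require reading the header line.
std::vector<std::string> ReadColumnNames(std::istream& input) {
  std::string header;
  if (!std::getline(input, header)) return {};
  return SplitLine(header);
}

enum class InferredType { kInt64, kString };

// True if the text is an optional '-' followed by one or more digits.
bool LooksLikeInt(const std::string& text) {
  if (text.empty()) return false;
  std::size_t i = (text[0] == '-') ? 1 : 0;
  if (i == text.size()) return false;
  for (; i < text.size(); ++i) {
    if (text[i] < '0' || text[i] > '9') return false;
  }
  return true;
}

// Every column starts as int64 and is demoted to string as soon as one
// value fails to parse, so the result is final only after the last row.
std::vector<InferredType> InferTypes(std::istream& input,
                                     std::size_t num_columns) {
  std::vector<InferredType> types(num_columns, InferredType::kInt64);
  std::string line;
  while (std::getline(input, line)) {
    const std::vector<std::string> fields = SplitLine(line);
    for (std::size_t i = 0; i < fields.size() && i < num_columns; ++i) {
      if (!LooksLikeInt(fields[i])) types[i] = InferredType::kString;
    }
  }
  return types;
}

// Usage example: the second column looks numeric until the final row.
void RunExample() {
  std::istringstream input("id,value\n1,10\n2,20\n3,n/a\n");
  const std::vector<std::string> names = ReadColumnNames(input);
  assert(names.size() == 2);  // known after reading one line
  const std::vector<InferredType> types = InferTypes(input, names.size());
  assert(types[0] == InferredType::kInt64);
  assert(types[1] == InferredType::kString);  // decided by the last row
}
```

In this sketch ReadColumnNames consumes a single line, whereas InferTypes has to scan every remaining row before its answer can be trusted.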

> [C++] csv::TableReader column names, Read() arguments
> -----------------------------------------------------
>
>                 Key: ARROW-10219
>                 URL: https://issues.apache.org/jira/browse/ARROW-10219
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Neal Richardson
>            Assignee: Weston Pace
>            Priority: Major
>             Fix For: 5.0.0
>
>
> Some feature requests:
> * csv::TableReader {{column_names}} method, and/or {{schema}} method. This 
> will (in most cases) require IO to get these from the file, but that's fine. 
> There are use cases (we've seen in R) where it would help to be able to get 
> the names from the file (e.g. when you specify column types, it's a map of 
> column name to type, so you can't currently specify types without also 
> specifying names)
> * Add Read(std::vector<int>) like how feather (and parquet?) have so that you 
> don't have to parse and allocate columns you don't want.
> cc [~apitrou] [~romainfrancois]
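
The second request, reading only selected columns by index, could look roughly like the sketch below. This is an assumption-laden illustration using only the standard library, not the Arrow API: ReadColumns and Column are hypothetical names, the parser does no quoting or escaping, values are kept as strings, and rows with missing fields would leave the output columns ragged.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

using Column = std::vector<std::string>;

// Hypothetical stand-in for a Read(std::vector<int>) overload (not Arrow).
// Storage is allocated only for the requested columns; fields belonging
// to any other column are tokenized and then dropped.
std::vector<Column> ReadColumns(std::istream& input,
                                const std::vector<int>& column_indices) {
  std::vector<Column> columns(column_indices.size());
  std::string line;
  std::getline(input, line);  // skip the header line
  while (std::getline(input, line)) {
    std::istringstream row(line);
    std::string field;
    for (int index = 0; std::getline(row, field, ','); ++index) {
      // Store the field in every output slot that requested this index.
      for (std::size_t out = 0; out < column_indices.size(); ++out) {
        if (column_indices[out] == index) columns[out].push_back(field);
      }
    }
  }
  return columns;
}

// Usage example: request the third and first columns, in that order.
void RunProjectionExample() {
  std::istringstream input("a,b,c\n1,2,3\n4,5,6\n");
  const std::vector<Column> columns = ReadColumns(input, {2, 0});
  assert(columns.size() == 2);
  assert(columns[0] == (Column{"3", "6"}));
  assert(columns[1] == (Column{"1", "4"}));
}
```

The output follows the order of the requested indices rather than the file order, which is one possible design choice; a real implementation would also skip type conversion for the unselected columns.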



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
