[
https://issues.apache.org/jira/browse/ARROW-10219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368379#comment-17368379
]
Weston Pace commented on ARROW-10219:
-------------------------------------
It would probably be column_names and not schema. The table reader can do late
inference so it may not know the final schema until the final table is read.
But column_names should be pretty straightforward to add.
> [C++] csv::TableReader column names, Read() arguments
> -----------------------------------------------------
>
> Key: ARROW-10219
> URL: https://issues.apache.org/jira/browse/ARROW-10219
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Neal Richardson
> Assignee: Weston Pace
> Priority: Major
> Fix For: 5.0.0
>
>
> Some feature requests:
> * csv::TableReader {{column_names}} method, and/or {{schema}} method. This
> will (in most cases) require IO to get these from the file, but that's fine.
> There are use cases (we've seen in R) where it would help to be able to get
> the names from the file (e.g. when you specify column types, it's a map of
> column name to type, so you can't currently specify types without also
> specifying names)
> * Add Read(std::vector<int>) like how feather (and parquet?) have so that you
> don't have to parse and allocate columns you don't want.
> cc [~apitrou] [~romainfrancois]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)