[ 
https://issues.apache.org/jira/browse/ARROW-3401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-3401:
--------------------------------
    Fix Version/s:     (was: 0.12.0)
                   0.13.0

> [C++] Pluggable statistics collector API for unconvertible CSV values
> ---------------------------------------------------------------------
>
>                 Key: ARROW-3401
>                 URL: https://issues.apache.org/jira/browse/ARROW-3401
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>             Fix For: 0.13.0
>
>
> It would be useful to collect statistics (e.g. distinct value counts) about
> the values in a column of a CSV file that cannot be converted to the desired
> data type.
> When conversion fails, the converters could call into an abstract API such as
> {code}
> statistics_->CannotConvert(token, size);
> {code}
> or something similar.
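
To make the proposal above concrete, here is a minimal sketch of what such a pluggable collector might look like. Everything in it is an assumption for illustration: the class names ConversionStatistics and DistinctValueCounter, the (const char*, size_t) signature of CannotConvert, and the toy int64 converter are hypothetical and are not an existing Arrow API.

```cpp
#include <cassert>
#include <cerrno>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical abstract interface. The name and signature are assumptions
// modeled on the call shown in the issue, not an existing Arrow API.
class ConversionStatistics {
 public:
  virtual ~ConversionStatistics() = default;
  // Called by a converter each time a token cannot be converted.
  virtual void CannotConvert(const char* token, std::size_t size) = 0;
};

// Example collector: counts occurrences of each distinct unconvertible value.
class DistinctValueCounter : public ConversionStatistics {
 public:
  void CannotConvert(const char* token, std::size_t size) override {
    ++counts_[std::string(token, size)];
  }

  const std::unordered_map<std::string, std::int64_t>& counts() const {
    return counts_;
  }

 private:
  std::unordered_map<std::string, std::int64_t> counts_;
};

// Toy stand-in for a CSV int64 converter (assumption: the real converters
// are more involved). Returns false for empty tokens, trailing garbage,
// or out-of-range values.
inline bool TryConvertInt64(const std::string& token, std::int64_t* out) {
  if (token.empty()) return false;
  errno = 0;
  char* end = nullptr;
  const long long value = std::strtoll(token.c_str(), &end, 10);
  if (errno == ERANGE || end != token.c_str() + token.size()) return false;
  *out = value;
  return true;
}

// Converts a column of tokens, reporting every failure to the collector.
inline std::vector<std::int64_t> ConvertColumn(
    const std::vector<std::string>& tokens, ConversionStatistics* statistics) {
  assert(statistics != nullptr);
  std::vector<std::int64_t> values;
  for (const std::string& token : tokens) {
    std::int64_t value = 0;
    if (TryConvertInt64(token, &value)) {
      values.push_back(value);
    } else {
      statistics_call:
      statistics->CannotConvert(token.data(), token.size());
    }
  }
  return values;
}
```

Because the converter only sees the abstract interface, a caller could swap the distinct-value counter for any other collector (a plain failure count, a sample of bad values) without touching the conversion code.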



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
