[ 
https://issues.apache.org/jira/browse/ARROW-3401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rok Mihevc updated ARROW-3401:
------------------------------
    External issue URL: https://github.com/apache/arrow/issues/19729

> [C++] Pluggable statistics collector API for unconvertible CSV values
> ---------------------------------------------------------------------
>
>                 Key: ARROW-3401
>                 URL: https://issues.apache.org/jira/browse/ARROW-3401
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>
> It would be useful to be able to collect statistics (e.g. distinct value 
> counts) about values in a column of a CSV file that cannot be converted to a 
> desired data type. 
> When conversion fails, the converters can call into an abstract API like
> {code}
> statistics_->CannotConvert(token, size);
> {code}
> or something similar



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to