Wes McKinney created ARROW-3401:
-----------------------------------

             Summary: [C++] Pluggable statistics collector API for 
unconvertible CSV values
                 Key: ARROW-3401
                 URL: https://issues.apache.org/jira/browse/ARROW-3401
             Project: Apache Arrow
          Issue Type: New Feature
          Components: C++
            Reporter: Wes McKinney
             Fix For: 0.12.0


It would be useful to be able to collect statistics (e.g. distinct value 
counts) about values in a column of a CSV file that cannot be converted to a 
desired data type. 

When conversion fails, the converters can call into an abstract API like

{code}
statistics_->CannotConvert(token, size);
{code}

or something similar



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to