Wes McKinney created ARROW-3401: ----------------------------------- Summary: [C++] Pluggable statistics collector API for unconvertible CSV values Key: ARROW-3401 URL: https://issues.apache.org/jira/browse/ARROW-3401 Project: Apache Arrow Issue Type: New Feature Components: C++ Reporter: Wes McKinney Fix For: 0.12.0
It would be useful to be able to collect statistics (e.g. distinct value counts) about values in a column of a CSV file that cannot be converted to a desired data type. When conversion fails, the converters can call into an abstract API like {code} statistics_->CannotConvert(token, size); {code} or something similar -- This message was sent by Atlassian JIRA (v7.6.3#76005)