Andrew Lamb created ARROW-12339: ----------------------------------- Summary: [Rust][DataFusion] COUNT DISTINCT does not support for `Boolean` Key: ARROW-12339 URL: https://issues.apache.org/jira/browse/ARROW-12339 Project: Apache Arrow Issue Type: Improvement Components: Rust - DataFusion Reporter: Andrew Lamb
If you try to run a `COUNT (DISTINCT ..)` query on a float column you get the following error: thread 'tokio-runtime-worker' panicked at 'Unexpected DataType for list', datafusion/src/scalar.rs:342:22 Reproducer: {code} echo "foo,1.23" > /tmp/foo.csv ./target/debug/datafusion-cli > CREATE EXTERNAL TABLE t (a varchar, b float) STORED AS CSV LOCATION > '/tmp/foo.csv'; 0 rows in set. Query took 0 seconds. > select count(distinct a) from t; +-------------------+ | COUNT(DISTINCT a) | +-------------------+ | 1 | +-------------------+ 1 rows in set. Query took 0 seconds. > select count(distinct b) from t; thread 'tokio-runtime-worker' panicked at 'Unexpected DataType for list', datafusion/src/scalar.rs:342:22 note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace ArrowError(ExternalError(Canceled)) {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)