F Méthot created DRILL-4609:
-------------------------------

             Summary: Select true,true,true from ... does not always output 
true,true,true
                 Key: DRILL-4609
                 URL: https://issues.apache.org/jira/browse/DRILL-4609
             Project: Apache Drill
          Issue Type: Bug
          Components: Client - CLI, Query Planning & Optimization, Storage - 
Writer
    Affects Versions: 1.6.0, 1.5.0
         Environment: Linux Redhat
tested in cluster (hdfs) and embedded mode
            Reporter: F Méthot


Running a simple "select true, true, true from table" does not output
true,true,true on every generated row.

Steps to reproduce:
Generate a simple CSV file:

for i in {1..1000000}; do echo "Allo"; done > /users/fmethot/test.csv

Open a fresh Drill CLI session.

To make validation easier, switch the output format to CSV:

alter session set `store.format`='csv' 

Generate a table like this:

create table TEST_OUT as (select true,true,true,true from 
dfs.`/users/fmethot/test.csv')

Check content of /users/fmethot/test.csv
You will find false values in there!
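
A quick way to check the CTAS result is to count the rows that contain a false
value. The sketch below is only an illustration: count_false_rows is a
hypothetical helper, and the directory passed to it is an assumption, since
the location of the CSV files written for TEST_OUT depends on the writable
workspace in use.

```shell
# Hypothetical helper: count the rows that contain "false" in the CSV files
# of a CTAS output directory. The directory argument is an assumption; pass
# wherever your workspace stores the files written for TEST_OUT.
count_false_rows() {
  dir="$1"
  # Concatenate every CSV file in the directory and count matching lines.
  # grep -c prints 0 (with a non-zero exit status) when nothing matches,
  # so "|| true" keeps the function usable under "set -e".
  cat "$dir"/*.csv | grep -ci 'false' || true
}
```

Calling count_false_rows with the TEST_OUT directory should print 0 if every
row is true,true,true,true; any other number is the count of affected rows.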


If you generate another table the same way in the same session, the values
will most likely be correct (all true). We can only reproduce this on the
first CTAS run.

We tested this select pattern after we realized that our custom boolean UDFs
(as well as those provided with Drill, such as "ilike") were not producing
deterministic results: the same input generated seemingly random boolean
output. We hope that fixing this ticket will also fix our issue with boolean
UDFs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
