Ramana Inukonda Nagaraj created DRILL-1131:
----------------------------------------------

             Summary: Drill should ignore files in starting with . _
                 Key: DRILL-1131
                 URL: https://issues.apache.org/jira/browse/DRILL-1131
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
            Reporter: Ramana Inukonda Nagaraj


Files containing . and _ as the first characters are ignored by hive and others 
are these are typically logs and status files written out by tools like 
mapreduce. Drill should not read them when querying a directory containing a 
list of parquet files.

Currently it fails with the error:
message: "Failure while setting up Foreman. < AssertionError:[ Internal error: 
Error while applying rule DrillPushProjIntoScan, args 
[rel#78:ProjectRel.NONE.ANY([]).[](child=rel#15:Subset#1.ENUMERABLE.ANY([]).[],p_partkey=$1,p_type=$2),
 rel#8:EnumerableTableAccessRel.ENUMERABLE.ANY([]).[](table=[dfs, 
drillTestDirDencTpchSF100, part])] ] < DrillRuntimeException:[ 
java.io.IOException: Could not read footer: java.io.IOException: Could not read 
footer for file com.mapr.fs.MapRFileStatus@99c9d45e ] < IOException:[ Could not 
read footer: java.io.IOException: Could not read footer for file 
com.mapr.fs.MapRFileStatus@99c9d45e ] < IOException:[ Could not read footer for 
file com.mapr.fs.MapRFileStatus@99c9d45e ] < IOException:[ Open failed for 
file: /drill/testdata/dencSF100/part/.impala_insert_staging, error: Invalid 
argument (22) ]"




--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to