Amit Katti created DRILL-1172:
---------------------------------

             Summary: Unable to query Hive tables from Drill on CDH4
                 Key: DRILL-1172
                 URL: https://issues.apache.org/jira/browse/DRILL-1172
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Hive
         Environment: CDH4, CDH5, vanilla hadoop
            Reporter: Amit Katti


I am running Drill on CDH4.
I have created a Hive Storage plugin as follows:
{code}
{
  "type" : "hive",
  "enabled" : true,
  "configProps" : {
    "hive.metastore.uris" : "thrift://10.10.30.156:9083",
    "fs.default.name" : "hdfs://10.10.30.156:8020/",
    "hive.metastore.sasl.enabled" : "false"
  }
}
{code}

I connect to sqlline as follows:
 /opt/drill/apache-drill-1.0.0-m2-incubating-SNAPSHOT/bin/sqlline -u 
"jdbc:drill:schema=hive;zk=10.10.30.156:2181"

When I run a query against a hive table it errors out as follows:
{code}
0: jdbc:drill:schema=hive> select * from `student`;
Query failed: org.apache.drill.exec.rpc.RpcException: Remote failure while 
running query.[error_id: "1a2b3324-66bf-46cd-8353-2c5a7d52c2aa"
endpoint {
  address: "perfnode156.perf.lab"
  user_port: 31010
  control_port: 31011
  data_port: 31012
}
error_type: 0
message: "Failure while setting up Foreman. < AssertionError:[ Internal error: 
Error while applying rule DrillPushProjIntoScan, args 
[rel#5:ProjectRel.NONE.ANY([]).[](child=rel#4:Subset#0.ENUMERABLE.ANY([]).[],rownum=$0,name=$1,age=$2,gpa=$3,studentnum=$4),
 rel#2:EnumerableTableAccessRel.ENUMERABLE.ANY([]).[](table=[hive, student])] ] 
< IllegalArgumentException:[ Wrong FS: 
hdfs://perfnode156.perf.lab:8020/user/hive/warehouse/student, expected: 
file:/// ]"
]
Error: exception while executing query (state=,code=0)
{code}

I have even tried setting the below property in the hive storage plugin but it 
still errors out:
"fs.defaultFS" : "hdfs://10.10.30.156:8020/"

The exception in the drillbit.log is 
{code}
2014-07-22 23:46:16,056 [4454e978-5f18-440e-97c8-1f01cf4475cf:foreman] ERROR 
o.a.drill.exec.work.foreman.Foreman - Error 
07265edc-756e-4aa2-9bef-a244501a072e: Failure while setting up Foreman.
java.lang.IllegalArgumentException: Wrong FS: 
hdfs://perfnode156.perf.lab:8020/user/hive/warehouse/student, expected: file:///
        at org.apache.hadoop.fs.FileSystem.checkPath(FileSystem.java:627) 
~[hadoop-common-2.0.0-cdh4.7.0.jar:na]
        at 
org.apache.hadoop.fs.RawLocalFileSystem.pathToFile(RawLocalFileSystem.java:69) 
~[hadoop-common-2.0.0-cdh4.7.0.jar:na]
        at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:464)
 ~[hadoop-common-2.0.0-cdh4.7.0.jar:na]
        at 
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:380) 
~[hadoop-common-2.0.0-cdh4.7.0.jar:na]
        at org.apache.hadoop.fs.FileSystem.exists(FileSystem.java:1380) 
~[hadoop-common-2.0.0-cdh4.7.0.jar:na]
        at 
org.apache.drill.exec.store.hive.HiveScan.getSplits(HiveScan.java:164) 
~[drill-storage-hive-core-1.0.0-m2-incubating-SNAPSHOT.jar:1.0.0-m2-incubating-SNAPSHOT]
        at org.apache.drill.exec.store.hive.HiveScan.<init>(HiveScan.java:121) 
~[drill-storage-hive-core-1.0.0-m2-incubating-SNAPSHOT.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.store.hive.HiveStoragePlugin.getPhysicalScan(HiveStoragePlugin.java:75)
 
~[drill-storage-hive-core-1.0.0-m2-incubating-SNAPSHOT.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.store.hive.HiveStoragePlugin.getPhysicalScan(HiveStoragePlugin.java:39)
 
~[drill-storage-hive-core-1.0.0-m2-incubating-SNAPSHOT.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.store.AbstractStoragePlugin.getPhysicalScan(AbstractStoragePlugin.java:53)
 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.planner.logical.DrillTable.getGroupScan(DrillTable.java:54)
 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.planner.logical.DrillPushProjIntoScan.onMatch(DrillPushProjIntoScan.java:53)
 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.eigenbase.relopt.volcano.VolcanoRuleCall.onMatch(VolcanoRuleCall.java:221) 
~[optiq-core-0.7-20140710.204128-10.jar:na]
        at 
org.eigenbase.relopt.volcano.VolcanoPlanner.findBestExp(VolcanoPlanner.java:653)
 ~[optiq-core-0.7-20140710.204128-10.jar:na]
        at 
net.hydromatic.optiq.prepare.PlannerImpl.transform(PlannerImpl.java:271) 
~[optiq-core-0.7-20140710.204128-10.jar:na]
        at 
org.apache.drill.exec.planner.sql.handlers.DefaultSqlHandler.convertToDrel(DefaultSqlHandler.java:136)
 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.planner.sql.handlers.DefaultSqlHandler.getPlan(DefaultSqlHandler.java:116)
 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.planner.sql.DrillSqlWorker.getPlan(DrillSqlWorker.java:129)
 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at org.apache.drill.exec.work.foreman.Foreman.runSQL(Foreman.java:400) 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at org.apache.drill.exec.work.foreman.Foreman.run(Foreman.java:216) 
~[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
org.apache.drill.exec.work.WorkManager$RunnableWrapper.run(WorkManager.java:242)
 
[drill-java-exec-1.0.0-m2-incubating-SNAPSHOT-rebuffed.jar:1.0.0-m2-incubating-SNAPSHOT]
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_55]
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_55]
        at java.lang.Thread.run(Thread.java:744) [na:1.7.0_55]
{code}

In the 
incubator-drill/contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java
 class, the error is occurring because the FileSystem object being created in 
the getSplits() method is pointing to file:///
This FileSystem object is being created from a JobConf object.
The contents of the JobConf object are:
Configuration: core-default.xml, core-site.xml, mapred-default.xml, 
mapred-site.xml, yarn-default.xml, yarn-site.xml, hdfs-default.xml, 
hdfs-site.xml

The fs.default.name is set to file:/// in 
/opt/cloudera/parcels/CDH-4.7.0-1.cdh4.7.0.p0.40/share/doc/hadoop-2.0.0+1604/hadoop-project-dist/hadoop-common/core-default.xml.
However changing that to hdfs://10.10.30.156:8020/ doesn't make any difference.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to