Manuel Godbert created TEZ-3461:
-----------------------------------

             Summary: Tez not working in local mode for M/R jobs
                 Key: TEZ-3461
                 URL: https://issues.apache.org/jira/browse/TEZ-3461
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Manuel Godbert


I have map/reduce jobs that work as expected within YARN, and I want to see if 
Tez can help me improving their performance. Alas, I am experiencing issues and 
I want to understand what happens, to see if I can adapt my code or if I can 
suggest Tez enhancements. For this I need to be able to debug jobs from within 
eclipse, with breakpoints in Tez source code etc.

I am working on a linux (ubuntu) platform
I use the latest Tez version I found, i.e. 0.9.0-SNAPSHOT (also tried with 
0.7.0)
I have set up the hortonworks mini dev cluster 
https://github.com/hortonworks/mini-dev-cluster
I am trying to run the basic WordCount2 code found here 
https://hadoop.apache.org/docs/r2.7.2/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapReduceTutorial.html#Example:_WordCount_v2.0
I added the following code to have tez running locally:
    conf.set("mapreduce.framework.name", "yarn-tez");
    conf.setBoolean("tez.local.mode", true);
    conf.set("fs.default.name", "file:///");
    conf.setBoolean("tez.runtime.optimize.local.fetch", true);

And I am getting the following error:

2016-09-27 18:32:34 Running Dag: dag_1474992804027_0003_1
2016-09-27 18:32:34 Running Dag: dag_1474992804027_0003_1
Exception in thread "main" java.lang.NullPointerException
        at 
org.apache.tez.client.LocalClient.getApplicationReport(LocalClient.java:153)
        at 
org.apache.tez.dag.api.client.rpc.DAGClientRPCImpl.getAppReport(DAGClientRPCImpl.java:231)
        at 
org.apache.tez.dag.api.client.rpc.DAGClientRPCImpl.createAMProxyIfNeeded(DAGClientRPCImpl.java:251)
        at 
org.apache.tez.dag.api.client.rpc.DAGClientRPCImpl.getDAGStatus(DAGClientRPCImpl.java:96)
        at 
org.apache.tez.dag.api.client.DAGClientImpl.getDAGStatusViaAM(DAGClientImpl.java:360)
        at 
org.apache.tez.dag.api.client.DAGClientImpl.getDAGStatusInternal(DAGClientImpl.java:220)
        at 
org.apache.tez.dag.api.client.DAGClientImpl.getDAGStatus(DAGClientImpl.java:268)
        at 
org.apache.tez.dag.api.client.MRDAGClient.getDAGStatus(MRDAGClient.java:58)
        at 
org.apache.tez.mapreduce.client.YARNRunner.getJobStatus(YARNRunner.java:710)
        at 
org.apache.tez.mapreduce.client.YARNRunner.submitJob(YARNRunner.java:650)
        at 
org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:240)
        at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1290)
        at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1287)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
        at org.apache.hadoop.mapreduce.Job.submit(Job.java:1287)
        at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1308)
        at WordCount2.main(WordCount2.java:136)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to