Better error message in pig_*.log
---------------------------------
Key: PIG-2429
URL: https://issues.apache.org/jira/browse/PIG-2429
Project: Pig
Issue Type: Bug
Affects Versions: 0.9.0
Reporter: Anitha Raju
Priority: Minor
Hi,
The below script sometimes fails giving null error message and some times with
correct error message in the pig_*.log
Script
{code}
register 'a.py' using jython as test;
A = load 'test.txt' as (x:chararray);
B = foreach A generate
test.a(x) as y;
C = group B by (y);
store C into 'C';
{code}
where a.py is
{code}
@outputSchema("n:chararray")
def a(n):
n = n.lower();
return n;
{code}
input- test.txt
{code}
a
{code}
I have not put null check in a.py so that the script would error out.
This script when run with 0.9 version, fails printing in the pig_*.log either
null error message or the correct error message
Null Error message
===============================================================================
Pig Stack Trace
---------------
ERROR 2244: Job failed, hadoop does not return any error message
org.apache.pig.backend.executionengine.ExecException: ERROR 2244: Job failed,
hadoop does not return any error message
at
org.apache.pig.tools.grunt.GruntParser.executeBatch(GruntParser.java:139)
at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:192)
at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:164)
at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:81)
at org.apache.pig.Main.run(Main.java:561)
at org.apache.pig.Main.main(Main.java:111)
================================================================================
Correct Error message
===================
Backend error message
---------------------
org.apache.pig.backend.executionengine.ExecException: ERROR 0: Error executing
function
at
org.apache.pig.scripting.jython.JythonFunction.exec(JythonFunction.java:106)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOperators.POUserFunc.getNext(POUserFunc.java:216)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOperators.POUserFunc.getNext(POUserFunc.java:305)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.getNext(PhysicalOperator.java:322)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POForEach.processPlan(POForEach.java:332)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POForEach.getNext(POForEach.java:284)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.processInput(PhysicalOperator.java:290)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POLocalRearrange.getNext(POLocalRearrange.java:256)
at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapBase.runPipeline(PigGenericMapBase.java:267)
at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapBase.map(PigGenericMapBase.java:262)
at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapBase.map(PigGenericMapBase.java:64)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at org.apache.hadoop.mapred.Child$4.run(Child.java:261)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
at org.apache.hadoop.mapred.Child.main(Child.java:255)
Caused by: Traceback (most recent call last):
File "a.py", line 5, in n
AttributeError: 'NoneType' object has no attribute 'lower'
at org.python.core.PyException.fillInStackTrace(PyException.java:70)
at java.lang.Throwable.<init>(Throwable.java:181)
at java.lang.Exception.<init>(Exception.java:29)
at java.lang.RuntimeException.<init>(RuntimeException.java:32)
at org.python.core.PyException.<init>(PyException.java:46)
at org.python.core.PyException.<init>(PyException.java:43)
at org.python.core.PyException.<init>(PyException.java:61)
at org.python.core.Py.AttributeError(Py.java:145)
at org.python.core.PyObject.noAttributeError(PyObject.java:936)
at org.python.core.PyObject.__getattr__(PyObject.java:931)
at org.python.pycode._pyx3.n$1(a.py:6)
at org.python.pycode._pyx3.call_function(a.py)
at org.python.core.PyTableCode.call(PyTableCode.java:165)
at org.python.core.PyBaseCode.call(PyBaseCode.java:297)
at org.python.core.PyFunction.__call__(PyFunction.java:370)
at org.python.core.PyFunction.__call__(PyFunction.java:360)
at org.python.core.PyFunction.__call__(PyFunction.java:355)
at
org.apache.pig.scripting.jython.JythonFunction.exec(JythonFunction.java:103)
... 18 more
=======================
Putting a couple of sysout in the code, i found it was happening when in
Launcher.getStats
------------------------------
TaskReport[] mapRep = jobClient.getMapTaskReports(MRJobID);
System.out.println("going to get error message");
getErrorMessages(mapRep, "map", errNotDbg, pigContext);
System.out.println("got the error message");
totalHadoopTimeSpent += computeTimeSpent(mapRep);
------------------------------
whenever mapRed becomes null, it gives null error message.
Can it be made better?
Regards,
Anitha
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira