Hi, I am having problems running a very simple cpython udf with Pig 0.12, Python 2.7.3, and Hadoop 1.2.1.
I have the following cpython udf: from pig_util import outputSchema @outputSchema("as:int") def square(num): if num == None: return None return ((num) * (num)) And then in my pig script: a = load '/etc/passwd' using PigStorage(':'); register 'test.py' using org.apache.pig.scripting.streaming.python.PythonScriptEngine as myfuncs; b = foreach a generate myfuncs.square(3); dump b I get the error: java.lang.Exception: org.apache.pig.impl.streaming.StreamingUDFException: LINE : KeyError: 'square' at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:354) Caused by: org.apache.pig.impl.streaming.StreamingUDFException: LINE : KeyError: 'square' I also tried to run it in MapReduce mode but I still get the same error. Any ideas? Thanks, Nezih