bug with input file joining optimization in Pig
-----------------------------------------------
Key: PIG-2266
URL: https://issues.apache.org/jira/browse/PIG-2266
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.9.0
Reporter: Joseph Adler
In
src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/MRCompiler.java,
the function hasTooManyInputFiles instantiated a LoadFunc instance, then calls
setLocation before calling setUDFContextSignature. This is inconsistent with
the documentation for the LoadFunc interface (see
http://pig.apache.org/docs/r0.9.0/api/org/apache/pig/LoadFunc.html#setUDFContextSignature(java.lang.String)).
(We've written UDFs that assume that setUDFContextSignature is called first.)
I think you can fix this by adding
loader.setUDFContextSignature(ld.getSignature());
Before
loader.setLocation(location, job);
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira