[ 
https://issues.apache.org/jira/browse/PIG-2266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13098408#comment-13098408
 ] 

Joseph Adler commented on PIG-2266:
-----------------------------------

Index: MRCompiler.java
===================================================================
--- MRCompiler.java     (revision 1165764)
+++ MRCompiler.java     (working copy)
@@ -1353,7 +1353,8 @@
                             .instantiateFuncFromSpec(ld.getLFile()
                                     .getFuncSpec());
                             Job job = new Job(conf);
-                            loader.setLocation(location, job);
+                            loader.setUDFContextSignature(ld.getSignature()); 
+                           loader.setLocation(location, job);
                             InputFormat inf = loader.getInputFormat();
                             List<InputSplit> splits = 
inf.getSplits(HadoopShims.cloneJobContext(job));
                             List<List<InputSplit>> results = MapRedUtil


> bug with input file joining optimization in Pig
> -----------------------------------------------
>
>                 Key: PIG-2266
>                 URL: https://issues.apache.org/jira/browse/PIG-2266
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.9.0
>            Reporter: Joseph Adler
>
> In 
> src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/MRCompiler.java,
>  the function hasTooManyInputFiles instantiated a LoadFunc instance, then 
> calls setLocation before calling setUDFContextSignature. This is inconsistent 
> with the documentation for the LoadFunc interface (see 
> http://pig.apache.org/docs/r0.9.0/api/org/apache/pig/LoadFunc.html#setUDFContextSignature(java.lang.String)).
>  (We've written UDFs that assume that setUDFContextSignature is called first.)
> I think you can fix this by adding 
>    loader.setUDFContextSignature(ld.getSignature());
> Before
>    loader.setLocation(location, job);

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to