[ 
https://issues.apache.org/jira/browse/PIG-894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12754883#action_12754883
 ] 

Ankur commented on PIG-894:
---------------------------

Is empty inputs referring to relation - l ('students.txt')  or f (filter l by 1 
== 2). I am seeing a similar issue where the sampler produces an empty file 
when the number of records in the relation being sorted in too low ( < 4 ). 

> order-by fails when input is empty
> ----------------------------------
>
>                 Key: PIG-894
>                 URL: https://issues.apache.org/jira/browse/PIG-894
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Thejas M Nair
>
> grunt> l = load 'students.txt' ;
> grunt> f = filter l by 1 == 2;
> grunt> o = order f by $0 ;
> grunt> dump o;
> This results in 3 MR jobs . The 2nd (sampling) MR creates empty sample file, 
> and 3rd MR (order-by) fails with following error in Map job -
> java.lang.RuntimeException: java.lang.RuntimeException: Empty samples file
>       at 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.partitioners.WeightedRangePartitioner.configure(WeightedRangePartitioner.java:104)
>       at 
> org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:58)
>       at 
> org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:82)
>       at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.(MapTask.java:348)
>       at org.apache.hadoop.mapred.MapTask.run(MapTask.java:193)
>       at 
> org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2207)
> Caused by: java.lang.RuntimeException: Empty samples file
>       at 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.partitioners.WeightedRangePartitioner.configure(WeightedRangePartitioner.java:89)
>       ... 5 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to