Pig implements its own input split. It's really a list of underlying input
splits. Take a look at PigSplit.java-
https://github.com/apache/pig/blob/trunk/src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/PigSplit.java


On Tue, Jun 17, 2014 at 2:02 PM, Brian Stempin <[email protected]> wrote:

> This was where I started.  I created a class that extends
> CombineFileInputFormat and uses a LineRecordReader.  I don't know if this
> is a bug, but somewhere under the covers, the protocol gets removed from
> the URI and its assumed that the path is an HDFS path.  This causes an
> exception, of course.
>
> I took a look through the Pig source code to see if Pig uses a similar
> tactic to what I was trying, but my search came up dry.
>
> Brian
>

Reply via email to