Pig implements its own input split, which is really just a wrapper around a list of underlying input splits. Take a look at PigSplit.java:
https://github.com/apache/pig/blob/trunk/src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/PigSplit.java
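
To make the "list of underlying splits" idea concrete, here is a rough, stdlib-only sketch. It is not the actual PigSplit code and it does not use the Hadoop API: Split, FileChunk, and CompositeSplit are hypothetical names I made up for illustration, and the s3n paths are placeholders. The composite reports the summed length and the union of the hosts of the splits it wraps.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class CompositeSplitSketch {

    /** Hypothetical stand-in for an input split; NOT Hadoop's InputSplit. */
    interface Split {
        long getLength();
        String[] getLocations();
    }

    /** Hypothetical single-file chunk. The path keeps its full URI, scheme included. */
    static final class FileChunk implements Split {
        private final String path;
        private final long length;
        private final String[] hosts;

        FileChunk(String path, long length, String... hosts) {
            this.path = path;
            this.length = length;
            this.hosts = hosts.clone();
        }

        String getPath() { return path; }
        @Override public long getLength() { return length; }
        @Override public String[] getLocations() { return hosts.clone(); }
    }

    /** One logical split that simply wraps a list of underlying splits. */
    static final class CompositeSplit implements Split {
        private final List<Split> wrapped;

        CompositeSplit(List<Split> wrapped) {
            this.wrapped = new ArrayList<>(wrapped);
        }

        List<Split> getWrapped() { return wrapped; }

        /** Total size is the sum of the wrapped splits' sizes. */
        @Override public long getLength() {
            long total = 0;
            for (Split s : wrapped) {
                total += s.getLength();
            }
            return total;
        }

        /** Union of the wrapped splits' hosts, in first-seen order. */
        @Override public String[] getLocations() {
            Set<String> hosts = new LinkedHashSet<>();
            for (Split s : wrapped) {
                hosts.addAll(Arrays.asList(s.getLocations()));
            }
            return hosts.toArray(new String[0]);
        }
    }

    public static void main(String[] args) {
        CompositeSplit split = new CompositeSplit(Arrays.asList(
                new FileChunk("s3n://example-bucket/part-0000", 10L, "host1"),
                new FileChunk("s3n://example-bucket/part-0001", 5L, "host1", "host2")));
        System.out.println("length = " + split.getLength());
        System.out.println("hosts  = " + Arrays.toString(split.getLocations()));
    }
}
```

The real class has more to deal with (serialization, configuration, and so on), so treat this only as a picture of the wrapping idea.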
On Tue, Jun 17, 2014 at 2:02 PM, Brian Stempin <[email protected]> wrote:

> This was where I started. I created a class that extends
> CombineFileInputFormat and uses a LineRecordReader. I don't know if this
> is a bug, but somewhere under the covers, the protocol gets removed from
> the URI and it's assumed that the path is an HDFS path. This causes an
> exception, of course.
>
> I took a look through the Pig source code to see if Pig uses a similar
> tactic to what I was trying, but my search came up dry.
>
> Brian
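
For what it's worth, the symptom described above (the protocol disappears and the path is then treated as an HDFS path) is what you would see if some code kept only the path component of a URI and later re-qualified it against the default filesystem. I have not confirmed that this is what happens inside CombineFileInputFormat; the sketch below only shows the general mechanism using java.net.URI. The s3n bucket, the namenode address, and the helper names pathOnly and requalify are all made up for illustration.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class SchemeLossSketch {

    /** Keeps only the path component; scheme and authority are dropped. */
    static String pathOnly(URI uri) {
        return uri.getPath();
    }

    /** Re-qualifies a bare path against a default filesystem URI (hypothetical helper). */
    static URI requalify(String path, URI defaultFs) throws URISyntaxException {
        return new URI(defaultFs.getScheme(), defaultFs.getAuthority(), path, null, null);
    }

    public static void main(String[] args) throws URISyntaxException {
        // Placeholder values, not taken from the original report.
        URI original = URI.create("s3n://example-bucket/logs/part-0000");
        URI defaultFs = URI.create("hdfs://namenode:8020");

        String bare = pathOnly(original);           // scheme and bucket are gone here
        URI misread = requalify(bare, defaultFs);   // now looks like an HDFS path

        System.out.println("original: " + original);
        System.out.println("bare:     " + bare);
        System.out.println("misread:  " + misread);
    }
}
```

If something like this is going on, comparing the fully qualified path that goes into the input format with the one that reaches the record reader should show where the scheme is lost.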
