[
https://issues.apache.org/jira/browse/PIG-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Koji Noguchi updated PIG-3179:
------------------------------
Attachment: pig-3179-v02.patch
Changed based on Rohini's suggestion.
Added an extra line that prints the number of input splits.
{noformat}
PigSplit contains 11 wrappedSplits.
Input-split: file=hdfs://abc.def.com:8020/tmp/hij/part-r-00032.bz2
start-offset=0 length=11814548
Input-split: file=hdfs://abc.def.com:8020/tmp/hij/part-r-00033.bz2
start-offset=0 length=11953088
Input-split: file=hdfs://abc.def.com:8020/tmp/hij/part-r-00034.bz2
start-offset=0 length=12122182
Input-split: file=hdfs://abc.def...
...
{noformat}
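For readers without the patch at hand, here is a minimal sketch of how a header like the one above could be assembled. It is not the code from pig-3179-v02.patch: the class `SplitHeaderSketch`, the holder `SplitInfo`, and the method `format` are hypothetical names, and plain Java types stand in for the Pig and Hadoop split classes. The line layout simply mirrors the sample as it appears in this message.

```java
import java.util.Arrays;
import java.util.List;

/** Illustrative stand-in only; not the code from the attached patch. */
final class SplitHeaderSketch {

    /** Hypothetical holder for the three fields shown in the sample output. */
    static final class SplitInfo {
        final String file;
        final long start;
        final long length;

        SplitInfo(String file, long start, long length) {
            this.file = file;
            this.start = start;
            this.length = length;
        }
    }

    /** Builds one count line, then two lines for every wrapped split. */
    static String format(List<SplitInfo> splits) {
        StringBuilder sb = new StringBuilder();
        sb.append("PigSplit contains ").append(splits.size())
          .append(" wrappedSplits.\n");
        for (SplitInfo s : splits) {
            // Every split is listed, not just the first one.
            sb.append("Input-split: file=").append(s.file).append('\n');
            sb.append("start-offset=").append(s.start)
              .append(" length=").append(s.length).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<SplitInfo> splits = Arrays.asList(
            new SplitInfo("hdfs://abc.def.com:8020/tmp/hij/part-r-00032.bz2", 0L, 11814548L),
            new SplitInfo("hdfs://abc.def.com:8020/tmp/hij/part-r-00033.bz2", 0L, 11953088L));
        System.out.print(format(splits));
    }
}
```

The point of the loop is that a failure while reading a later file in a combined split can still be matched to a file name in the task log.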
> Task Information Header only prints out the first split for each task
> ---------------------------------------------------------------------
>
> Key: PIG-3179
> URL: https://issues.apache.org/jira/browse/PIG-3179
> Project: Pig
> Issue Type: Improvement
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Priority: Trivial
> Attachments: pig-3179-v01.patch, pig-3179-v02.patch
>
>
> When a task's PigSplit contains more than one wrappedSplit, it only logs the
> first split's file info.
> When debugging, I saw
> {noformat}
> ===== Task Information Header =====
> Command: bash ....
> Start time: Mon Feb 11 16:41:21 UTC 2013
> Input-split file: hdfs://abc.bcd.efg:8020/tmp/hij/part-r-00000.bz2
> Input-split start-offset: 0Input-split length: 11854247
> {noformat}
> but the actual error was happening while reading part-r-00007.bz2. It would
> have been nice if the log showed all the info that the task was going to read.