[
https://issues.apache.org/jira/browse/TEZ-3440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17809302#comment-17809302
]
Ayush Saxena commented on TEZ-3440:
-----------------------------------
Hi [~nroberts], [~jeagles]
We are having discussion around the file in TEZ-4533 added in this ticket for
test, that looks to be a violation of apache by laws. Can you folks share some
pointers, how this file was generated or any other pointers
> Shuffling to memory can get out-of-sync when fetching multiple compressed map
> outputs
> -------------------------------------------------------------------------------------
>
> Key: TEZ-3440
> URL: https://issues.apache.org/jira/browse/TEZ-3440
> Project: Apache Tez
> Issue Type: Bug
> Reporter: Nathan Roberts
> Assignee: Nathan Roberts
> Priority: Major
> Fix For: 0.7.2, 0.9.0, 0.8.5
>
> Attachments: TEZ-3440-v1.patch, TEZ-3440.patch
>
>
> Haven't verified yet but certainly looks like tez needs same fix as
> MAPREDUCE-5308 in IFile.
> Specifically saw this because downstream tasks were reporting enough fetch
> failures that long-running upstream tasks had to be re-run, which makes job
> run for much longer than it needs.
> Usually shows itself as an "Invalid map id" error on a multi-map fetch on
> part 2-n (i.e. never the first one).
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)