[
https://issues.apache.org/jira/browse/PIG-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rohini Palaniswamy updated PIG-4373:
------------------------------------
Description: duplicate jars get added to distributed cache
[~daijy],
The patch fixes OOZIE-3300, but not the original issue of this jira. We can
move this patch to a different jira if the intent is to just fix it for Hadoop
3. Still there is one issue with the patch. It makes every resource type as
APPLICATION instead of PUBLIC or PRIVATE which will impact cluster performance.
[~jlowe] already asked us to fix that in TezResourceManager for the other
resources we ship as he saw lot of churn in our clusters. Making it for all the
files from Oozie as well, will make it worse.
Jason was fine with rolling back the change in Hadoop and marked MAPREDUCE-7118
a Blocker for Hadoop 3 releases. Just needs some other Hadoop PMC to chime in
and +1. Does not make sense to introduce an unwanted backward incompatibility
for Mapreduce which is slowly marching towards end of life. So we can postpone
it on the pig side (and do the proper fix) and have your hadoop team pull
MAPREDUCE-7118 instead.
> Implement PIG-3861 in Tez
> -------------------------
>
> Key: PIG-4373
> URL: https://issues.apache.org/jira/browse/PIG-4373
> Project: Pig
> Issue Type: Improvement
> Components: tez
> Affects Versions: 0.14.0
> Reporter: Rohini Palaniswamy
> Assignee: Daniel Dai
> Priority: Major
> Labels: MissingFeature
> Fix For: 0.18.0
>
> Attachments: PIG-4373_1.patch
>
>
> duplicate jars get added to distributed cache
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)