[jira] [Commented] (HADOOP-19091) Add support for Tez to MagicS3GuardCommitter

Steve Loughran (Jira) Fri, 01 Mar 2024 05:14:06 -0800


    [ 
https://issues.apache.org/jira/browse/HADOOP-19091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17822555#comment-17822555
 ]


Steve Loughran commented on HADOOP-19091:
-----------------------------------------

1. can you find out why a different job id is being passed down. What is the 
hive jira where they made that decision? because the spark design "just use the 
current time" was simply a choice of the easiest unique value -which ended up 
breaking when two tasks were launched in the same second.
2. magic committer will pick up the job id as passed in through the config. 

bq. The post-commitJob cleanup does delete the files since it deletes 
everything under the __magic directory instead of looking under the job dir,

not on trunk, it now supports parallel jobs with their own __magic-$jobID 
paths. so no cleanup

[~vnarayanan7] stick the patch up as a Github pr. do run the hadoop-aws 
integration tests and tell us which endpoint/region you tested against. 

> Add support for Tez to MagicS3GuardCommitter
> --------------------------------------------
>
>                 Key: HADOOP-19091
>                 URL: https://issues.apache.org/jira/browse/HADOOP-19091
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: fs/s3
>    Affects Versions: 3.3.6
>         Environment: Pig 17/Hive 3.1.3 with Hadoop 3.3.3 on AWS EMR 6-12.0
>            Reporter: Venkatasubrahmanian Narayanan
>            Assignee: Venkatasubrahmanian Narayanan
>            Priority: Major
>         Attachments: 0001-AWS-Hive-Changes.patch, 
> 0002-HIVE-27698-Backport-of-HIVE-22398-Remove-legacy-code.patch, 
> HADOOP-19091-HIVE-WIP.patch
>
>
> The MagicS3GuardCommitter assumes that the JobID of the task is the same as 
> that of the job's application master when writing/reading the .pendingset 
> file. This assumption is not valid when running with Tez, which creates 
> slightly different JobIDs for tasks and the application master.
>  
> While the MagicS3GuardCommitter is intended only for MRv2, it mostly works 
> fine with an MRv1 wrapper with Hive/Pig (with some minor changes to Hive) run 
> in MR mode. This issue only crops up when running queries with the Tez 
> execution engine. I can upload a patch to Hive 3.1 to reproduce this error on 
> EMR if needed.
>  
> Fixing this will probably require work from both Tez and Hadoop, wanted to 
> start a discussion here so we can figure out how exactly we go about this.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

[jira] [Commented] (HADOOP-19091) Add support for Tez to MagicS3GuardCommitter

Reply via email to