mridulm commented on code in PR #37922:
URL: https://github.com/apache/spark/pull/37922#discussion_r990763769


##########
common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/RemoteBlockPushResolver.java:
##########
@@ -393,6 +393,22 @@ public void applicationRemoved(String appId, boolean cleanupLocalDirs) {
     }
   }
 
+  @Override
+  public void removeShuffleMerge(String appId, int shuffleId, int shuffleMergeId) {
+    AppShuffleInfo appShuffleInfo = validateAndGetAppShuffleInfo(appId);
+    AppShuffleMergePartitionsInfo partitionsInfo = appShuffleInfo.shuffles.remove(shuffleId);

Review Comment:
   I added `deleteMergedFiles` above since `closeAndDeleteOutdatedPartitions` loses the reference to existing files once they are finalized [1], as @wankunde has observed in his current PR.
   
   Why is this not an issue currently?
   
   a) We are not currently deleting files when they are finalized - `closeAndDeleteOutdatedPartitions` is mainly trying to clean up open fds. It deletes files only if they were not finalized (and so no one can read them - they are partial).
   b) During application termination, we remove the parent directories and delete the files recursively - so nothing is left once we are done.
   
   
   With this PR, we will need to change the behavior to also delete older/existing files even if they are finalized.
   That means that when a new shuffle merge id starts, the older files should get deleted.
   I have sketched an option above - from a deletion point of view, we will need to modify the existing codepaths that rely on `closeAndDeleteOutdatedPartitions` so that they delete via `deleteMergedFiles`.
   
   Thinking about it more, this can probably live within `closeAndDeleteOutdatedPartitions` itself.
   If finalized, use the `deleteMergedFiles` logic as detailed; else use the existing logic (see the sketch below)?
   It will require changes to existing code though - we can split that off as follow-up work @wankunde, so that this PR does not balloon in size.
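   
   A rough sketch of what I mean - purely illustrative, with placeholder names (the `isFinalized` flag, `deleteMergedFiles`, etc.); the actual signatures in `RemoteBlockPushResolver` will differ:
   
   ```java
   // Illustrative sketch only - names and signatures are placeholders.
   private void closeAndDeleteOutdatedPartitions(
       AppAttemptShuffleMergeId appAttemptShuffleMergeId,
       Map<Integer, AppShufflePartitionInfo> partitions,
       boolean isFinalized) {
     if (isFinalized) {
       // Finalized: no open fds to clean up; rebuild the file list from disk
       // and delete the merged data/index/meta files.
       deleteMergedFiles(appAttemptShuffleMergeId);
     } else {
       // ... keep the existing logic here: close open fds and delete the
       // partial (not yet readable) files ...
     }
   }
   ```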
   
   Thoughts, @otterc, @zhouyejoe, @wankunde?
   
   [1] Trying to keep track of the files by preserving the existing entry, as this PR currently does, does not work - it will be lost during an NM restart (it is not in the DB), it is expensive to maintain in memory over the long term, and the information is available on disk anyway - `getReducerIds()` above should suffice to recreate the full list of files.
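   
   For reference, a minimal sketch of how `deleteMergedFiles` could rebuild that list from disk - assuming a `getReducerIds()` helper as mentioned above and the existing `AppShuffleInfo` path helpers (exact names/signatures may differ):
   
   ```java
   // Illustrative sketch only: delete the merged data/index/meta files for
   // every reducer id recovered from the merge directories on disk.
   private void deleteMergedFiles(
       AppShuffleInfo appShuffleInfo, int shuffleId, int shuffleMergeId) {
     for (int reduceId : getReducerIds(appShuffleInfo, shuffleId, shuffleMergeId)) {
       // Each merged partition is backed by a data, an index and a meta file.
       appShuffleInfo.getMergedShuffleDataFile(shuffleId, shuffleMergeId, reduceId).delete();
       new File(appShuffleInfo.getMergedShuffleIndexFilePath(shuffleId, shuffleMergeId, reduceId)).delete();
       appShuffleInfo.getMergedShuffleMetaFile(shuffleId, shuffleMergeId, reduceId).delete();
     }
   }
   ```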


