[jira] [Updated] (SPARK-40459) recoverDiskStore should not stop by existing recomputed files
[ https://issues.apache.org/jira/browse/SPARK-40459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-40459: -- Parent: SPARK-41515 Issue Type: Sub-task (was: Bug) > recoverDiskStore should not stop by existing recomputed files > - > > Key: SPARK-40459 > URL: https://issues.apache.org/jira/browse/SPARK-40459 > Project: Spark > Issue Type: Sub-task > Components: Kubernetes >Affects Versions: 3.2.3, 3.3.2 >Reporter: Dongjoon Hyun >Assignee: Dongjoon Hyun >Priority: Major > Fix For: 3.3.1, 3.2.3, 3.4.0 > > > {code:java} > org.apache.commons.io.FileExistsException: File element in parameter 'null' > already exists: '...' > at org.apache.commons.io.FileUtils.requireAbsent(FileUtils.java:2587) > at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2305) > at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2283) > at > org.apache.spark.storage.DiskStore.moveFileToBlock(DiskStore.scala:150) > at > org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.saveToDiskStore(BlockManager.scala:487) > at > org.apache.spark.storage.BlockManager$BlockStoreUpdater.$anonfun$save$1(BlockManager.scala:407) > at > org.apache.spark.storage.BlockManager.org$apache$spark$storage$BlockManager$$doPut(BlockManager.scala:1445) > at > org.apache.spark.storage.BlockManager$BlockStoreUpdater.save(BlockManager.scala:380) > at > org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.save(BlockManager.scala:490) > at > org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.$anonfun$recoverDiskStore$14(KubernetesLocalDiskShuffleExecutorComponents.scala:95) > at > scala.collection.IndexedSeqOptimized.foreach(IndexedSeqOptimized.scala:36) > at > scala.collection.IndexedSeqOptimized.foreach$(IndexedSeqOptimized.scala:33) > at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:198) > at > org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.recoverDiskStore(KubernetesLocalDiskShuffleExecutorComponents.scala:91) > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-40459) recoverDiskStore should not stop by existing recomputed files
[ https://issues.apache.org/jira/browse/SPARK-40459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40459: Fix Version/s: 3.3.1 (was: 3.3.2) > recoverDiskStore should not stop by existing recomputed files > - > > Key: SPARK-40459 > URL: https://issues.apache.org/jira/browse/SPARK-40459 > Project: Spark > Issue Type: Bug > Components: Kubernetes >Affects Versions: 3.2.3, 3.3.2 >Reporter: Dongjoon Hyun >Assignee: Dongjoon Hyun >Priority: Major > Fix For: 3.4.0, 3.3.1, 3.2.3 > > > {code:java} > org.apache.commons.io.FileExistsException: File element in parameter 'null' > already exists: '...' > at org.apache.commons.io.FileUtils.requireAbsent(FileUtils.java:2587) > at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2305) > at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2283) > at > org.apache.spark.storage.DiskStore.moveFileToBlock(DiskStore.scala:150) > at > org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.saveToDiskStore(BlockManager.scala:487) > at > org.apache.spark.storage.BlockManager$BlockStoreUpdater.$anonfun$save$1(BlockManager.scala:407) > at > org.apache.spark.storage.BlockManager.org$apache$spark$storage$BlockManager$$doPut(BlockManager.scala:1445) > at > org.apache.spark.storage.BlockManager$BlockStoreUpdater.save(BlockManager.scala:380) > at > org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.save(BlockManager.scala:490) > at > org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.$anonfun$recoverDiskStore$14(KubernetesLocalDiskShuffleExecutorComponents.scala:95) > at > scala.collection.IndexedSeqOptimized.foreach(IndexedSeqOptimized.scala:36) > at > scala.collection.IndexedSeqOptimized.foreach$(IndexedSeqOptimized.scala:33) > at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:198) > at > org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.recoverDiskStore(KubernetesLocalDiskShuffleExecutorComponents.scala:91) > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-40459) recoverDiskStore should not stop by existing recomputed files
[ https://issues.apache.org/jira/browse/SPARK-40459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-40459: -- Description: {code:java} org.apache.commons.io.FileExistsException: File element in parameter 'null' already exists: '...' at org.apache.commons.io.FileUtils.requireAbsent(FileUtils.java:2587) at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2305) at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2283) at org.apache.spark.storage.DiskStore.moveFileToBlock(DiskStore.scala:150) at org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.saveToDiskStore(BlockManager.scala:487) at org.apache.spark.storage.BlockManager$BlockStoreUpdater.$anonfun$save$1(BlockManager.scala:407) at org.apache.spark.storage.BlockManager.org$apache$spark$storage$BlockManager$$doPut(BlockManager.scala:1445) at org.apache.spark.storage.BlockManager$BlockStoreUpdater.save(BlockManager.scala:380) at org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.save(BlockManager.scala:490) at org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.$anonfun$recoverDiskStore$14(KubernetesLocalDiskShuffleExecutorComponents.scala:95) at scala.collection.IndexedSeqOptimized.foreach(IndexedSeqOptimized.scala:36) at scala.collection.IndexedSeqOptimized.foreach$(IndexedSeqOptimized.scala:33) at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:198) at org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.recoverDiskStore(KubernetesLocalDiskShuffleExecutorComponents.scala:91) {code} was: {code:java} org.apache.commons.io.FileExistsException: File element in parameter 'null' already exists: '/data/spark-1/executor-x/blockmgr-62eea9f7-d58e-40ed-af5b-91da82be8f25/36/shuffle_1_188_0.data' at org.apache.commons.io.FileUtils.requireAbsent(FileUtils.java:2587) at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2305) at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2283) at org.apache.spark.storage.DiskStore.moveFileToBlock(DiskStore.scala:150) at org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.saveToDiskStore(BlockManager.scala:487) at org.apache.spark.storage.BlockManager$BlockStoreUpdater.$anonfun$save$1(BlockManager.scala:407) at org.apache.spark.storage.BlockManager.org$apache$spark$storage$BlockManager$$doPut(BlockManager.scala:1445) at org.apache.spark.storage.BlockManager$BlockStoreUpdater.save(BlockManager.scala:380) at org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.save(BlockManager.scala:490) at org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.$anonfun$recoverDiskStore$14(KubernetesLocalDiskShuffleExecutorComponents.scala:95) at scala.collection.IndexedSeqOptimized.foreach(IndexedSeqOptimized.scala:36) at scala.collection.IndexedSeqOptimized.foreach$(IndexedSeqOptimized.scala:33) at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:198) at org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.recoverDiskStore(KubernetesLocalDiskShuffleExecutorComponents.scala:91) {code} > recoverDiskStore should not stop by existing recomputed files > - > > Key: SPARK-40459 > URL: https://issues.apache.org/jira/browse/SPARK-40459 > Project: Spark > Issue Type: Bug > Components: Kubernetes >Affects Versions: 3.2.3, 3.3.2 >Reporter: Dongjoon Hyun >Priority: Major > > {code:java} > org.apache.commons.io.FileExistsException: File element in parameter 'null' > already exists: '...' > at org.apache.commons.io.FileUtils.requireAbsent(FileUtils.java:2587) > at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2305) > at org.apache.commons.io.FileUtils.moveFile(FileUtils.java:2283) > at > org.apache.spark.storage.DiskStore.moveFileToBlock(DiskStore.scala:150) > at > org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.saveToDiskStore(BlockManager.scala:487) > at > org.apache.spark.storage.BlockManager$BlockStoreUpdater.$anonfun$save$1(BlockManager.scala:407) > at > org.apache.spark.storage.BlockManager.org$apache$spark$storage$BlockManager$$doPut(BlockManager.scala:1445) > at > org.apache.spark.storage.BlockManager$BlockStoreUpdater.save(BlockManager.scala:380) > at > org.apache.spark.storage.BlockManager$TempFileBasedBlockStoreUpdater.save(BlockManager.scala:490) > at > org.apache.spark.shuffle.KubernetesLocalDiskShuffleExecutorComponents$.$anonfun$recoverDiskStore$14(KubernetesLocalDiskShu