[
https://issues.apache.org/jira/browse/APEXCORE-590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Pradeep A. Dalvi updated APEXCORE-590:
--------------------------------------
Description:
For restarting application, we try to copy previous app state i.e. checkpoints
directory from original app. However checkpoints are not being copied due to
incorrect check of source and destination directory path.
16/12/16 13:28:32 ERROR fs.MapRFileSystem: Failed to delete path
maprfs:/user/dtadmin/datatorrent/apps/application_1481890072066_0006/checkpoints,
error: No such file or directory (2)
16/12/16 13:28:32 INFO stram.FSRecoveryHandler: Creating
maprfs:/user/dtadmin/datatorrent/apps/application_1481890072066_0006/recovery/log
16/12/16 13:28:32 INFO stram.StramClient: Ignoring
maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/events
as it already exists under
maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/events
16/12/16 13:28:32 INFO stram.StramClient: Ignoring
maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/recovery
as it already exists under
maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/recovery
16/12/16 13:28:32 INFO stram.StramClient: Ignoring
maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/checkpoints
as it already exists under
maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/checkpoints
16/12/16 13:28:32 INFO stram.StramClient: Set the environment for the
application master
was:
For restarting application, we try to copy previous app state i.e. checkpoints
directory from original app. Before doing so, we try to delete 'checkpoints'
directory from newly launched application directory. MapR-FS throws an
exception since that directory is not present yet, hence we fail to copy
checkpoints from original app.
Need to catch exception around fs.delete(checkpointPath, true); in
copyInitialState function of StramClient.
16/12/16 12:48:03 INFO stram.StramClient: Restart from
maprfs:/user/dtadmin/datatorrent/apps/application_1481890072066_0004
16/12/16 12:48:03 INFO Configuration.deprecation: io.bytes.per.checksum is
deprecated. Instead, use dfs.bytes-per-checksum
16/12/16 12:48:03 ERROR fs.MapRFileSystem: Failed to delete path
maprfs:/user/dtadmin/datatorrent/apps/application_1481890072066_0005/checkpoints,
error: No such file or directory (2)
16/12/16 12:48:03 INFO stram.FSRecoveryHandler: Creating
maprfs:/user/dtadmin/datatorrent/apps/application_1481890072066_0005/recovery/lo
> Failed to restart application on MapR
> -------------------------------------
>
> Key: APEXCORE-590
> URL: https://issues.apache.org/jira/browse/APEXCORE-590
> Project: Apache Apex Core
> Issue Type: Bug
> Reporter: Pradeep A. Dalvi
> Assignee: Pradeep A. Dalvi
>
> For restarting application, we try to copy previous app state i.e.
> checkpoints directory from original app. However checkpoints are not being
> copied due to incorrect check of source and destination directory path.
> 16/12/16 13:28:32 ERROR fs.MapRFileSystem: Failed to delete path
> maprfs:/user/dtadmin/datatorrent/apps/application_1481890072066_0006/checkpoints,
> error: No such file or directory (2)
> 16/12/16 13:28:32 INFO stram.FSRecoveryHandler: Creating
> maprfs:/user/dtadmin/datatorrent/apps/application_1481890072066_0006/recovery/log
> 16/12/16 13:28:32 INFO stram.StramClient: Ignoring
> maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/events
> as it already exists under
> maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/events
> 16/12/16 13:28:32 INFO stram.StramClient: Ignoring
> maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/recovery
> as it already exists under
> maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/recovery
> 16/12/16 13:28:32 INFO stram.StramClient: Ignoring
> maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/checkpoints
> as it already exists under
> maprfs:///user/dtadmin/datatorrent/apps/application_1481890072066_0004/checkpoints
> 16/12/16 13:28:32 INFO stram.StramClient: Set the environment for the
> application master
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)