[ 
https://issues.apache.org/jira/browse/SAMZA-2664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lakshmi Manasa Gaduputi updated SAMZA-2664:
-------------------------------------------
    Description: 
TO change the grouper factory of the job, checkpoint validation should be 
disabled (using the config = ). This ensures the new checkpoints can be written 
with the new factory and job can use the new checkpoints. However, since the 
factory is part of the checkpoint key, the older checkpoints are never cleaned 
up. This leads to job reading those old checkpoints on every deploy and failing 
checkpoint validation. to get job running, checkpoint validation needs to be 
disabled forever.

Keeping checkpoint validation disabled forever is risky, as that would ignore 
the exceptions caused by serialization errors for checkpoint key.

 

one solution is to clean out the older checkpoints when validation fails due to 
grouper factory.

  was:
TO change the grouper factory of the job, checkpoint validation should be 
disabled (using the config = ). This ensures the new checkpoints can be written 
with the new factory and job can use the new checkpoints. However, since the 
factory is part of the checkpoint key, the older checkpoints are never cleaned 
up. This leads to job reading those old checkpoints on every deploy and failing 
checkpoint validation. to get job running, checkpoint validation needs to be 
disabled forever.

 

one solution is to clean out the older checkpoints when validation fails due to 
grouper factory.


> Checkpoint validation enabled needs to be false forever if grouper factory 
> was changed.
> ---------------------------------------------------------------------------------------
>
>                 Key: SAMZA-2664
>                 URL: https://issues.apache.org/jira/browse/SAMZA-2664
>             Project: Samza
>          Issue Type: Bug
>            Reporter: Lakshmi Manasa Gaduputi
>            Assignee: Lakshmi Manasa Gaduputi
>            Priority: Major
>
> TO change the grouper factory of the job, checkpoint validation should be 
> disabled (using the config = ). This ensures the new checkpoints can be 
> written with the new factory and job can use the new checkpoints. However, 
> since the factory is part of the checkpoint key, the older checkpoints are 
> never cleaned up. This leads to job reading those old checkpoints on every 
> deploy and failing checkpoint validation. to get job running, checkpoint 
> validation needs to be disabled forever.
> Keeping checkpoint validation disabled forever is risky, as that would ignore 
> the exceptions caused by serialization errors for checkpoint key.
>  
> one solution is to clean out the older checkpoints when validation fails due 
> to grouper factory.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to