[ 
https://issues.apache.org/jira/browse/SAMZA-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ajo Thomas resolved SAMZA-2749.
-------------------------------
    Resolution: Fixed

> Startpoint bug fix
> ------------------
>
>                 Key: SAMZA-2749
>                 URL: https://issues.apache.org/jira/browse/SAMZA-2749
>             Project: Samza
>          Issue Type: Bug
>            Reporter: Alan Zhang
>            Assignee: Alan Zhang
>            Priority: Major
>          Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> Within Samza (the main class to pay attention to is OffsetManager.scala), 
> there is a bug in which a startpoint can be deleted before the startpoint 
> actually gets used for message consumption. If a container gets into this 
> situation, then the result is that the startpoint is ignored and consumption 
> will continue from the previous processed message from before the startpoint 
> was applied.
>  # Load last processed offsets and startpoints
>  # Use startpoints to register starting offsets for consumers
>  # Message processing starts, but messages for only some of the partitions 
> are received
>  # Write checkpoint using last processed offsets
>  ## If a partition did not get messages, then the last processed offset is 
> still the offset from before the standpoint.
>  # Delete startpoints
>  # Container dies (e.g. due to running out of memory)
>  # On restart, load last processed offsets (startpoints have been deleted)
>  ## The partitions that did have messages in the previous deployment will 
> have the correct checkpoint.
>  ## The partitions that did not have messages will have the checkpoint set to 
> the offset from before the startpoint was applied. This is unexpected, and it 
> means that bootstrapping is not happening for this partition.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to