stupidusername edited a comment on issue #5157:
URL: https://github.com/apache/pulsar/issues/5157#issuecomment-633390668


   I've been doing some tests using this source connector: 
[sequence_source](https://github.com/Antti-Kaikkonen/PulsarSequenceSource).
   
   I tested different combinations of `--processing-guarantees` and 
`--parallelism` and consumed the published messages.
   
   - Using `EFFECTIVELY_ONCE` and `--parallelism` set to `1` I got one sequence 
of ordered numbers.
   - Using `ATLEAST_ONCE` and `--parallelism` set to `1` I got one sequence of 
ordered numbers.
   - Using `EFFECTIVELY_ONCE` and `--parallelism` set to `4` I got one sequence 
of ordered numbers.
   - Using `ATLEAST_ONCE` and `--parallelism` set to `4` I got what it seems to 
be four sequences of numbers, each one of them starting from 0.
   - Using `EFFECTIVELY_ONCE` and `--parallelism` set to `4` with a modified 
version of the source connector that uses a different `partitionId` for each 
instance of the connector I also got four sequences of numbers.
   
   The tests were conducted on a standalone instance of Pulsar. Deduplication 
was not set.
   
   So It looks like when ussing `EFFECTIVELY_ONCE` duplicated messages are 
beign removed even though deduplication is not set on a namespace level. Is 
this observation correct?
   
   Also, I would like to know if each instance of the source connector has the 
guarantee of writing its messages in order to the topic. If that's true: How is 
it achieved?
   
   Thanks!


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to