[ https://issues.apache.org/jira/browse/BEAM-7658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17547993#comment-17547993 ]
Danny McCormick commented on BEAM-7658: --------------------------------------- This issue has been migrated to https://github.com/apache/beam/issues/19649 > Synthetic unbounded source looses (duplicates?) data while splitting > -------------------------------------------------------------------- > > Key: BEAM-7658 > URL: https://issues.apache.org/jira/browse/BEAM-7658 > Project: Beam > Issue Type: Bug > Components: testing > Reporter: Lukasz Gajowy > Priority: P3 > > This came out while creating KafkaIOIT ingesting data generated using > SyntheticUnboundedSource. Hashcode of data created by > {code:java} > .apply("Calculate hashcode", Combine.globally(new > HashingFn()).withoutDefaults()){code} > was different for 1000 records every time. When a number of splits was set to > 1 the problem disappeared (sourceOptions.forceNumInitialBundles). > -- This message was sent by Atlassian Jira (v8.20.7#820007)