[
https://issues.apache.org/jira/browse/KAFKA-9385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
kaikai.hou resolved KAFKA-9385.
-------------------------------
Resolution: Duplicate
> Connect cluster: connector task repeat like a splitbrain cluster problem
> -------------------------------------------------------------------------
>
> Key: KAFKA-9385
> URL: https://issues.apache.org/jira/browse/KAFKA-9385
> Project: Kafka
> Issue Type: Bug
> Components: KafkaConnect
> Reporter: kaikai.hou
> Priority: Major
> Attachments: 12_31_d8c7j_1.jpg
>
>
> I am using Debezium and have found a duplicate-task problem (see
> [DBZ-1573|https://issues.redhat.com/browse/DBZ-1573?jql=key%20in%20watchedIssues()]).
>
> 1. I pushed the Debezium image to our private image repository.
> 2. I deployed the Connect cluster with the following *DeploymentConfig*:
> {code:java}
> apiVersion: apps.openshift.io/v1
> kind: DeploymentConfig
> metadata:
>   annotations:
>     openshift.io/generated-by: OpenShiftWebConsole
>   creationTimestamp: '2019-10-14T07:45:41Z'
>   generation: 29
>   labels:
>     app: debezium-test-cloud
>   name: debezium-test-cloud
>   namespace: test
>   resourceVersion: '168496156'
>   selfLink: >-
>     /apis/apps.openshift.io/v1/namespaces/test/deploymentconfigs/debezium-test-cloud
>   uid: 9f4f8f4d-ee56-11e9-a5a1-00163e0e008f
> spec:
>   replicas: 2
>   selector:
>     app: debezium-test-cloud
>     deploymentconfig: debezium-test-cloud
>   strategy:
>     activeDeadlineSeconds: 21600
>     resources: {}
>     rollingParams:
>       intervalSeconds: 1
>       maxSurge: 25%
>       maxUnavailable: 25%
>       timeoutSeconds: 600
>       updatePeriodSeconds: 1
>     type: Rolling
>   template:
>     metadata:
>       annotations:
>         openshift.io/generated-by: OpenShiftWebConsole
>       creationTimestamp: null
>       labels:
>         app: debezium-test-cloud
>         deploymentconfig: debezium-test-cloud
>     spec:
>       containers:
>         - env:
>             - name: BOOTSTRAP_SERVERS
>               value: '192.168.100.228:9092'
>             - name: GROUP_ID
>               value: test-cloud
>             - name: CONFIG_STORAGE_TOPIC
>               value: base.test-cloud.config
>             - name: OFFSET_STORAGE_TOPIC
>               value: base.test-cloud.offset
>             - name: STATUS_STORAGE_TOPIC
>               value: base.test-cloud.status
>             - name: CONNECT_KEY_CONVERTER_SCHEMAS_ENABLE
>               value: 'true'
>             - name: CONNECT_VALUE_CONVERTER_SCHEMAS_ENABLE
>               value: 'true'
>             - name: CONNECT_PRODUCER_MAX_REQUEST_SIZE
>               value: '20971520'
>             - name: CONNECT_DATABASE_HISTORY_KAFKA_RECOVERY_POLL_INTERVAL_MS
>               value: '1000'
>             - name: HEAP_OPTS
>               value: '-XX:+UseContainerSupport -XX:MaxRAMPercentage=75.0'
>           image: 'registry.cn-hangzhou.aliyuncs.com/eshine/debeziumconnect:1.0.0.Beta2'
>           imagePullPolicy: IfNotPresent
>           name: debezium-test-cloud
>           ports:
>             - containerPort: 8083
>               protocol: TCP
>             - containerPort: 8778
>               protocol: TCP
>             - containerPort: 9092
>               protocol: TCP
>             - containerPort: 9779
>               protocol: TCP
>           resources:
>             limits:
>               cpu: 400m
>               memory: 1Gi
>             requests:
>               cpu: 200m
>               memory: 1Gi
>           terminationMessagePath: /dev/termination-log
>           terminationMessagePolicy: File
>           volumeMounts:
>             - mountPath: /kafka/config
>               name: debezium-test-cloud-1
>             - mountPath: /kafka/data
>               name: debezium-test-cloud-2
>             - mountPath: /kafka/logs
>               name: debezium-test-cloud-3
>       dnsPolicy: ClusterFirst
>       restartPolicy: Always
>       schedulerName: default-scheduler
>       securityContext: {}
>       terminationGracePeriodSeconds: 30
>       volumes:
>         - emptyDir: {}
>           name: debezium-test-cloud-1
>         - emptyDir: {}
>           name: debezium-test-cloud-2
>         - emptyDir: {}
>           name: debezium-test-cloud-3
>   test: false
>   triggers:
>     - type: ConfigChange
> status:
>   availableReplicas: 2
>   conditions:
>     - lastTransitionTime: '2019-11-25T06:44:30Z'
>       lastUpdateTime: '2019-11-25T06:44:44Z'
>       message: replication controller "debezium-test-cloud-15" successfully
>         rolled out
>       reason: NewReplicationControllerAvailable
>       status: 'True'
>       type: Progressing
>     - lastTransitionTime: '2019-12-31T10:06:23Z'
>       lastUpdateTime: '2019-12-31T10:06:23Z'
>       message: Deployment config has minimum availability.
>       status: 'True'
>       type: Available
>   details:
>     causes:
>       - type: Manual
>     message: manual change
>   latestVersion: 15
>   observedGeneration: 29
>   readyReplicas: 2
>   replicas: 2
>   unavailableReplicas: 0
>   updatedReplicas: 2
> {code}
> 3. The Connect cluster runs in OpenShift as one service with two pods.
> 4. Sequence of events:
> a) task_connector_1_0 and task_connector_3_0 were running in PodA;
> task_connector_2_0 was running in PodB.
> b) Then the PodA console logged the error shown in the attachment
> "12_31_d8c7j_1.jpg":
> !12_31_d8c7j_1.jpg!
> c) Then a rebalance started.
> d) However, PodB then ran all three tasks (task_connector_1_0,
> task_connector_2_0, task_connector_3_0), while PodA was still running
> task_connector_1_0 and task_connector_3_0.
> e) As a result, task_connector_1_0 and task_connector_3_0 were each
> running twice, once in each pod.
>
>
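The duplicated assignment described in step 4 can be checked mechanically once the per-pod task lists are known. The following is only a minimal sketch: the function name `find_duplicate_tasks` and the dictionary input are hypothetical, and how the task lists are collected from each pod (for example, from the worker logs) is not shown and is left as an assumption.

```python
from collections import defaultdict


def find_duplicate_tasks(tasks_by_worker):
    """Return {task: sorted list of workers} for tasks seen on more than one worker.

    `tasks_by_worker` is a hypothetical input: a mapping from a worker (pod)
    name to the task names observed running on that worker.
    """
    workers_by_task = defaultdict(set)
    for worker, tasks in tasks_by_worker.items():
        for task in tasks:
            workers_by_task[task].add(worker)
    # A task is a duplicate if more than one worker reports it.
    return {
        task: sorted(workers)
        for task, workers in workers_by_task.items()
        if len(workers) > 1
    }


# State reported in step 4 d) of the issue.
observed = {
    "PodA": ["task_connector_1_0", "task_connector_3_0"],
    "PodB": ["task_connector_1_0", "task_connector_2_0", "task_connector_3_0"],
}

duplicates = find_duplicate_tasks(observed)
for task, workers in duplicates.items():
    print(f"{task} is running on: {', '.join(workers)}")
```

With the state from the report, the sketch flags task_connector_1_0 and task_connector_3_0 as running on both pods, and leaves task_connector_2_0 alone.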
--
This message was sent by Atlassian Jira
(v8.3.4#803005)