[jira] [Updated] (KAFKA-13454) kafka has duplicate configuration information log information printing during startup
[ https://issues.apache.org/jira/browse/KAFKA-13454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangzhanchang updated KAFKA-13454: --- Description: kafka has duplicate configuration information log information printing during startup,repeated information printing will bring confusion to users. [2021-11-15 12:07:29,133] INFO KafkaConfig values: advertised.host.name = null advertised.listeners = null advertised.port = null alter.config.policy.class.name = null alter.log.dirs.replication.quota.window.num = 11 alter.log.dirs.replication.quota.window.size.seconds = 1 authorizer.class.name = ... [2021-11-15 12:07:29,161] INFO KafkaConfig values: advertised.host.name = null advertised.listeners = null advertised.port = null alter.config.policy.class.name = null alter.log.dirs.replication.quota.window.num = 11 alter.log.dirs.replication.quota.window.size.seconds = 1 authorizer.class.name = auto.create.topics.enable = true auto.leader.rebalance.enable = true background.threads = 10 > kafka has duplicate configuration information log information printing during > startup > - > > Key: KAFKA-13454 > URL: https://issues.apache.org/jira/browse/KAFKA-13454 > Project: Kafka > Issue Type: Bug >Affects Versions: 2.5.0, 3.0.0 >Reporter: zhangzhanchang >Priority: Minor > > kafka has duplicate configuration information log information printing during > startup,repeated information printing will bring confusion to users. > [2021-11-15 12:07:29,133] INFO KafkaConfig values: > advertised.host.name = null > advertised.listeners = null > advertised.port = null > alter.config.policy.class.name = null > alter.log.dirs.replication.quota.window.num = 11 > alter.log.dirs.replication.quota.window.size.seconds = 1 > authorizer.class.name = > ... > [2021-11-15 12:07:29,161] INFO KafkaConfig values: > advertised.host.name = null > advertised.listeners = null > advertised.port = null > alter.config.policy.class.name = null > alter.log.dirs.replication.quota.window.num = 11 > alter.log.dirs.replication.quota.window.size.seconds = 1 > authorizer.class.name = > auto.create.topics.enable = true > auto.leader.rebalance.enable = true > background.threads = 10 > -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (KAFKA-13454) kafka has duplicate configuration information log information printing during startup
[ https://issues.apache.org/jira/browse/KAFKA-13454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangzhanchang updated KAFKA-13454: --- Summary: kafka has duplicate configuration information log information printing during startup (was: kafka has duplicate configuration information during startup) > kafka has duplicate configuration information log information printing during > startup > - > > Key: KAFKA-13454 > URL: https://issues.apache.org/jira/browse/KAFKA-13454 > Project: Kafka > Issue Type: Bug >Affects Versions: 2.5.0, 3.0.0 >Reporter: zhangzhanchang >Priority: Minor > -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Created] (KAFKA-13454) kafka has duplicate configuration information during startup
zhangzhanchang created KAFKA-13454: -- Summary: kafka has duplicate configuration information during startup Key: KAFKA-13454 URL: https://issues.apache.org/jira/browse/KAFKA-13454 Project: Kafka Issue Type: Bug Reporter: zhangzhanchang -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Updated] (KAFKA-13454) kafka has duplicate configuration information during startup
[ https://issues.apache.org/jira/browse/KAFKA-13454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangzhanchang updated KAFKA-13454: --- Affects Version/s: 3.0.0 2.5.0 > kafka has duplicate configuration information during startup > > > Key: KAFKA-13454 > URL: https://issues.apache.org/jira/browse/KAFKA-13454 > Project: Kafka > Issue Type: Bug >Affects Versions: 2.5.0, 3.0.0 >Reporter: zhangzhanchang >Priority: Minor > -- This message was sent by Atlassian Jira (v8.20.1#820001)
[jira] [Commented] (KAFKA-13422) Even if the correct username and password are configured, when ClientBroker or KafkaClient tries to establish a SASL connection to ServerBroker, an exception is thrown
[ https://issues.apache.org/jira/browse/KAFKA-13422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17443526#comment-17443526 ] RivenSun commented on KAFKA-13422: -- Hi [~guozhang] , [~hachikuji] , [~showuon] Can you give some suggestions? Thanks. > Even if the correct username and password are configured, when ClientBroker > or KafkaClient tries to establish a SASL connection to ServerBroker, an > exception is thrown: (Authentication failed: Invalid username or password) > -- > > Key: KAFKA-13422 > URL: https://issues.apache.org/jira/browse/KAFKA-13422 > Project: Kafka > Issue Type: Bug > Components: clients, core >Affects Versions: 2.7.1, 3.0.0 >Reporter: RivenSun >Priority: Major > Attachments: CustomerAuthCallbackHandler.java, > LoginContext_login_debug.png, SaslClientCallbackHandler_handle_debug.png > > > > h1. Foreword: > When deploying a Kafka cluster with a higher version (2.7.1), I encountered > an exception of communication identity authentication failure between > brokers. In the current latest version 3.0.0, this problem can also be > reproduced. > h1. Problem recurring: > h2. 1)broker Version is 3.0.0 > h3. The content of kafka_server_jaas.conf of each broker is exactly the same, > the content is as follows: > > > {code:java} > KafkaServer { > org.apache.kafka.common.security.plain.PlainLoginModule required > username="admin" > password="kJTVDziatPgjXG82sFHc4O1EIuewmlvS" > user_admin="kJTVDziatPgjXG82sFHc4O1EIuewmlvS" > user_alice="alice"; > org.apache.kafka.common.security.scram.ScramLoginModule required > username="admin_scram" > password="admin_scram_password"; > > }; > {code} > > > h3. broker server.properties: > One of the broker configuration files is provided, and the content of the > configuration files of other brokers is only different from the localPublicIp > of advertised.listeners. > > {code:java} > broker.id=1 > broker.rack=us-east-1a > advertised.listeners=SASL_PLAINTEXT://localPublicIp:9779,SASL_SSL://localPublicIp:9889,INTERNAL_SSL://:9009,PLAIN_PLUGIN_SSL://localPublicIp:9669 > log.dirs=/asyncmq/kafka/data_1,/asyncmq/kafka/data_2 > zookeeper.connect=*** > listeners=SASL_PLAINTEXT://:9779,SASL_SSL://:9889,INTERNAL_SSL://:9009,PLAIN_PLUGIN_SSL://:9669 > listener.security.protocol.map=INTERNAL_SSL:SASL_SSL,PLAINTEXT:PLAINTEXT,SSL:SSL,SASL_PLAINTEXT:SASL_PLAINTEXT,SASL_SSL:SASL_SSL,PLAIN_PLUGIN_SSL:SASL_SSL > listener.name.plain_plugin_ssl.plain.sasl.server.callback.handler.class=org.apache.kafka.common.security.plain.internals.PlainServerCallbackHandler > #ssl config > ssl.keystore.password=*** > ssl.key.password=*** > ssl.truststore.password=*** > ssl.keystore.location=*** > ssl.truststore.location=*** > ssl.client.auth=none > ssl.endpoint.identification.algorithm= > #broker communicate config > #security.inter.broker.protocol=SASL_PLAINTEXT > inter.broker.listener.name=INTERNAL_SSL > sasl.mechanism.inter.broker.protocol=PLAIN > #sasl authentication config > sasl.kerberos.service.name=kafka > sasl.enabled.mechanisms=PLAIN,SCRAM-SHA-256,SCRAM-SHA-512,GSSAPI > delegation.token.master.key=*** > delegation.token.expiry.time.ms=8640 > delegation.token.max.lifetime.ms=31536 > {code} > > > Then start all brokers at the same time. Each broker has actually been > started successfully, but when establishing a connection between the > controller node and all brokers, the identity authentication has always > failed. The connection between brokers cannot be established normally, > causing the entire Kafka cluster to be unable to provide external services. > h3. The server log keeps printing abnormally like crazy: > The real ip sensitive information of the broker in the log, I use ** > instead of here > > {code:java} > [2021-10-29 14:16:19,831] INFO [SocketServer listenerType=ZK_BROKER, > nodeId=3] Started socket server acceptors and processors > (kafka.network.SocketServer) > [2021-10-29 14:16:19,836] INFO Kafka version: 3.0.0 > (org.apache.kafka.common.utils.AppInfoParser) > [2021-10-29 14:16:19,836] INFO Kafka commitId: 8cb0a5e9d3441962 > (org.apache.kafka.common.utils.AppInfoParser) > [2021-10-29 14:16:19,836] INFO Kafka startTimeMs: 1635516979831 > (org.apache.kafka.common.utils.AppInfoParser) > [2021-10-29 14:16:19,837] INFO [KafkaServer id=3] started > (kafka.server.KafkaServer) > [2021-10-29 14:16:20,249] INFO [SocketServer listenerType=ZK_BROKER, > nodeId=3] Failed authentication with /** (Authentication failed: Invalid > username or password) (org.apache.kafka.common.network.Selector) > [2021-10-29 14:16:20,680] INFO
[jira] [Commented] (KAFKA-13449) Comment optimization for parameter log.cleaner.delete.retention.ms
[ https://issues.apache.org/jira/browse/KAFKA-13449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17443523#comment-17443523 ] RivenSun commented on KAFKA-13449: -- Hi [~showuon] and [~guozhang] Do you think this issue is reasonable? > Comment optimization for parameter log.cleaner.delete.retention.ms > --- > > Key: KAFKA-13449 > URL: https://issues.apache.org/jira/browse/KAFKA-13449 > Project: Kafka > Issue Type: Improvement > Components: config >Affects Versions: 3.0.0 >Reporter: RivenSun >Priority: Major > > You can view the comment of this parameter from Kafka's official website. > https://kafka.apache.org/documentation/#brokerconfigs_log.cleaner.delete.retention.ms > {code:java} > log.cleaner.delete.retention.ms > How long are delete records retained? {code} > I think it should be consistent with the comment of topic level parameter > *delete.retention.ms* . > https://kafka.apache.org/documentation/#topicconfigs_delete.retention.ms -- This message was sent by Atlassian Jira (v8.20.1#820001)
[GitHub] [kafka] mkandaswamy opened a new pull request #11494: [WIP] MINOR: Improve KafkaStreamsTest: testInitializesAndDestroysMetricsReporters
mkandaswamy opened a new pull request #11494: URL: https://github.com/apache/kafka/pull/11494 Add additional asserts for KafkaStreamsTest: testInitializesAndDestroysMetricsReporters to help diagnose if it flakily fails in the future. * MockMetricsReporter gets initialized only once during KafkaStreams construction, so make assert check stricter by ensuring initDiff is one. * Assert KafkaStreams is not running before, we validate whether MockMetricsMetricsReporter close count got incremented after streams close. ### Committer Checklist (excluded from commit message) - [ ] Verify design and implementation - [ ] Verify test coverage and CI build status - [ ] Verify documentation (including upgrade notes) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [kafka] RivenSun2 commented on pull request #11340: KAFKA-13310 : KafkaConsumer cannot jump out of the poll method, and the…
RivenSun2 commented on pull request #11340: URL: https://github.com/apache/kafka/pull/11340#issuecomment-968476724 @showuon please browse the latest commit when available. Thanks. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Commented] (KAFKA-13437) Broker parameter optimization: security.inter.broker.protocol and num.network.threads
[ https://issues.apache.org/jira/browse/KAFKA-13437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17443502#comment-17443502 ] RivenSun commented on KAFKA-13437: -- Supplement to suggestion 1: 1) {code:java} advertised.listeners=SASL_PLAINTEXT://{publicIp}:9119,SASL_SSL://{publicIp}:9229 listeners=SASL_PLAINTEXT://:9119,SASL_SSL://:9229 listener.security.protocol.map=PLAINTEXT:PLAINTEXT,SSL:SSL,SASL_PLAINTEXT:SASL_PLAINTEXT,SASL_SSL:SASL_SSL security.inter.broker.protocol=SASL_SSL {code} Expected InterBrokerListenerName is *SASL_SSL* 2) {code:java} advertised.listeners=SASL_PLAINTEXT://{publicIp}:9119,INTERNAL_SSL://:9339 listeners=SASL_PLAINTEXT://:9119,INTERNAL_SSL://:9339 listener.security.protocol.map=INTERNAL_SSL:SASL_SSL,PLAINTEXT:PLAINTEXT,SSL:SSL,SASL_PLAINTEXT:SASL_PLAINTEXT,SASL_SSL:SASL_SSL security.inter.broker.protocol=SASL_SSL {code} Expected InterBrokerListenerName is *INTERNAL_SSL* 3) {code:java} advertised.listeners=SASL_PLAINTEXT://{publicIp}:9119,INTERNAL_SSL://:9339,PLAIN_PLUGIN_SSL://{publicIp}:9449 listeners=SASL_PLAINTEXT://:9119,INTERNAL_SSL://:9339,PLAIN_PLUGIN_SSL://:9449 listener.security.protocol.map=INTERNAL_SSL:SASL_SSL,PLAINTEXT:PLAINTEXT,SSL:SSL,SASL_PLAINTEXT:SASL_PLAINTEXT,SASL_SSL:SASL_SSL,PLAIN_PLUGIN_SSL:SASL_SSL security.inter.broker.protocol=SASL_SSL {code} You can see that there are two listenerNames whose protocol is SASL_SSL: INTERNAL_SSL and PLAIN_PLUGIN_SSL. We can put all qualified listenerNames into HashSet and take out the *first element* to use as InterBrokerListenerName. The index value of the first element depends on the {color:#de350b}hash(){color} result of each element. > Broker parameter optimization: security.inter.broker.protocol and > num.network.threads > - > > Key: KAFKA-13437 > URL: https://issues.apache.org/jira/browse/KAFKA-13437 > Project: Kafka > Issue Type: Improvement > Components: core >Affects Versions: 3.0.0 >Reporter: RivenSun >Priority: Major > > h1. 1. security.inter.broker.protocol > Firstly see this parameter comment > {code:java} > security.inter.broker.protocolSecurity protocol used to communicate between > brokers. Valid values are: PLAINTEXT, SSL, SASL_PLAINTEXT, SASL_SSL. It is an > error to set this and inter.broker.listener.name properties at the same time. > {code} > We will not know from the comments, after using this configuration, the final > value of InterBrokerListenerName is the same as the value of > security.inter.broker.protocol. I originally thought it would find a suitable > listenerName from the listener.security.protocol.map configuration. > The result is: broker startup failed > > {code:java} > [2021-11-09 06:28:44,058] ERROR Exiting Kafka due to fatal exception > (kafka.Kafka$) > java.lang.IllegalArgumentException: requirement failed: > inter.broker.listener.name must be a listener name defined in > advertised.listeners. The valid options based on currently configured > listeners are SASL_PLAINTEXT,INTERNAL_SSL,PLAIN_PLUGIN_SSL > at scala.Predef$.require(Predef.scala:337) > at kafka.server.KafkaConfig.validateValues(KafkaConfig.scala:1952) > at kafka.server.KafkaConfig.(KafkaConfig.scala:1897) > at kafka.server.KafkaConfig.(KafkaConfig.scala:1394) > at kafka.Kafka$.buildServer(Kafka.scala:67) > at kafka.Kafka$.main(Kafka.scala:87) > at kafka.Kafka.main(Kafka.scala) > {code} > > > h1. 2. num.network.threads > The networkThreads corresponding to this parameter are not shared by all > listeners, but each listener will create the same number of > networkProcessors, which causes the Kafka process to open too many > unnecessary threads, which leads to a waste of resources. > for example: > listenerNameA: used for communication between brokers > listenerNameB: used to connect production messages and fetch messages on the > client side > listenerNameC: Used by Kafka operation and maintenance personnel to manage > the cluster and send control type requests, such as deleting topics or adding > partitions, etc. > So as expected, the num.network.threads of listenerNameB should be increased, > and the networkThreads of the other two listeners can be appropriately reduced > > h1. Rootcause: > 1. See "getInterBrokerListenerNameAndSecurityProtocol" method in > KafkaConfig.scala > {code:java} > private def getInterBrokerListenerNameAndSecurityProtocol: (ListenerName, > SecurityProtocol) = { > Option(getString(KafkaConfig.InterBrokerListenerNameProp)) match { > case Some(_) if > originals.containsKey(KafkaConfig.InterBrokerSecurityProtocolProp) => > throw new ConfigException(s"Only one of > ${KafkaConfig.InterBrokerListenerNameProp} and " + >
[jira] [Commented] (KAFKA-13407) Kafka controller out of service after ZK leader restart
[ https://issues.apache.org/jira/browse/KAFKA-13407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17443354#comment-17443354 ] VZhang commented on KAFKA-13407: [~ecomar] The PR attached is trying to reinitialize the Controller Context. But for this case, maybe the controller should never resign himself after ZK leader restart. Thanks. > Kafka controller out of service after ZK leader restart > --- > > Key: KAFKA-13407 > URL: https://issues.apache.org/jira/browse/KAFKA-13407 > Project: Kafka > Issue Type: Bug >Affects Versions: 2.8.0, 2.8.1 > Environment: Ubuntu 20.04 >Reporter: Daniel >Priority: Critical > > When the Zookeeper leader disappears, a new instance becomes the leader, the > instances need to reconnect to Zookeeper, but the Kafka "Controller" gets > lost in limbo state after re-establishing connection. > See below for how I manage to reproduce this over and over. > *Prerequisites* > Have a Kafka cluster with 3 instances running version 2.8.1. Figure out which > one is the Controller. I'm using Kafkacat 1.5.0 and get this info using the > `-L` flag. > Zookeeper runs with 3 instances on version 3.5.9. Figure out which one is > leader by checking > > {code:java} > echo stat | nc -v localhost 2181 > {code} > > > *Reproduce* > 1. Stop the leader Zookeeper service. > 2. Watch the logs of the Kafka Controller and ensure that it reconnects and > registers again. > > {code:java} > Oct 27 09:13:08 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:08,882] INFO > Unable to read additional data from server sessionid 0x1f2a12870003, likely > server has closed socket, closing socket connection and attempting reconnect > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] WARN > SASL configuration failed: javax.security.auth.login.LoginException: No JAAS > configuration section named 'Client' was found in specified JAAS > configuration file: '/opt/kafka/config/kafka_server_jaas.conf'. Will continue > connection to Zookeeper server without SASL authentication, if Zookeeper > server allows it. (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] INFO > Opening socket connection to server > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] ERROR > [ZooKeeperClient Kafka server] Auth failed. (kafka.zookeeper.ZooKeeperClient) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,549] INFO > Socket connection established, initiating session, client: > /10.10.85.215:39338, server: > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,569] INFO > Session establishment complete on server > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181, > sessionid = 0x1f2a12870003, negotiated timeout = 18000 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,548] INFO > [ZooKeeperClient Kafka server] Reinitializing due to auth failure. > (kafka.zookeeper.ZooKeeperClient) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,550] INFO > [PartitionStateMachine controllerId=1003] Stopped partition state machine > (kafka.controller.ZkPartitionStateMachine) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,550] INFO > [ReplicaStateMachine controllerId=1003] Stopped replica state machine > (kafka.controller.ZkReplicaStateMachine) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Shutting down > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Stopped > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Shutdown completed > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,552] INFO > [RequestSendThread controllerId=1003] Shutting down > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,552] INFO > [RequestSendThread controllerId=1003] Stopped > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,552] INFO > [RequestSendThread controllerId=1003] Shutdown completed > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]:
[jira] [Comment Edited] (KAFKA-13407) Kafka controller out of service after ZK leader restart
[ https://issues.apache.org/jira/browse/KAFKA-13407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17443353#comment-17443353 ] Vinson Zhang edited comment on KAFKA-13407 at 11/14/21, 2:27 PM: - [~Olsson] It looks like ZK auth failure triggered the Controller resign. You can try to remove the environment variable 'java.security.auth.login.config' and declare the JAAS file content in Kafka server.properties. It works on my local. Thanks. was (Author: JIRAUSER280027): [~Olsson] You can try to remove the environment variable 'java.security.auth.login.config' and declare the JAAS file content in Kafka server.properties. It looks like ZK auth failure triggered the Controller resign. It works on my local. Thanks. > Kafka controller out of service after ZK leader restart > --- > > Key: KAFKA-13407 > URL: https://issues.apache.org/jira/browse/KAFKA-13407 > Project: Kafka > Issue Type: Bug >Affects Versions: 2.8.0, 2.8.1 > Environment: Ubuntu 20.04 >Reporter: Daniel >Priority: Critical > > When the Zookeeper leader disappears, a new instance becomes the leader, the > instances need to reconnect to Zookeeper, but the Kafka "Controller" gets > lost in limbo state after re-establishing connection. > See below for how I manage to reproduce this over and over. > *Prerequisites* > Have a Kafka cluster with 3 instances running version 2.8.1. Figure out which > one is the Controller. I'm using Kafkacat 1.5.0 and get this info using the > `-L` flag. > Zookeeper runs with 3 instances on version 3.5.9. Figure out which one is > leader by checking > > {code:java} > echo stat | nc -v localhost 2181 > {code} > > > *Reproduce* > 1. Stop the leader Zookeeper service. > 2. Watch the logs of the Kafka Controller and ensure that it reconnects and > registers again. > > {code:java} > Oct 27 09:13:08 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:08,882] INFO > Unable to read additional data from server sessionid 0x1f2a12870003, likely > server has closed socket, closing socket connection and attempting reconnect > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] WARN > SASL configuration failed: javax.security.auth.login.LoginException: No JAAS > configuration section named 'Client' was found in specified JAAS > configuration file: '/opt/kafka/config/kafka_server_jaas.conf'. Will continue > connection to Zookeeper server without SASL authentication, if Zookeeper > server allows it. (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] INFO > Opening socket connection to server > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] ERROR > [ZooKeeperClient Kafka server] Auth failed. (kafka.zookeeper.ZooKeeperClient) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,549] INFO > Socket connection established, initiating session, client: > /10.10.85.215:39338, server: > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,569] INFO > Session establishment complete on server > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181, > sessionid = 0x1f2a12870003, negotiated timeout = 18000 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,548] INFO > [ZooKeeperClient Kafka server] Reinitializing due to auth failure. > (kafka.zookeeper.ZooKeeperClient) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,550] INFO > [PartitionStateMachine controllerId=1003] Stopped partition state machine > (kafka.controller.ZkPartitionStateMachine) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,550] INFO > [ReplicaStateMachine controllerId=1003] Stopped replica state machine > (kafka.controller.ZkReplicaStateMachine) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Shutting down > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Stopped > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Shutdown completed > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,552] INFO > [RequestSendThread controllerId=1003] Shutting down >
[jira] [Commented] (KAFKA-13407) Kafka controller out of service after ZK leader restart
[ https://issues.apache.org/jira/browse/KAFKA-13407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17443353#comment-17443353 ] Vinson Zhang commented on KAFKA-13407: -- [~Olsson] You can try to remove the environment variable 'java.security.auth.login.config' and declare the JAAS file content in Kafka server.properties. It looks like ZK auth failure triggered the Controller resign. It works on my local. Thanks. > Kafka controller out of service after ZK leader restart > --- > > Key: KAFKA-13407 > URL: https://issues.apache.org/jira/browse/KAFKA-13407 > Project: Kafka > Issue Type: Bug >Affects Versions: 2.8.0, 2.8.1 > Environment: Ubuntu 20.04 >Reporter: Daniel >Priority: Critical > > When the Zookeeper leader disappears, a new instance becomes the leader, the > instances need to reconnect to Zookeeper, but the Kafka "Controller" gets > lost in limbo state after re-establishing connection. > See below for how I manage to reproduce this over and over. > *Prerequisites* > Have a Kafka cluster with 3 instances running version 2.8.1. Figure out which > one is the Controller. I'm using Kafkacat 1.5.0 and get this info using the > `-L` flag. > Zookeeper runs with 3 instances on version 3.5.9. Figure out which one is > leader by checking > > {code:java} > echo stat | nc -v localhost 2181 > {code} > > > *Reproduce* > 1. Stop the leader Zookeeper service. > 2. Watch the logs of the Kafka Controller and ensure that it reconnects and > registers again. > > {code:java} > Oct 27 09:13:08 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:08,882] INFO > Unable to read additional data from server sessionid 0x1f2a12870003, likely > server has closed socket, closing socket connection and attempting reconnect > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] WARN > SASL configuration failed: javax.security.auth.login.LoginException: No JAAS > configuration section named 'Client' was found in specified JAAS > configuration file: '/opt/kafka/config/kafka_server_jaas.conf'. Will continue > connection to Zookeeper server without SASL authentication, if Zookeeper > server allows it. (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] INFO > Opening socket connection to server > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,548] ERROR > [ZooKeeperClient Kafka server] Auth failed. (kafka.zookeeper.ZooKeeperClient) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,549] INFO > Socket connection established, initiating session, client: > /10.10.85.215:39338, server: > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:10 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:10,569] INFO > Session establishment complete on server > zookeeper-kafka.service.consul.lab.aws.blue.example.net/10.10.84.12:2181, > sessionid = 0x1f2a12870003, negotiated timeout = 18000 > (org.apache.zookeeper.ClientCnxn) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,548] INFO > [ZooKeeperClient Kafka server] Reinitializing due to auth failure. > (kafka.zookeeper.ZooKeeperClient) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,550] INFO > [PartitionStateMachine controllerId=1003] Stopped partition state machine > (kafka.controller.ZkPartitionStateMachine) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,550] INFO > [ReplicaStateMachine controllerId=1003] Stopped replica state machine > (kafka.controller.ZkReplicaStateMachine) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Shutting down > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Stopped > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,551] INFO > [RequestSendThread controllerId=1003] Shutdown completed > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,552] INFO > [RequestSendThread controllerId=1003] Shutting down > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,552] INFO > [RequestSendThread controllerId=1003] Stopped > (kafka.controller.RequestSendThread) > Oct 27 09:13:11 ip-10-10-85-215 kafka[62961]: [2021-10-27 09:13:11,552] INFO > [RequestSendThread controllerId=1003] Shutdown completed >
[GitHub] [kafka] feyman2016 commented on pull request #10593: KAFKA-10800 Enhance the test for validation when the state machine creates a snapshot
feyman2016 commented on pull request #10593: URL: https://github.com/apache/kafka/pull/10593#issuecomment-968300161 @hachikuji @jsancio Just noticed this has been merged, thanks a lot for the review and also the commit! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [kafka] GuoPhilipse edited a comment on pull request #11483: KAFKA-13441: improve upgrade doc
GuoPhilipse edited a comment on pull request #11483: URL: https://github.com/apache/kafka/pull/11483#issuecomment-968294268 @dajac @hachikuji Could you kindly help take a look ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [kafka] GuoPhilipse commented on pull request #11483: KAFKA-13441: improve upgrade doc
GuoPhilipse commented on pull request #11483: URL: https://github.com/apache/kafka/pull/11483#issuecomment-968294268 @hachikuji Could you kindly help take a look ? Thanks in advance. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[GitHub] [kafka] RivenSun2 commented on pull request #11340: KAFKA-13310 : KafkaConsumer cannot jump out of the poll method, and the…
RivenSun2 commented on pull request #11340: URL: https://github.com/apache/kafka/pull/11340#issuecomment-968251877 @showuon Thanks for your suggestion. I will add more tests. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org