Greetings, Kafka specialists
A strange issue in my Kafka instance has blocked me for a couple of days.
1. I cannot produce events to [topic-test] due to a KAFKA_STORAGE_ERROR.
2. As far as I can tell, the log segment files are fine: I can dump the
log, index, and timeindex files with kafka-dump-log without seeing any error.
3. There is no error when I produce events to other topics. All Kafka log
files are on the same disk partition.
4. Restarting the Kafka instance and the ZooKeeper instance did not help.
5. I cannot find any useful information about the error in server.log, even
at TRACE level.
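
One check that the dump in step 2 does not cover is whether the broker's OS user can still write to the partition directory, since dumping a segment only requires read access. Below is a minimal sketch of such a check. It assumes the script runs as the same user as the broker. The function name and the path /var/lib/kafka/data are hypothetical; the real directory is the log.dirs value in server.properties.

```python
import os
from pathlib import Path


def check_partition_dir(log_dir, topic, partition):
    """Return a list of human-readable problems found for one partition directory."""
    part_dir = Path(log_dir) / f"{topic}-{partition}"
    if not part_dir.is_dir():
        return [f"{part_dir} does not exist or is not a directory"]

    problems = []
    # New segment and index files are created here, so the directory itself
    # must be writable and searchable by the user running the check.
    if not os.access(part_dir, os.W_OK | os.X_OK):
        problems.append(f"{part_dir} is not writable by the current user")

    # Flag every file the current user cannot both read and write.
    for entry in sorted(part_dir.iterdir()):
        if entry.is_file() and not os.access(entry, os.R_OK | os.W_OK):
            problems.append(f"{entry.name} is not readable and writable by the current user")
    return problems


# Example call (hypothetical path, replace with the log.dirs value):
# for problem in check_partition_dir("/var/lib/kafka/data", "topic-test", 0):
#     print(problem)
```

An empty result only means the permissions look sane for the user running the script; it does not rule out a hardware or filesystem fault.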
Could you please let me know whether a similar issue has been seen before?
Where should I look next to continue my investigation?
Thanks a lot!
Peter
The Kafka client log
kafka-console-producer --bootstrap-server 135.251.236.162:9092 --topic topic-test
[2022-04-28 11:12:10,925] WARN [Producer clientId=console-producer] Got error
produce response with correlation id 5 on topic-partition topic-test-0,
retrying (2 attempts left). Error: KAFKA_STORAGE_ERROR
(org.apache.kafka.clients.producer.internals.Sender)
[2022-04-28 11:12:10,925] WARN [Producer clientId=console-producer] Received
invalid metadata error in produce request on partition topic-test-0 due to
org.apache.kafka.common.errors.KafkaStorageException: Disk error when trying to
access log file on the disk.. Going to request metadata update now
(org.apache.kafka.clients.producer.internals.Sender)
[2022-04-28 11:12:11,024] WARN [Producer clientId=console-producer] Got error
produce response with correlation id 7 on topic-partition topic-test-0,
retrying (1 attempts left). Error: KAFKA_STORAGE_ERROR
(org.apache.kafka.clients.producer.internals.Sender)
[2022-04-28 11:12:11,024] WARN [Producer clientId=console-producer] Received
invalid metadata error in produce request on partition topic-test-0 due to
org.apache.kafka.common.errors.KafkaStorageException: Disk error when trying to
access log file on the disk.. Going to request metadata update now
(org.apache.kafka.clients.producer.internals.Sender)
[2022-04-28 11:12:11,127] WARN [Producer clientId=console-producer] Got error
produce response with correlation id 9 on topic-partition topic-test-0,
retrying (0 attempts left). Error: KAFKA_STORAGE_ERROR
(org.apache.kafka.clients.producer.internals.Sender)
[2022-04-28 11:12:11,127] WARN [Producer clientId=console-producer] Received
invalid metadata error in produce request on partition topic-test-0 due to
org.apache.kafka.common.errors.KafkaStorageException: Disk error when trying to
access log file on the disk.. Going to request metadata update now
(org.apache.kafka.clients.producer.internals.Sender)
[2022-04-28 11:12:11,231] ERROR Error when sending message to topic topic-test
with key: null, value: 0 bytes with error:
(org.apache.kafka.clients.producer.internals.ErrorLoggingCallback)
org.apache.kafka.common.errors.KafkaStorageException: Disk error when trying to
access log file on the disk.
[2022-04-28 11:12:11,233] WARN [Producer clientId=console-producer] Received
invalid metadata error in produce request on partition topic-test-0 due to
org.apache.kafka.common.errors.KafkaStorageException: Disk error when trying to
access log file on the disk.. Going to request metadata update now
(org.apache.kafka.clients.producer.internals.Sender)
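
When the client log is longer than the excerpt above, tallying the retry warnings per partition can confirm that only topic-test-0 is affected. This is a rough sketch, assuming the log has the same wrapped layout as the excerpt. The function name and the regular expression are illustrative and not part of any Kafka tooling.

```python
import re
from collections import Counter

# Matches the producer's retry warning once wrapped lines have been joined.
RETRY_RE = re.compile(
    r"correlation id (\d+) on topic-partition (\S+), "
    r"retrying \((\d+) attempts left\)\. Error: (\w+)"
)


def summarize_retries(log_text):
    """Count retry warnings per (topic-partition, error code)."""
    # Drop a leading '>' (console prompt or quote marker) and undo the wrapping.
    lines = [line.strip().lstrip(">").strip() for line in log_text.splitlines()]
    joined = " ".join(part for part in lines if part)

    counts = Counter()
    for _corr_id, partition, _attempts, error in RETRY_RE.findall(joined):
        counts[(partition, error)] += 1
    return counts
```

Calling summarize_retries on the text of the saved client log returns a Counter keyed by (partition, error) pairs.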
The server.log
[2022-04-27 07:41:51,203] TRACE [KafkaApi-0] Handling
request:RequestHeader(apiKey=METADATA, apiVersion=11,
clientId=console-producer, correlationId=9) --
MetadataRequestData(topics=[MetadataRequestTopic(topicId=AAAAAAAAAAAAAAAAAAAAAA,
name='topic-test')], allowAutoTopicCreation=true,
includeClusterAuthorizedOperations=false,
includeTopicAuthorizedOperations=false) from connection
135.251.236.162:9092-135.251.236.162:44194-2;securityProtocol:PLAINTEXT,principal:User:ANONYMOUS
(kafka.server.KafkaApis)
[2022-04-27 07:41:51,203] TRACE [KafkaApi-0] Sending topic metadata
MetadataResponseTopic(errorCode=0, name='topic-tst',
topicId=mgS7D7-9RZSgeEUJ3XXErw, isInternal=false,
partitions=[MetadataResponsePartition(errorCode=0, partitionIndex=0,
leaderId=0, leaderEpoch=0, replicaNodes=[0], isrNodes=[0],
offlineReplicas=[])], topicAuthorizedOperations=-2147483648) and brokers
baijin162-vnfprov:9092 (id: 0 rack: null) for correlation id 9 to client
console-producer (kafka.server.KafkaApis)
[2022-04-27 07:41:51,297] TRACE [KafkaApi-0] Handling
request:RequestHeader(apiKey=PRODUCE, apiVersion=9, clientId=console-producer,
correlationId=10) -- {acks=1,timeout=1500,partitionSizes=[topic-test-0=81]}
from connection
135.251.236.162:9092-135.251.236.162:44194-2;securityProtocol:PLAINTEXT,principal:User:ANONYMOUS
(kafka.server.KafkaApis)
[2022-04-27 07:41:51,297] TRACE [ReplicaManager broker=0] Append
[HashMap(topic-test-0 -> MemoryRecords(size=81,
buffer=java.nio.HeapByteBuffer[pos=0 lim=81 cap=84]))] to local log
(kafka.server.ReplicaManager)
[2022-04-27 07:41:51,297] DEBUG [KafkaApi-0] Produce request with correlation
id 10 from client console-producer on partition topic-test-0 failed due to
org.apache.kafka.common.errors.KafkaStorageException (kafka.server.KafkaApis)
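
The DEBUG line above carries no stack trace, so the underlying cause, if it was logged at all, may sit in an earlier entry or in another broker log file such as controller.log, state-change.log, or log-cleaner.log. Here is a small sketch that scans every log file in the broker's log4j directory for storage-related exceptions. The function name, the keyword list, and the "*.log*" file pattern are assumptions chosen for illustration.

```python
from pathlib import Path

# Exception class names worth searching for; extend as needed.
KEYWORDS = ("KafkaStorageException", "IOException")


def find_storage_errors(logs_dir, keywords=KEYWORDS):
    """Yield (file name, line number, line) for every line containing a keyword."""
    # "*.log*" also picks up rotated files such as server.log.<date suffix>.
    for path in sorted(Path(logs_dir).glob("*.log*")):
        if not path.is_file():
            continue
        with path.open(errors="replace") as handle:
            for number, line in enumerate(handle, start=1):
                if any(keyword in line for keyword in keywords):
                    yield path.name, number, line.rstrip("\n")
```

The earliest hit after broker startup is usually the most informative one, since later entries tend to repeat the symptom and not the cause.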