Nikki Thean created KAFKA-4607:
----------------------------------

             Summary: Kafka Streams allows you to provide strings with illegal 
characters for internal topic names
                 Key: KAFKA-4607
                 URL: https://issues.apache.org/jira/browse/KAFKA-4607
             Project: Kafka
          Issue Type: Bug
            Reporter: Nikki Thean
            Priority: Minor


When using the aggregate function of the Kafka Streams DSL, I supplied the 
function with a name for the underlying state store that contained whitespace. 
The stream application ran with no error message but seemed to be hanging. 

After a lot of troubleshooting, I looked in the logs on the Kafka cluster and 
noticed that it was complaining about illegal characters in the log line. It 
turns out that an internal changelog topic had been created that contained 
whitespace and this was causing both Kafka and ZK to choke. I was unable to use 
the delete-topic command line utility as it also threw the illegal character 
error. In the end I had to manually delete the topics out of ZK and do some 
stuff with the offset logs.

I am not well versed in the Kafka or Kafka Streams code, but from looking 
through it, I notice that 
[InternalTopicManager|https://github.com/apache/kafka/blob/trunk/streams/src/main/java/org/apache/kafka/streams/processor/internals/InternalTopicManager.java#L170]
 does not seem to validate the names of auto-generated internal topics beyond 
[checking for null topic 
names|https://github.com/apache/kafka/blob/trunk/streams/src/main/java/org/apache/kafka/streams/processor/internals/InternalTopicConfig.java#L38]
 rather than using Topic.validate.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to