[ https://issues.apache.org/jira/browse/KAFKA-7214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16598992#comment-16598992 ]
John Roesler commented on KAFKA-7214: ------------------------------------- Hi [~habdank], I've commented on https://issues.apache.org/jira/browse/KAFKA-6777. If the problem is long GC pauses, but the JVM never actually runs out of memory, there there would be no OOME to catch. Can you let us know what kind of GC pauses you're observing? As Guozhang said, if any pause is longer than any of the heartbeat intervals (or other keepalive configs like the poll interval), then it will cause problems. The only solution to long GC pauses is to reshape the computation, which it seems like you have done. Also, regarding: > The keyword in all those errors is: KSTREAM-SOURCE-XXXXXXXXX This is just the name of a source node in your streams topology. Such names will appear in all kinds of logs, and doesn't indicate that your problem is related to this ticket. Maybe we can relocate this discussion to a new ticket? If you do start a new ticket, can you indicate what behavior you observe? Such as, "my application crashes" or "my application rebalances frequently". Thanks, -John > Mystic FATAL error > ------------------ > > Key: KAFKA-7214 > URL: https://issues.apache.org/jira/browse/KAFKA-7214 > Project: Kafka > Issue Type: Bug > Components: streams > Affects Versions: 0.11.0.3, 1.1.1 > Reporter: Seweryn Habdank-Wojewodzki > Priority: Critical > > Dears, > Very often at startup of the streaming application I got exception: > {code} > Exception caught in process. taskId=0_1, processor=KSTREAM-SOURCE-0000000000, > topic=my_instance_medium_topic, partition=1, offset=198900203; > [org.apache.kafka.streams.processor.internals.StreamTask.process(StreamTask.java:212), > > org.apache.kafka.streams.processor.internals.AssignedTasks$2.apply(AssignedTasks.java:347), > > org.apache.kafka.streams.processor.internals.AssignedTasks.applyToRunningTasks(AssignedTasks.java:420), > > org.apache.kafka.streams.processor.internals.AssignedTasks.process(AssignedTasks.java:339), > > org.apache.kafka.streams.processor.internals.StreamThread.processAndPunctuate(StreamThread.java:648), > > org.apache.kafka.streams.processor.internals.StreamThread.runOnce(StreamThread.java:513), > > org.apache.kafka.streams.processor.internals.StreamThread.runLoop(StreamThread.java:482), > > org.apache.kafka.streams.processor.internals.StreamThread.run(StreamThread.java:459)] > in thread > my_application-my_instance-my_instance_medium-72ee1819-edeb-4d85-9d65-f67f7c321618-StreamThread-62 > {code} > and then (without shutdown request from my side): > {code} > 2018-07-30 07:45:02 [ar313] [INFO ] StreamThread:912 - stream-thread > [my_application-my_instance-my_instance-72ee1819-edeb-4d85-9d65-f67f7c321618-StreamThread-62] > State transition from PENDING_SHUTDOWN to DEAD. > {code} > What is this? > How to correctly handle it? > Thanks in advance for help. -- This message was sent by Atlassian JIRA (v7.6.3#76005)