[ 
https://issues.apache.org/jira/browse/FLUME-3086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15969211#comment-15969211
 ] 

Jeff Holoman commented on FLUME-3086:
-------------------------------------

We expect folks to be using Kafka for offset storage, and using auto-commit on 
the channel would break our at least once guarantees as auto commit cannot 
ensure delivery. 

> KafkaChannel commits consume offset to Zookeeper every time causing 
> performance problem
> ---------------------------------------------------------------------------------------
>
>                 Key: FLUME-3086
>                 URL: https://issues.apache.org/jira/browse/FLUME-3086
>             Project: Flume
>          Issue Type: Bug
>          Components: Kafka Channel
>    Affects Versions: 1.7.0
>            Reporter: JiYeon OH
>
> Currently, the AUTO_COMMIT_ENABLED setting is hard corded to 
> false(KafkaChannel.java line 285), and the commit is done every 
> transaction(KafkaChannel.java line 567). This will cause large overhead to 
> Kafka Zookeeper when consuming very large amount of data because every 
> consumer in every node will be writing to Zookeeper. 
> I actually had this problem and affected other running consumers' offsets on 
> the Zookeeper. So I suggest to change the hard coded setting 
> AUTO_COMMIT_ENABLED to true, and set a default AUTO_COMMIT_INTERVAL value, 
> and remove the commitOffsets() method. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to