[jira] [Commented] (KAFKA-260) Add audit trail to kafka

2014-10-27 Thread Stas Levin (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185220#comment-14185220
 ] 

Stas Levin commented on KAFKA-260:
--

Hi guys,

We've adopted the data model above in Aletheia 
(https://github.com/outbrain/Aletheia), an open source data delivery framework 
we've been working on here at Outbrain. 
In Aletheia we call these audit trails "Breadcrumbs", and have them generated 
on both the producer and consumer sides. We're working towards integrating the 
above-mentioned patch in order to provide a client-side dashboard.

Aletheia is by no means meant to replace Kafka; rather, it is an abstraction 
layer on top of Kafka and other messaging systems, as we point out in the wiki.
Having audit capabilities built into Kafka would be really great. In the 
meantime, you're most welcome to check out Aletheia; perhaps you'll find it 
useful, as it provides Breadcrumb generation out of the box.

-Stas

> Add audit trail to kafka
> 
>
> Key: KAFKA-260
> URL: https://issues.apache.org/jira/browse/KAFKA-260
> Project: Kafka
>  Issue Type: New Feature
>Affects Versions: 0.8.0
>Reporter: Jay Kreps
>Assignee: Jay Kreps
> Attachments: Picture 18.png, kafka-audit-trail-draft.patch
>
>
> LinkedIn has a system that does monitoring on top of our data flow to ensure 
> all data is delivered to all consumers of data. This works by having each 
> logical "tier" through which data passes produce messages to a central 
> "audit-trail" topic; these messages give a time period and the number of 
> messages that passed through that tier in that time period. Example of tiers 
> for data might be "producer", "broker", "hadoop-etl", etc. This makes it 
> possible to compare the total events for a given time period to ensure that 
> all events that are produced are consumed by all consumers.
> This turns out to be extremely useful. We also have an application that 
> "balances the books" and checks that all data is consumed in a timely 
> fashion. This gives graphs for each topic and shows any data loss and the lag 
> at which the data is consumed (if any).
> This would be an optional feature that would allow you to do this kind of 
> reconciliation automatically for all the topics kafka hosts against all the 
> tiers of applications that interact with the data.
> Some details, the proposed format of the data is JSON using the following 
> format for messages:
> {
>   "time":1301727060032,  // the timestamp at which this audit message is sent
>   "topic": "my_topic_name", // the topic this audit data is for
>   "tier":"producer", // a user-defined "tier" name
>   "bucket_start": 130172640, // the beginning of the time bucket this 
> data applies to
>   "bucket_end": 130172700, // the end of the time bucket this data 
> applies to
>   "host":"my_host_name.datacenter.linkedin.com", // the server that this was 
> sent from
>   "datacenter":"hlx32", // the datacenter this occurred in
>   "application":"newsfeed_service", // a user-defined application name
>   "guid":"51656274-a86a-4dff-b824-8e8e20a6348f", // a unique identifier for 
> this message
>   "count":43634
> }
> DISCUSSION
> Time is complex:
> 1. The audit data must be based on a timestamp in the events, not the time on 
> the machine processing the event. Using this timestamp means that all downstream 
> consumers will report audit data on the right time bucket. This means that 
> there must be a timestamp in the event, which we don't currently require. 
> Arguably we should just add a timestamp to the events, but I think it is 
> sufficient for now just to allow the user to provide a function to extract 
> the time from their events.
> 2. For counts to reconcile exactly we can only do analysis at a granularity 
> based on the least common multiple of the bucket size used by all tiers. The 
> simplest is just to configure them all to use the same bucket size. We 
> currently use a bucket size of 10 mins, but anything from 1-60 mins is 
> probably reasonable.
> For analysis purposes one tier is designated as the source tier and we do 
> reconciliation against this count (e.g. if another tier has less, that is 
> treated as lost, if another tier has more that is duplication).
> Note that this system makes false positives possible since you can lose an 
> audit message. It also makes false negatives possible since if you lose both 
> normal messages and the associated audit messages it will appear that 
> everything adds up. The latter problem is astronomically unlikely to happen 
> exactly, though.
> This would integrate into the client (producer and consumer both) in the 
> following way:
> 1. The user provides a way to get timestamps from messages (required)
> 2. The user configures the tier name, host name, datacenter name, and 
> application name as part of
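 
As a rough, illustrative sketch of the time-bucketing idea in the quoted 
description above (class and method names here are hypothetical, not part of 
the proposed patch), each tier would round the event's own timestamp down to a 
fixed bucket and count events per bucket before emitting an audit message:

{code}
// Hypothetical sketch of per-bucket counting based on the event timestamp,
// not the local clock; names are illustrative only.
import java.util.HashMap;
import java.util.Map;

public class AuditBucketCounter {
    private static final long BUCKET_MS = 10 * 60 * 1000L; // e.g. 10-minute buckets
    private final Map<Long, Long> countsByBucketStart = new HashMap<Long, Long>();

    // eventTimestampMs is extracted from the event itself (user-provided function).
    public synchronized void record(long eventTimestampMs) {
        long bucketStart = (eventTimestampMs / BUCKET_MS) * BUCKET_MS;
        Long current = countsByBucketStart.get(bucketStart);
        countsByBucketStart.put(bucketStart, current == null ? 1L : current + 1L);
    }

    // A periodic task would then emit one audit message per completed bucket
    // carrying topic, tier, bucket_start, bucket_end and the accumulated count.
}
{code}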

Re: Review Request 26885: Patch for KAFKA-1642

2014-10-27 Thread Ewen Cheslack-Postava


> On Oct. 27, 2014, 12:13 a.m., Guozhang Wang wrote:
> > clients/src/main/java/org/apache/kafka/clients/NetworkClient.java, line 122
> > 
> >
> > The comments "When connecting or connected, this handles slow/stalled 
> > connections" here are a bit misleading: after checking the code I realize 
> > connectionDelay is only triggered to determine the delay in millis after which 
> > we can re-check connectivity for a node that is not connected; hence, if the 
> > node is connected again while we are determining its delay, we just set it 
> > to MAX.
> > 
> > Instead of making it general to the KafkaClient interface, shall we 
> > just add this to the code block of line 155?

It gets triggered any time NetworkClient.ready returns false for a node. The 
obvious case is that it will return "not ready" when disconnected, but it also 
does so when connecting, or when connected but inFlightRequests.canSendMore() 
returns false (thus the mention of "slow/stalled connections"). The important 
thing is that the value returned *is* MAX_VALUE in those latter cases because 
neither one will be resolved by polling -- they both require an external event 
(connection established/failed, or an outstanding request receiving a response) 
which should wake up the event loop when there's something to do. That keeps us 
from polling unnecessarily. Previously there were conditions in which 
connections in these states could trigger busy waiting in the poll loop.

I don't think we can get the same effect just inlining the code because it uses 
state that's only available through ClusterConnectionStates, which is private 
to NetworkClient. The KafkaClient only exposes the higher level concept of 
"ready".


- Ewen


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26885/#review58575
---


On Oct. 23, 2014, 11:19 p.m., Ewen Cheslack-Postava wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/26885/
> ---
> 
> (Updated Oct. 23, 2014, 11:19 p.m.)
> 
> 
> Review request for kafka.
> 
> 
> Bugs: KAFKA-1642
> https://issues.apache.org/jira/browse/KAFKA-1642
> 
> 
> Repository: kafka
> 
> 
> Description
> ---
> 
> Fixes two issues with the computation of ready nodes and poll timeouts in
> Sender/RecordAccumulator:
> 
> 1. The timeout was computed incorrectly because it took into account all 
> nodes,
> even if they had data to send such that their timeout would be 0. However, 
> nodes
> were then filtered based on whether it was possible to send (i.e. their
> connection was still good) which could result in nothing to send and a 0
> timeout, resulting in busy looping. Instead, the timeout needs to be computed
> only using data that cannot be immediately sent, i.e. where the timeout will 
> be
> greater than 0. This timeout is only used if, after filtering by whether
> connections are ready for sending, there is no data to be sent. Other events 
> can
> wake the thread up earlier, e.g. a client reconnects and becomes ready again.
> 
> 2. One of the conditions indicating whether data is sendable is whether a
> timeout has expired -- either the linger time or the retry backoff. This
> condition wasn't accounting for both cases properly, always using the linger
> time. This means the retry backoff was probably not being respected.
> 
> KAFKA-1642 Compute correct poll timeout when all nodes have sendable data but 
> none can send data because they are in a connection backoff period.
> 
> 
> Addressing Jun's comments.
> 
> 
> Diffs
> -
> 
>   clients/src/main/java/org/apache/kafka/clients/ClusterConnectionStates.java 
> d304660f29246e9600efe3ddb28cfcc2b074bed3 
>   clients/src/main/java/org/apache/kafka/clients/KafkaClient.java 
> 29658d4a15f112dc0af5ce517eaab93e6f00134b 
>   clients/src/main/java/org/apache/kafka/clients/NetworkClient.java 
> eea270abb16f40c9f3b47c4ea96be412fb4fdc8b 
>   
> clients/src/main/java/org/apache/kafka/clients/producer/internals/RecordAccumulator.java
>  c5d470011d334318d5ee801021aadd0c000974a6 
>   
> clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java 
> 8ebe7ed82c9384b71ce0cc3ddbef2c2325363ab9 
>   clients/src/test/java/org/apache/kafka/clients/MockClient.java 
> aae8d4a1e98279470587d397cc779a9baf6fee6c 
>   
> clients/src/test/java/org/apache/kafka/clients/producer/RecordAccumulatorTest.java
>  0762b35abba0551f23047348c5893bb8c9acff14 
> 
> Diff: https://reviews.apache.org/r/26885/diff/
> 
> 
> Testing
> ---
> 
> 
> Thanks,
> 
> Ewen Cheslack-Postava
> 
>



Review Request 27232: Patch for KAFKA-559

2014-10-27 Thread Ewen Cheslack-Postava

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/27232/
---

Review request for kafka.


Bugs: KAFKA-559
https://issues.apache.org/jira/browse/KAFKA-559


Repository: kafka


Description
---

Addressing Joel's comments.


Fix naming: entires -> entries.


Only remove partitions from a group if all partitions were last modified before 
the threshold date.
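
As a rough illustration of that rule (hypothetical types and names; the actual 
change is the Scala tool listed under Diffs below), a consumer group is only 
eligible for deletion if every one of its partition entries was last modified 
before the threshold:

{code}
// Hypothetical sketch: delete a group only if all of its partition entries
// are older than the threshold date.
import java.util.Map;

public class GroupCleanupRule {
    /** lastModifiedMs maps each partition path to its last-modified time in ZK. */
    public static boolean eligibleForDeletion(Map<String, Long> lastModifiedMs, long thresholdMs) {
        for (long mtime : lastModifiedMs.values()) {
            if (mtime >= thresholdMs)
                return false; // at least one partition was updated recently: keep the group
        }
        return true;
    }
}
{code}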


Diffs
-

  core/src/main/scala/kafka/tools/CleanupObsoleteZkEntries.scala PRE-CREATION 

Diff: https://reviews.apache.org/r/27232/diff/


Testing
---


Thanks,

Ewen Cheslack-Postava



[jira] [Updated] (KAFKA-559) Garbage collect old consumer metadata entries

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ewen Cheslack-Postava updated KAFKA-559:

Attachment: KAFKA-559.patch

> Garbage collect old consumer metadata entries
> -
>
> Key: KAFKA-559
> URL: https://issues.apache.org/jira/browse/KAFKA-559
> Project: Kafka
>  Issue Type: New Feature
>Reporter: Jay Kreps
>Assignee: Tejas Patil
>  Labels: newbie, project
> Attachments: KAFKA-559.patch, KAFKA-559.v1.patch, KAFKA-559.v2.patch
>
>
> Many use cases involve transient consumers. These consumers create entries 
> under their consumer group in zk and maintain offsets there as well. There is 
> currently no way to delete these entries. It would be good to have a tool 
> that did something like
>   bin/delete-obsolete-consumer-groups.sh [--topic t1] --since [date] 
> --zookeeper [zk_connect]
> This would scan through consumer group entries and delete any that had no 
> offset update since the given date.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-559) Garbage collect old consumer metadata entries

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ewen Cheslack-Postava updated KAFKA-559:

Assignee: Ewen Cheslack-Postava  (was: Tejas Patil)
  Status: Patch Available  (was: Open)

> Garbage collect old consumer metadata entries
> -
>
> Key: KAFKA-559
> URL: https://issues.apache.org/jira/browse/KAFKA-559
> Project: Kafka
>  Issue Type: New Feature
>Reporter: Jay Kreps
>Assignee: Ewen Cheslack-Postava
>  Labels: newbie, project
> Attachments: KAFKA-559.patch, KAFKA-559.v1.patch, KAFKA-559.v2.patch
>
>
> Many use cases involve transient consumers. These consumers create entries 
> under their consumer group in zk and maintain offsets there as well. There is 
> currently no way to delete these entries. It would be good to have a tool 
> that did something like
>   bin/delete-obsolete-consumer-groups.sh [--topic t1] --since [date] 
> --zookeeper [zk_connect]
> This would scan through consumer group entries and delete any that had no 
> offset update since the given date.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-559) Garbage collect old consumer metadata entries

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185377#comment-14185377
 ] 

Ewen Cheslack-Postava commented on KAFKA-559:
-

Created reviewboard https://reviews.apache.org/r/27232/diff/
 against branch origin/trunk

> Garbage collect old consumer metadata entries
> -
>
> Key: KAFKA-559
> URL: https://issues.apache.org/jira/browse/KAFKA-559
> Project: Kafka
>  Issue Type: New Feature
>Reporter: Jay Kreps
>Assignee: Tejas Patil
>  Labels: newbie, project
> Attachments: KAFKA-559.patch, KAFKA-559.v1.patch, KAFKA-559.v2.patch
>
>
> Many use cases involve transient consumers. These consumers create entries 
> under their consumer group in zk and maintain offsets there as well. There is 
> currently no way to delete these entries. It would be good to have a tool 
> that did something like
>   bin/delete-obsolete-consumer-groups.sh [--topic t1] --since [date] 
> --zookeeper [zk_connect]
> This would scan through consumer group entries and delete any that had no 
> offset update since the given date.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-559) Garbage collect old consumer metadata entries

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185380#comment-14185380
 ] 

Ewen Cheslack-Postava commented on KAFKA-559:
-

This is an updated version of the patch by [~tejas.patil]. I'm pretty sure I've 
addressed all the issues [~jjkoshy] brought up.

> Garbage collect old consumer metadata entries
> -
>
> Key: KAFKA-559
> URL: https://issues.apache.org/jira/browse/KAFKA-559
> Project: Kafka
>  Issue Type: New Feature
>Reporter: Jay Kreps
>Assignee: Ewen Cheslack-Postava
>  Labels: newbie, project
> Attachments: KAFKA-559.patch, KAFKA-559.v1.patch, KAFKA-559.v2.patch
>
>
> Many use cases involve transient consumers. These consumers create entries 
> under their consumer group in zk and maintain offsets there as well. There is 
> currently no way to delete these entries. It would be good to have a tool 
> that did something like
>   bin/delete-obsolete-consumer-groups.sh [--topic t1] --since [date] 
> --zookeeper [zk_connect]
> This would scan through consumer group entries and delete any that had no 
> offset update since the given date.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KAFKA-1732) DumpLogSegments tool fails when path has a '.'

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)
Ewen Cheslack-Postava created KAFKA-1732:


 Summary: DumpLogSegments tool fails when path has a '.'
 Key: KAFKA-1732
 URL: https://issues.apache.org/jira/browse/KAFKA-1732
 Project: Kafka
  Issue Type: Bug
  Components: tools
Affects Versions: 0.8.1.1
Reporter: Ewen Cheslack-Postava
Priority: Minor


Using DumpLogSegments in a directory that has a '.' that isn't part of the file 
extension causes an exception:

{code}
16:48 $ time /Users/ewencp/kafka.git/bin/kafka-run-class.sh 
kafka.tools.DumpLogSegments  --file 
/Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
 --verify-index-only
Dumping 
/Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
Exception in thread "main" java.io.FileNotFoundException: 
/Users/ewencp/kafka.log (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:146)
at kafka.utils.Utils$.openChannel(Utils.scala:162)
at kafka.log.FileMessageSet.<init>(FileMessageSet.scala:74)
at 
kafka.tools.DumpLogSegments$.kafka$tools$DumpLogSegments$$dumpIndex(DumpLogSegments.scala:109)
at 
kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:80)
at 
kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:73)
at 
scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
at kafka.tools.DumpLogSegments$.main(DumpLogSegments.scala:73)
at kafka.tools.DumpLogSegments.main(DumpLogSegments.scala)
{code}
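
The truncated path in the exception ("/Users/ewencp/kafka.log") suggests the 
tool is splitting the full path on the first '.' when deriving the companion 
.log file. A rough sketch of the safer derivation (illustrative only, not the 
actual patch): strip only the final extension of the file name itself:

{code}
// Illustrative sketch: derive the companion .log file from the index file's
// name rather than splitting the whole path on '.'.
import java.io.File;

public class LogFileFromIndex {
    public static File logFileFor(File indexFile) {
        String name = indexFile.getName();          // e.g. "00016895.index"
        int dot = name.lastIndexOf('.');
        String base = (dot >= 0) ? name.substring(0, dot) : name;
        return new File(indexFile.getParentFile(), base + ".log");
    }
}
{code}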



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KAFKA-1733) Producer.send will block indeterminately when broker is unavailable.

2014-10-27 Thread Marc Chung (JIRA)
Marc Chung created KAFKA-1733:
-

 Summary: Producer.send will block indeterminately when broker is 
unavailable.
 Key: KAFKA-1733
 URL: https://issues.apache.org/jira/browse/KAFKA-1733
 Project: Kafka
  Issue Type: Bug
  Components: core, producer 
Reporter: Marc Chung
Assignee: Jun Rao


This is a follow up to the conversation here:

https://mail-archives.apache.org/mod_mbox/kafka-dev/201409.mbox/%3ccaog_4qymoejhkbo0n31+a-ujx0z5unsisd5wbrmn-xtx7gi...@mail.gmail.com%3E

During ClientUtils.fetchTopicMetadata, if the broker is unavailable, 
socket.connect will block indeterminately. Any retry policy 
(message.send.max.retries) further increases the time spent waiting for the 
socket to connect.

The root fix is to add a connection timeout value to the BlockingChannel's 
socket configuration, like so:

{noformat}
-channel.socket.connect(new InetSocketAddress(host, port))
+channel.socket.connect(new InetSocketAddress(host, port), connectTimeoutMs)
{noformat}
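
For reference, java.net.Socket already provides a connect overload that takes a 
timeout, so a minimal sketch of the intended behavior (connectTimeoutMs being 
the proposed, not-yet-existing configuration value) looks like this:

{code}
// Sketch: connect with a timeout instead of blocking indefinitely.
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

public class ConnectWithTimeout {
    public static Socket connect(String host, int port, int connectTimeoutMs) throws IOException {
        Socket socket = new Socket();
        try {
            // Throws java.net.SocketTimeoutException if the connection cannot
            // be established within connectTimeoutMs milliseconds.
            socket.connect(new InetSocketAddress(host, port), connectTimeoutMs);
            return socket;
        } catch (IOException e) {
            socket.close();
            throw e;
        }
    }
}
{code}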

The simplest thing to do here would be to have a constant, default value that 
would be applied to every socket configuration. 

Is that acceptable? 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-1733) Producer.send will block indeterminately when broker is unavailable.

2014-10-27 Thread Marc Chung (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185539#comment-14185539
 ] 

Marc Chung commented on KAFKA-1733:
---

I have a patch (work in progress) here: 
https://github.com/mchung/kafka/commit/87b8ddbfe23dc887f56fa6f9ea3669733933c49b

> Producer.send will block indeterminately when broker is unavailable.
> 
>
> Key: KAFKA-1733
> URL: https://issues.apache.org/jira/browse/KAFKA-1733
> Project: Kafka
>  Issue Type: Bug
>  Components: core, producer 
>Reporter: Marc Chung
>Assignee: Jun Rao
>
> This is a follow up to the conversation here:
> https://mail-archives.apache.org/mod_mbox/kafka-dev/201409.mbox/%3ccaog_4qymoejhkbo0n31+a-ujx0z5unsisd5wbrmn-xtx7gi...@mail.gmail.com%3E
> During ClientUtils.fetchTopicMetadata, if the broker is unavailable, 
> socket.connect will block indeterminately. Any retry policy 
> (message.send.max.retries) further increases the time spent waiting for the 
> socket to connect.
> The root fix is to add a connection timeout value to the BlockingChannel's 
> socket configuration, like so:
> {noformat}
> -channel.socket.connect(new InetSocketAddress(host, port))
> +channel.socket.connect(new InetSocketAddress(host, port), connectTimeoutMs)
> {noformat}
> The simplest thing to do here would be to have a constant, default value that 
> would be applied to every socket configuration. 
> Is that acceptable? 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Review Request 27238: Patch for KAFKA-1732

2014-10-27 Thread Ewen Cheslack-Postava

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/27238/
---

Review request for kafka.


Bugs: KAFKA-1732
https://issues.apache.org/jira/browse/KAFKA-1732


Repository: kafka


Description
---

KAFKA-1732 Handle paths with '.' properly in DumpLogSegments.


Diffs
-

  core/src/main/scala/kafka/tools/DumpLogSegments.scala 
8e9d47b8d4adc5754ed8861aa04ddd3c6b629e3d 

Diff: https://reviews.apache.org/r/27238/diff/


Testing
---


Thanks,

Ewen Cheslack-Postava



[jira] [Commented] (KAFKA-1732) DumpLogSegments tool fails when path has a '.'

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185586#comment-14185586
 ] 

Ewen Cheslack-Postava commented on KAFKA-1732:
--

Created reviewboard https://reviews.apache.org/r/27238/diff/
 against branch origin/trunk

> DumpLogSegments tool fails when path has a '.'
> --
>
> Key: KAFKA-1732
> URL: https://issues.apache.org/jira/browse/KAFKA-1732
> Project: Kafka
>  Issue Type: Bug
>  Components: tools
>Affects Versions: 0.8.1.1
>Reporter: Ewen Cheslack-Postava
>Priority: Minor
> Attachments: KAFKA-1732.patch
>
>
> Using DumpLogSegments in a directory that has a '.' that isn't part of the 
> file extension causes an exception:
> {code}
> 16:48 $ time /Users/ewencp/kafka.git/bin/kafka-run-class.sh 
> kafka.tools.DumpLogSegments  --file 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
>  --verify-index-only
> Dumping 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
> Exception in thread "main" java.io.FileNotFoundException: 
> /Users/ewencp/kafka.log (No such file or directory)
>   at java.io.FileInputStream.open(Native Method)
>   at java.io.FileInputStream.<init>(FileInputStream.java:146)
>   at kafka.utils.Utils$.openChannel(Utils.scala:162)
>   at kafka.log.FileMessageSet.<init>(FileMessageSet.scala:74)
>   at 
> kafka.tools.DumpLogSegments$.kafka$tools$DumpLogSegments$$dumpIndex(DumpLogSegments.scala:109)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:80)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:73)
>   at 
> scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
>   at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
>   at kafka.tools.DumpLogSegments$.main(DumpLogSegments.scala:73)
>   at kafka.tools.DumpLogSegments.main(DumpLogSegments.scala)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-1732) DumpLogSegments tool fails when path has a '.'

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ewen Cheslack-Postava updated KAFKA-1732:
-
Assignee: Ewen Cheslack-Postava
  Status: Patch Available  (was: Open)

> DumpLogSegments tool fails when path has a '.'
> --
>
> Key: KAFKA-1732
> URL: https://issues.apache.org/jira/browse/KAFKA-1732
> Project: Kafka
>  Issue Type: Bug
>  Components: tools
>Affects Versions: 0.8.1.1
>Reporter: Ewen Cheslack-Postava
>Assignee: Ewen Cheslack-Postava
>Priority: Minor
> Attachments: KAFKA-1732.patch
>
>
> Using DumpLogSegments in a directory that has a '.' that isn't part of the 
> file extension causes an exception:
> {code}
> 16:48 $ time /Users/ewencp/kafka.git/bin/kafka-run-class.sh 
> kafka.tools.DumpLogSegments  --file 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
>  --verify-index-only
> Dumping 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
> Exception in thread "main" java.io.FileNotFoundException: 
> /Users/ewencp/kafka.log (No such file or directory)
>   at java.io.FileInputStream.open(Native Method)
>   at java.io.FileInputStream.<init>(FileInputStream.java:146)
>   at kafka.utils.Utils$.openChannel(Utils.scala:162)
>   at kafka.log.FileMessageSet.<init>(FileMessageSet.scala:74)
>   at 
> kafka.tools.DumpLogSegments$.kafka$tools$DumpLogSegments$$dumpIndex(DumpLogSegments.scala:109)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:80)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:73)
>   at 
> scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
>   at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
>   at kafka.tools.DumpLogSegments$.main(DumpLogSegments.scala:73)
>   at kafka.tools.DumpLogSegments.main(DumpLogSegments.scala)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-1732) DumpLogSegments tool fails when path has a '.'

2014-10-27 Thread Ewen Cheslack-Postava (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ewen Cheslack-Postava updated KAFKA-1732:
-
Attachment: KAFKA-1732.patch

> DumpLogSegments tool fails when path has a '.'
> --
>
> Key: KAFKA-1732
> URL: https://issues.apache.org/jira/browse/KAFKA-1732
> Project: Kafka
>  Issue Type: Bug
>  Components: tools
>Affects Versions: 0.8.1.1
>Reporter: Ewen Cheslack-Postava
>Priority: Minor
> Attachments: KAFKA-1732.patch
>
>
> Using DumpLogSegments in a directory that has a '.' that isn't part of the 
> file extension causes an exception:
> {code}
> 16:48 $ time /Users/ewencp/kafka.git/bin/kafka-run-class.sh 
> kafka.tools.DumpLogSegments  --file 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
>  --verify-index-only
> Dumping 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
> Exception in thread "main" java.io.FileNotFoundException: 
> /Users/ewencp/kafka.log (No such file or directory)
>   at java.io.FileInputStream.open(Native Method)
>   at java.io.FileInputStream.<init>(FileInputStream.java:146)
>   at kafka.utils.Utils$.openChannel(Utils.scala:162)
>   at kafka.log.FileMessageSet.<init>(FileMessageSet.scala:74)
>   at 
> kafka.tools.DumpLogSegments$.kafka$tools$DumpLogSegments$$dumpIndex(DumpLogSegments.scala:109)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:80)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:73)
>   at 
> scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
>   at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
>   at kafka.tools.DumpLogSegments$.main(DumpLogSegments.scala:73)
>   at kafka.tools.DumpLogSegments.main(DumpLogSegments.scala)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Kafka Security?

2014-10-27 Thread Stephenson, John L
List Users,
Does anyone know when/if Kafka security features are being planned?   I 
haven't seen much on the net outside of the following proposal:  
https://cwiki.apache.org/confluence/display/KAFKA/Security.

Thanks!
john


[jira] [Created] (KAFKA-1734) System test metric plotting nonexistent file warnings

2014-10-27 Thread Andrew Olson (JIRA)
Andrew Olson created KAFKA-1734:
---

 Summary: System test metric plotting nonexistent file warnings
 Key: KAFKA-1734
 URL: https://issues.apache.org/jira/browse/KAFKA-1734
 Project: Kafka
  Issue Type: Bug
Reporter: Andrew Olson
Priority: Minor


Running the system tests (trunk code), there are many "The file ... does not 
exist for plotting (metrics)" warning messages, for example,

{noformat}
2014-10-27 14:47:58,478 - WARNING - The file 
/opt/kafka/system_test/replication_testsuite/testcase_0007/logs/broker-3/metrics/kafka.network.RequestMetrics.Produce-RemoteTimeMs.csv
 does not exist for plotting (metrics)
{noformat}

Looks like the generated metric file names only include the last part of the 
metric, e.g. "Produce-RemoteTimeMs.csv" not 
"kafka.network.RequestMetrics.Produce-RemoteTimeMs.csv".

{noformat}
$ ls 
/opt/kafka/system_test/replication_testsuite/testcase_0007/logs/broker-3/metrics/*Produce*
/opt/kafka/system_test/replication_testsuite/testcase_0007/logs/broker-3/metrics/Produce-RemoteTimeMs.csv
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: Kafka Security?

2014-10-27 Thread Gwen Shapira
This is very much work in progress.
You can follow the Jira here to see how it goes:
https://issues.apache.org/jira/browse/KAFKA-1682

On Mon, Oct 27, 2014 at 11:49 AM, Stephenson, John L
 wrote:
> List Users,
> Does anyone know when/if Kafka security features are being planned?   I 
> haven't seen much on the net outside of the following proposal:  
> https://cwiki.apache.org/confluence/display/KAFKA/Security.
>
> Thanks!
> john


[jira] [Updated] (KAFKA-1731) add config/jmx changes in 0.8.2 doc

2014-10-27 Thread Jun Rao (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jun Rao updated KAFKA-1731:
---
Fix Version/s: 0.8.2
 Assignee: Jun Rao

I made a pass on the site doc to add the new broker-side configs (offset 
management related configs will be added in KAFKA-1729) and the important JMX 
metrics. This is already committed to svn. I will leave this ticket open for a 
few more days for comments.

> add config/jmx changes in 0.8.2 doc
> ---
>
> Key: KAFKA-1731
> URL: https://issues.apache.org/jira/browse/KAFKA-1731
> Project: Kafka
>  Issue Type: Sub-task
>Reporter: Jun Rao
>Assignee: Jun Rao
> Fix For: 0.8.2
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-1731) add config/jmx changes in 0.8.2 doc

2014-10-27 Thread Gwen Shapira (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185818#comment-14185818
 ] 

Gwen Shapira commented on KAFKA-1731:
-

Any chance you can upload a patch so we can see what changed?

> add config/jmx changes in 0.8.2 doc
> ---
>
> Key: KAFKA-1731
> URL: https://issues.apache.org/jira/browse/KAFKA-1731
> Project: Kafka
>  Issue Type: Sub-task
>Reporter: Jun Rao
>Assignee: Jun Rao
> Fix For: 0.8.2
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-1731) add config/jmx changes in 0.8.2 doc

2014-10-27 Thread Jun Rao (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jun Rao updated KAFKA-1731:
---
Attachment: config-jmx_082.patch

Attached please find the patch.

> add config/jmx changes in 0.8.2 doc
> ---
>
> Key: KAFKA-1731
> URL: https://issues.apache.org/jira/browse/KAFKA-1731
> Project: Kafka
>  Issue Type: Sub-task
>Reporter: Jun Rao
>Assignee: Jun Rao
> Fix For: 0.8.2
>
> Attachments: config-jmx_082.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-1733) Producer.send will block indeterminately when broker is unavailable.

2014-10-27 Thread Neha Narkhede (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Neha Narkhede updated KAFKA-1733:
-
Reviewer: Jun Rao
Assignee: (was: Jun Rao)

> Producer.send will block indeterminately when broker is unavailable.
> 
>
> Key: KAFKA-1733
> URL: https://issues.apache.org/jira/browse/KAFKA-1733
> Project: Kafka
>  Issue Type: Bug
>  Components: core, producer 
>Reporter: Marc Chung
>
> This is a follow up to the conversation here:
> https://mail-archives.apache.org/mod_mbox/kafka-dev/201409.mbox/%3ccaog_4qymoejhkbo0n31+a-ujx0z5unsisd5wbrmn-xtx7gi...@mail.gmail.com%3E
> During ClientUtils.fetchTopicMetadata, if the broker is unavailable, 
> socket.connect will block indeterminately. Any retry policy 
> (message.send.max.retries) further increases the time spent waiting for the 
> socket to connect.
> The root fix is to add a connection timeout value to the BlockingChannel's 
> socket configuration, like so:
> {noformat}
> -channel.socket.connect(new InetSocketAddress(host, port))
> +channel.socket.connect(new InetSocketAddress(host, port), connectTimeoutMs)
> {noformat}
> The simplest thing to do here would be to have a constant, default value that 
> would be applied to every socket configuration. 
> Is that acceptable? 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: Review Request 27238: Patch for KAFKA-1732

2014-10-27 Thread Neha Narkhede

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/27238/#review58723
---

Ship it!


Ship It!

- Neha Narkhede


On Oct. 27, 2014, 6:41 p.m., Ewen Cheslack-Postava wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/27238/
> ---
> 
> (Updated Oct. 27, 2014, 6:41 p.m.)
> 
> 
> Review request for kafka.
> 
> 
> Bugs: KAFKA-1732
> https://issues.apache.org/jira/browse/KAFKA-1732
> 
> 
> Repository: kafka
> 
> 
> Description
> ---
> 
> KAFKA-1732 Handle paths with '.' properly in DumpLogSegments.
> 
> 
> Diffs
> -
> 
>   core/src/main/scala/kafka/tools/DumpLogSegments.scala 
> 8e9d47b8d4adc5754ed8861aa04ddd3c6b629e3d 
> 
> Diff: https://reviews.apache.org/r/27238/diff/
> 
> 
> Testing
> ---
> 
> 
> Thanks,
> 
> Ewen Cheslack-Postava
> 
>



[jira] [Updated] (KAFKA-1732) DumpLogSegments tool fails when path has a '.'

2014-10-27 Thread Neha Narkhede (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Neha Narkhede updated KAFKA-1732:
-
Resolution: Fixed
Status: Resolved  (was: Patch Available)

Thanks for the patch. Pushed to trunk and 0.8.2

> DumpLogSegments tool fails when path has a '.'
> --
>
> Key: KAFKA-1732
> URL: https://issues.apache.org/jira/browse/KAFKA-1732
> Project: Kafka
>  Issue Type: Bug
>  Components: tools
>Affects Versions: 0.8.1.1
>Reporter: Ewen Cheslack-Postava
>Assignee: Ewen Cheslack-Postava
>Priority: Minor
> Attachments: KAFKA-1732.patch
>
>
> Using DumpLogSegments in a directory that has a '.' that isn't part of the 
> file extension causes an exception:
> {code}
> 16:48 $ time /Users/ewencp/kafka.git/bin/kafka-run-class.sh 
> kafka.tools.DumpLogSegments  --file 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
>  --verify-index-only
> Dumping 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
> Exception in thread "main" java.io.FileNotFoundException: 
> /Users/ewencp/kafka.log (No such file or directory)
>   at java.io.FileInputStream.open(Native Method)
>   at java.io.FileInputStream.<init>(FileInputStream.java:146)
>   at kafka.utils.Utils$.openChannel(Utils.scala:162)
>   at kafka.log.FileMessageSet.<init>(FileMessageSet.scala:74)
>   at 
> kafka.tools.DumpLogSegments$.kafka$tools$DumpLogSegments$$dumpIndex(DumpLogSegments.scala:109)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:80)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:73)
>   at 
> scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
>   at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
>   at kafka.tools.DumpLogSegments$.main(DumpLogSegments.scala:73)
>   at kafka.tools.DumpLogSegments.main(DumpLogSegments.scala)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: Review Request 26755: Patch for KAFKA-1706

2014-10-27 Thread Joel Koshy


> On Oct. 25, 2014, 7:52 a.m., Joel Koshy wrote:
> > core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala, line 109
> > 
> >
> > getAndDecrement(sizeFunction.get(e))
> 
> Jiangjie Qin wrote:
> It seems getAndDecrement() does not take an argument and will always 
> decrement by 1.

ah yes you are right
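
For the record, decrementing by the element's size would need something like 
AtomicInteger.addAndGet with a negative delta; for example (illustrative only, 
the field and function names here are hypothetical):

{code}
// Illustrative only: getAndDecrement() always decrements by exactly 1, so a
// by-size decrement has to go through addAndGet with a negative delta.
currentByteSize.addAndGet(-sizeFunction(element));
{code}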


> On Oct. 25, 2014, 7:52 a.m., Joel Koshy wrote:
> > core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala, line 82
> > 
> >
> > One significant caveat to this approach (and in the timed variant 
> > above) is that if a single large element needs to be enqueued it could 
> > potentially block a number of smaller elements from being enqueued. This 
> > may be okay in the case of mirror maker, though it would make it less 
> > useful as a generic utility.
> 
> Jiangjie Qin wrote:
> I'm not sure why the big put could block small ones... It is possible 
> that a very large item is put into the queue and pushes it well past the 
> byte limit. In that case, all subsequent puts will be blocked until a 
> bunch of small messages are taken out of the queue, but that seems to be the 
> purpose of having a byte limit for the queue.

I looked again. Yes you are right. It should not block smaller puts. Now I'm 
going to ask the question from the other side of the table: since you are just 
notifying waiting threads, it is possible for a large put to get starved if 
there are a lot of smaller puts that get notified earlier. To the best of my 
knowledge the JVM does not guarantee fairness in unblocking multiple contending 
threads. Ideally there should be some notion of maximum wait before a put 
attempt takes priority over others. i.e., these are nuances that may be a 
compelling reason to make it a specialized utility within MirrorMaker itself 
since it is not general enough (yet).
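
To make the fairness point concrete, here is a small sketch (illustrative only, 
not a proposal for this patch): a fair ReentrantLock hands the lock to waiters 
in roughly arrival order, which is the kind of ordering plain notify/notifyAll 
does not guarantee. Even so, a large put that wakes up and still does not fit 
goes back to waiting, so a real fix would need an explicit maximum-wait or 
reservation scheme on top of this.

{code}
// Sketch of a byte-bounded gate with fair lock acquisition; illustrative only,
// not the MirrorMaker queue implementation.
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

public class FairByteGate {
    private final long capacityBytes;
    private long usedBytes = 0;
    private final ReentrantLock lock = new ReentrantLock(true); // fair: FIFO-ish handoff
    private final Condition spaceFreed = lock.newCondition();

    public FairByteGate(long capacityBytes) {
        this.capacityBytes = capacityBytes;
    }

    public void acquire(long bytes) throws InterruptedException {
        lock.lock();
        try {
            // Note: an element larger than capacityBytes would wait forever here;
            // a real implementation must special-case oversized elements.
            while (usedBytes + bytes > capacityBytes)
                spaceFreed.await();
            usedBytes += bytes;
        } finally {
            lock.unlock();
        }
    }

    public void release(long bytes) {
        lock.lock();
        try {
            usedBytes -= bytes;
            spaceFreed.signalAll();
        } finally {
            lock.unlock();
        }
    }
}
{code}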


- Joel


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26755/#review58497
---


On Oct. 27, 2014, 6:50 a.m., Jiangjie Qin wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/26755/
> ---
> 
> (Updated Oct. 27, 2014, 6:50 a.m.)
> 
> 
> Review request for kafka.
> 
> 
> Bugs: KAFKA-1706
> https://issues.apache.org/jira/browse/KAFKA-1706
> 
> 
> Repository: kafka
> 
> 
> Description
> ---
> 
> changed arguments name
> 
> 
> correct typo.
> 
> 
> Incorporated Joel's comments. Also fixed negative queue size problem.
> 
> 
> Diffs
> -
> 
>   core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala PRE-CREATION 
> 
> Diff: https://reviews.apache.org/r/26755/diff/
> 
> 
> Testing
> ---
> 
> 
> Thanks,
> 
> Jiangjie Qin
> 
>



[DISCUSSION] Nested compression in Kafka?

2014-10-27 Thread Guozhang Wang
Hello folks,

I came across "testComplexCompressDecompress" in
kafka.message.MessageCompressionTest while working on some consumer
decompression optimization. This test checks whether nested compression is
supported.

I vaguely remember that some time ago we decided not to support nested
compression in Kafka, and in the new producer's MemoryRecords I also make
this assumption in the iterator implementation. Is that still the case? If
so, shall we remove this test case?

-- Guozhang


[jira] [Created] (KAFKA-1735) MemoryRecords.Iterator needs to handle partial reads from compressed stream

2014-10-27 Thread Guozhang Wang (JIRA)
Guozhang Wang created KAFKA-1735:


 Summary: MemoryRecords.Iterator needs to handle partial reads from 
compressed stream
 Key: KAFKA-1735
 URL: https://issues.apache.org/jira/browse/KAFKA-1735
 Project: Kafka
  Issue Type: Bug
Reporter: Guozhang Wang
Assignee: Guozhang Wang
 Fix For: 0.9.0


Found a bug in the MemoryRecords.Iterator implementation, where 

{code}
stream.read(recordBuffer, 0, size)
{code}

can read fewer than size bytes, leaving the rest of the recordBuffer set to 
"\0".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Review Request 27256: Fix KAFKA-1735

2014-10-27 Thread Guozhang Wang

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/27256/
---

Review request for kafka.


Bugs: KAFKA-1735
https://issues.apache.org/jira/browse/KAFKA-1735


Repository: kafka


Description
---

Handle partial reads from compressed stream


Diffs
-

  clients/src/main/java/org/apache/kafka/common/record/MemoryRecords.java 
040e5b91005edb8f015afdfa76fd94e0bf3cb4ca 

Diff: https://reviews.apache.org/r/27256/diff/


Testing
---


Thanks,

Guozhang Wang



[jira] [Updated] (KAFKA-1735) MemoryRecords.Iterator needs to handle partial reads from compressed stream

2014-10-27 Thread Guozhang Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guozhang Wang updated KAFKA-1735:
-
Status: Patch Available  (was: Open)

> MemoryRecords.Iterator needs to handle partial reads from compressed stream
> ---
>
> Key: KAFKA-1735
> URL: https://issues.apache.org/jira/browse/KAFKA-1735
> Project: Kafka
>  Issue Type: Bug
>Reporter: Guozhang Wang
>Assignee: Guozhang Wang
> Fix For: 0.9.0
>
> Attachments: KAFKA-1735.patch
>
>
> Found a bug in the MemoryRecords.Iterator implementation, where 
> {code}
> stream.read(recordBuffer, 0, size)
> {code}
> can read fewer than size bytes, leaving the rest of the recordBuffer set to 
> "\0".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-1735) MemoryRecords.Iterator needs to handle partial reads from compressed stream

2014-10-27 Thread Guozhang Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guozhang Wang updated KAFKA-1735:
-
Attachment: KAFKA-1735.patch

> MemoryRecords.Iterator needs to handle partial reads from compressed stream
> ---
>
> Key: KAFKA-1735
> URL: https://issues.apache.org/jira/browse/KAFKA-1735
> Project: Kafka
>  Issue Type: Bug
>Reporter: Guozhang Wang
>Assignee: Guozhang Wang
> Fix For: 0.9.0
>
> Attachments: KAFKA-1735.patch
>
>
> Found a bug in the MemoryRecords.Iterator implementation, where 
> {code}
> stream.read(recordBuffer, 0, size)
> {code}
> can read fewer than size bytes, leaving the rest of the recordBuffer set to 
> "\0".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-1735) MemoryRecords.Iterator needs to handle partial reads from compressed stream

2014-10-27 Thread Guozhang Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186072#comment-14186072
 ] 

Guozhang Wang commented on KAFKA-1735:
--

Created reviewboard https://reviews.apache.org/r/27256/diff/
 against branch origin/trunk

> MemoryRecords.Iterator needs to handle partial reads from compressed stream
> ---
>
> Key: KAFKA-1735
> URL: https://issues.apache.org/jira/browse/KAFKA-1735
> Project: Kafka
>  Issue Type: Bug
>Reporter: Guozhang Wang
>Assignee: Guozhang Wang
> Fix For: 0.9.0
>
> Attachments: KAFKA-1735.patch
>
>
> Found a bug in the MemoryRecords.Iterator implementation, where 
> {code}
> stream.read(recordBuffer, 0, size)
> {code}
> can read fewer than size bytes, leaving the rest of the recordBuffer set to 
> "\0".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: Review Request 26373: Patch for KAFKA-1647

2014-10-27 Thread Jiangjie Qin

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26373/
---

(Updated Oct. 28, 2014, 12:19 a.m.)


Review request for kafka.


Bugs: KAFKA-1647
https://issues.apache.org/jira/browse/KAFKA-1647


Repository: kafka


Description (updated)
---

Addressed Joel's comments.


The version 2 code seems to have been submitted by mistake... This should be the 
code for review that addresses Joel's comments.


Addressed Jun's comments. Will do tests to verify if it works.


Addressed Joel's comments; we do not need to check whether the leader exists 
when adding the fetcher.


Diffs (updated)
-

  core/src/main/scala/kafka/server/ReplicaManager.scala 
78b7514cc109547c562e635824684fad581af653 

Diff: https://reviews.apache.org/r/26373/diff/


Testing
---


Thanks,

Jiangjie Qin



[jira] [Commented] (KAFKA-1647) Replication offset checkpoints (high water marks) can be lost on hard kills and restarts

2014-10-27 Thread Jiangjie Qin (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186104#comment-14186104
 ] 

Jiangjie Qin commented on KAFKA-1647:
-

Updated reviewboard https://reviews.apache.org/r/26373/diff/
 against branch origin/trunk

> Replication offset checkpoints (high water marks) can be lost on hard kills 
> and restarts
> 
>
> Key: KAFKA-1647
> URL: https://issues.apache.org/jira/browse/KAFKA-1647
> Project: Kafka
>  Issue Type: Bug
>Affects Versions: 0.8.2
>Reporter: Joel Koshy
>Assignee: Jiangjie Qin
>Priority: Critical
>  Labels: newbie++
> Fix For: 0.8.2
>
> Attachments: KAFKA-1647.patch, KAFKA-1647_2014-10-13_16:38:39.patch, 
> KAFKA-1647_2014-10-18_00:26:51.patch, KAFKA-1647_2014-10-21_23:08:43.patch, 
> KAFKA-1647_2014-10-27_17:19:07.patch
>
>
> We ran into this scenario recently in a production environment. This can 
> happen when enough brokers in a cluster are taken down. i.e., a rolling 
> bounce done properly should not cause this issue. It can occur if all 
> replicas for any partition are taken down.
> Here is a sample scenario:
> * Cluster of three brokers: b0, b1, b2
> * Two partitions (of some topic) with replication factor two: p0, p1
> * Initial state:
> p0: leader = b0, ISR = {b0, b1}
> p1: leader = b1, ISR = {b0, b1}
> * Do a parallel hard-kill of all brokers
> * Bring up b2, so it is the new controller
> * b2 initializes its controller context and populates its leader/ISR cache 
> (i.e., controllerContext.partitionLeadershipInfo) from zookeeper. The last 
> known leaders are b0 (for p0) and b1 (for p1)
> * Bring up b1
> * The controller's onBrokerStartup procedure initiates a replica state change 
> for all replicas on b1 to become online. As part of this replica state change 
> it gets the last known leader and ISR and sends a LeaderAndIsrRequest to b1 
> (for p0 and p1). This LeaderAndIsr request contains: {p0: leader=b0; p1: 
> leader=b1}; leaders={b1}. b0 is indicated as the leader of p0 but it is not 
> included in the leaders field because b0 is down.
> * On receiving the LeaderAndIsrRequest, b1's replica manager will 
> successfully make itself (b1) the leader for p1 (and create the local replica 
> object corresponding to p1). It will however abort the become follower 
> transition for p0 because the designated leader b0 is offline. So it will not 
> create the local replica object for p0.
> * It will then start the high water mark checkpoint thread. Since only p1 has 
> a local replica object, only p1's high water mark will be checkpointed to 
> disk. p0's previously written checkpoint, if any, will be lost.
> So in summary it seems we should always create the local replica object even 
> if the online transition does not happen.
> Possible symptoms of the above bug could be one or more of the following (we 
> saw 2 and 3):
> # Data loss; yes on a hard-kill data loss is expected, but this can actually 
> cause loss of nearly all data if the broker becomes follower, truncates, and 
> soon after happens to become leader.
> # High IO on brokers that lose their high water mark then subsequently (on a 
> successful become follower transition) truncate their log to zero and start 
> catching up from the beginning.
> # If the offsets topic is affected, then offsets can get reset. This is 
> because during an offset load we don't read past the high water mark. So if a 
> water mark is missing then we don't load anything (even if the offsets are 
> there in the log).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-1647) Replication offset checkpoints (high water marks) can be lost on hard kills and restarts

2014-10-27 Thread Jiangjie Qin (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jiangjie Qin updated KAFKA-1647:

Attachment: KAFKA-1647_2014-10-27_17:19:07.patch

> Replication offset checkpoints (high water marks) can be lost on hard kills 
> and restarts
> 
>
> Key: KAFKA-1647
> URL: https://issues.apache.org/jira/browse/KAFKA-1647
> Project: Kafka
>  Issue Type: Bug
>Affects Versions: 0.8.2
>Reporter: Joel Koshy
>Assignee: Jiangjie Qin
>Priority: Critical
>  Labels: newbie++
> Fix For: 0.8.2
>
> Attachments: KAFKA-1647.patch, KAFKA-1647_2014-10-13_16:38:39.patch, 
> KAFKA-1647_2014-10-18_00:26:51.patch, KAFKA-1647_2014-10-21_23:08:43.patch, 
> KAFKA-1647_2014-10-27_17:19:07.patch
>
>
> We ran into this scenario recently in a production environment. This can 
> happen when enough brokers in a cluster are taken down. i.e., a rolling 
> bounce done properly should not cause this issue. It can occur if all 
> replicas for any partition are taken down.
> Here is a sample scenario:
> * Cluster of three brokers: b0, b1, b2
> * Two partitions (of some topic) with replication factor two: p0, p1
> * Initial state:
> p0: leader = b0, ISR = {b0, b1}
> p1: leader = b1, ISR = {b0, b1}
> * Do a parallel hard-kill of all brokers
> * Bring up b2, so it is the new controller
> * b2 initializes its controller context and populates its leader/ISR cache 
> (i.e., controllerContext.partitionLeadershipInfo) from zookeeper. The last 
> known leaders are b0 (for p0) and b1 (for p1)
> * Bring up b1
> * The controller's onBrokerStartup procedure initiates a replica state change 
> for all replicas on b1 to become online. As part of this replica state change 
> it gets the last known leader and ISR and sends a LeaderAndIsrRequest to b1 
> (for p0 and p1). This LeaderAndIsr request contains: {p0: leader=b0; p1: 
> leader=b1}; leaders={b1}. b0 is indicated as the leader of p0 but it is not 
> included in the leaders field because b0 is down.
> * On receiving the LeaderAndIsrRequest, b1's replica manager will 
> successfully make itself (b1) the leader for p1 (and create the local replica 
> object corresponding to p1). It will however abort the become follower 
> transition for p0 because the designated leader b0 is offline. So it will not 
> create the local replica object for p0.
> * It will then start the high water mark checkpoint thread. Since only p1 has 
> a local replica object, only p1's high water mark will be checkpointed to 
> disk. p0's previously written checkpoint, if any, will be lost.
> So in summary it seems we should always create the local replica object even 
> if the online transition does not happen.
> Possible symptoms of the above bug could be one or more of the following (we 
> saw 2 and 3):
> # Data loss; yes on a hard-kill data loss is expected, but this can actually 
> cause loss of nearly all data if the broker becomes follower, truncates, and 
> soon after happens to become leader.
> # High IO on brokers that lose their high water mark then subsequently (on a 
> successful become follower transition) truncate their log to zero and start 
> catching up from the beginning.
> # If the offsets topic is affected, then offsets can get reset. This is 
> because during an offset load we don't read past the high water mark. So if a 
> water mark is missing then we don't load anything (even if the offsets are 
> there in the log).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: Review Request 26373: Patch for KAFKA-1647

2014-10-27 Thread Jiangjie Qin

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26373/
---

(Updated Oct. 28, 2014, 12:20 a.m.)


Review request for kafka.


Bugs: KAFKA-1647
https://issues.apache.org/jira/browse/KAFKA-1647


Repository: kafka


Description
---

Addressed Joel's comments.


The version 2 code seems to have been submitted by mistake... This should be the 
code for review that addresses Joel's comments.


Addressed Jun's comments. Will do tests to verify if it works.


Addressed Joel's comments; we do not need to check whether the leader exists 
when adding the fetcher.


Diffs
-

  core/src/main/scala/kafka/server/ReplicaManager.scala 
78b7514cc109547c562e635824684fad581af653 

Diff: https://reviews.apache.org/r/26373/diff/


Testing (updated)
---

Followed Joel's testing steps. I was able to reproduce the problem without the 
patch, and the WARN message goes away after applying the patch.


Thanks,

Jiangjie Qin



Jenkins build is back to normal : Kafka-trunk #319

2014-10-27 Thread Apache Jenkins Server
See 



[jira] [Commented] (KAFKA-1481) Stop using dashes AND underscores as separators in MBean names

2014-10-27 Thread Jun Rao (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186124#comment-14186124
 ] 

Jun Rao commented on KAFKA-1481:


Vladimir,

Thanks for the patch. Really appreciate your help. I realized that this is one 
of the biggest pieces of technical debt that we have accumulated over time, so it 
may take some time to sort out. Bear with me. Some more comments.

30. About Taggable, I still have mixed feelings. I can see why you created it. 
However, my reasoning is that for a lot of the case classes (ClientIdTopic, 
ClientIdAndBroker) that we create, it's weird that some of them are taggable 
and some are not, depending on whether they are used for tagging metric 
names. Those classes have no direct relationship with the metrics. 
Similarly, we only need to be aware of tags when creating metrics. Also, 
because of this, we changed the constructor of SimpleConsumer. Since this is an 
API change, we should really try to avoid it. 

My feeling is that it's probably simpler if we just create regular case classes 
as before and generate metric tags explicitly when we create the metric. For 
example, in AbstractFetcherThread, we can do

class FetcherStats(clientIdAndBroker: ClientIdAndBroker) extends KafkaMetricsGroup {
  val requestRate = newMeter("RequestsPerSec", "requests", TimeUnit.SECONDS,
    Map("clientId" -> clientIdAndBroker.clientId,
        "brokerHost" -> clientIdAndBroker.host,
        "brokerPort" -> clientIdAndBroker.port))
}

and just have ClientIdAndBroker be the following case class.

case class ClientIdAndBroker(clientId: String, host: String, port: Int)

This way, the code is a bit cleaner since all the metric tag related stuff are 
isolated to those places when the metrics are created. So, I'd suggest that we 
remove Taggable.

31. AbstractFetcherThread:
31.1 You changed the meaning of clientId. clientId is used in the fetch request 
and we want to leave it as just the clientId string. Since the clientId should 
uniquely represent a particular consumer client, we just need to include 
the clientId in the metric name. We don't need to include the consumer id in 
either the fetch request or the metric name since it's too long and has 
redundant info. 
31.2 FetcherLagStats: This is an existing problem. FetcherLagMetrics shouldn't 
be keyed off ClientIdBrokerTopicPartition. It should be keyed off 
ClientIdTopicPartition. This way, the metric name remains the same independent 
of the current leader of those partitions.

32. ZookeeperConsumerConnector:
32.1 FetchQueueSize: I agree that the metric name just needs to be tagged with 
clientId, topic and threadId. We don't need to include the consumerId since 
it's too long (note that topicThread._2 includes both the consumerId and the 
threadId).

33. KafkaMetricsGroup: Duplicate entries.
// kafka.consumer.ConsumerTopicStats <-- kafka.consumer.{ConsumerIterator, PartitionTopicInfo}
explicitMetricName("kafka.consumer", "ConsumerTopicMetrics", "MessagesPerSec"),
explicitMetricName("kafka.consumer", "ConsumerTopicMetrics", "MessagesPerSec"),

// kafka.consumer.ConsumerTopicStats
explicitMetricName("kafka.consumer", "ConsumerTopicMetrics", "BytesPerSec"),
explicitMetricName("kafka.consumer", "ConsumerTopicMetrics", "BytesPerSec"),

// kafka.consumer.FetchRequestAndResponseStats <-- kafka.consumer.SimpleConsumer
explicitMetricName("kafka.consumer", "FetchRequestAndResponseMetrics", "FetchResponseSize"),
explicitMetricName("kafka.consumer", "FetchRequestAndResponseMetrics", "FetchRequestRateAndTimeMs"),
explicitMetricName("kafka.consumer", "FetchRequestAndResponseMetrics", "FetchResponseSize"),
explicitMetricName("kafka.consumer", "FetchRequestAndResponseMetrics", "FetchRequestRateAndTimeMs"),

/**
 * ProducerRequestStats <-- SyncProducer
 * metric for SyncProducer in fetchTopicMetaData() needs to be removed when consumer is closed.
 */
explicitMetricName("kafka.producer", "ProducerRequestMetrics", "ProducerRequestRateAndTimeMs"),
explicitMetricName("kafka.producer", "ProducerRequestMetrics", "ProducerRequestSize"),
explicitMetricName("kafka.producer", "ProducerRequestMetrics", "ProducerRequestRateAndTimeMs"),
explicitMetricName("kafka.producer", "ProducerRequestMetrics", "ProducerRequestSize")

34. AbstractFetcherManager: Could you put the following on two separate lines? 
Similar things happen in a few other files. Perhaps you need to change the 
formatting in your IDE?

    }, metricPrefix.toTags

  private def getFetcherId(topic: String, partitionId: Int): Int = {
    Utils.abs(31 * topic.hashCode() + partitionId) % numFetchers



> Stop using dashes AND underscores as separators in MBean names

[jira] [Commented] (KAFKA-1501) transient unit tests failures due to port already in use

2014-10-27 Thread Jay Kreps (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186130#comment-14186130
 ] 

Jay Kreps commented on KAFKA-1501:
--

Nice, so statistically it is 93% likely to be fixed, then!

So, since this changes the socket server default, is this the right thing to do? 
Could it have any negative side effects in production? I actually don't really 
understand the effect of this option or why the lack of it was causing the 
failure. Could you explain?

> transient unit tests failures due to port already in use
> 
>
> Key: KAFKA-1501
> URL: https://issues.apache.org/jira/browse/KAFKA-1501
> Project: Kafka
>  Issue Type: Improvement
>  Components: core
>Reporter: Jun Rao
>Assignee: Guozhang Wang
>  Labels: newbie
> Attachments: KAFKA-1501.patch, KAFKA-1501.patch
>
>
> Saw the following transient failures.
> kafka.api.ProducerFailureHandlingTest > testTooLargeRecordWithAckOne FAILED
> kafka.common.KafkaException: Socket server failed to bind to 
> localhost:59909: Address already in use.
> at kafka.network.Acceptor.openServerSocket(SocketServer.scala:195)
> at kafka.network.Acceptor.<init>(SocketServer.scala:141)
> at kafka.network.SocketServer.startup(SocketServer.scala:68)
> at kafka.server.KafkaServer.startup(KafkaServer.scala:95)
> at kafka.utils.TestUtils$.createServer(TestUtils.scala:123)
> at 
> kafka.api.ProducerFailureHandlingTest.setUp(ProducerFailureHandlingTest.scala:68)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[ANNOUNCEMENT] Apache Kafka 0.8.2-beta Released

2014-10-27 Thread Joe Stein
The Apache Kafka community is pleased to announce the beta release for Apache 
Kafka 0.8.2.

The 0.8.2-beta release introduces many new features, improvements and fixes 
including:
 - A new Java producer for ease of implementation and enhanced performance.
 - Delete topic support.
 - Per topic configuration of preference for consistency over availability.
 - Scala 2.11 support and dropping support for Scala 2.8.
 - LZ4 Compression.

All of the changes in this release can be found: 
https://archive.apache.org/dist/kafka/0.8.2-beta/RELEASE_NOTES.html

Apache Kafka is a high-throughput, publish-subscribe messaging system rethought 
of as a distributed commit log.

** Fast => A single Kafka broker can handle hundreds of megabytes of reads and 
writes per second from thousands of clients.

** Scalable => Kafka is designed to allow a single cluster to serve as the 
central data backbone 
for a large organization. It can be elastically and transparently expanded 
without downtime. 
Data streams are partitioned and spread over a cluster of machines to allow 
data streams 
larger than the capability of any single machine and to allow clusters of 
co-ordinated consumers.

** Durable => Messages are persisted on disk and replicated within the cluster 
to prevent 
data loss. Each broker can handle terabytes of messages without performance 
impact.

** Distributed by Design => Kafka has a modern cluster-centric design that 
offers 
strong durability and fault-tolerance guarantees.

You can download the release from: http://kafka.apache.org/downloads.html

We welcome your help and feedback. For more information on how to
report problems, and to get involved, visit the project website at 
http://kafka.apache.org/



Re: [ANNOUNCEMENT] Apache Kafka 0.8.2-beta Released

2014-10-27 Thread Jay Kreps
I actually don't see the beta release on that download page:
http://kafka.apache.org/downloads.html

-Jay

On Mon, Oct 27, 2014 at 5:50 PM, Joe Stein  wrote:

> The Apache Kafka community is pleased to announce the beta release for
> Apache Kafka 0.8.2.
>
> The 0.8.2-beta release introduces many new features, improvements and
> fixes including:
>  - A new Java producer for ease of implementation and enhanced performance.
>  - Delete topic support.
>  - Per topic configuration of preference for consistency over availability.
>  - Scala 2.11 support and dropping support for Scala 2.8.
>  - LZ4 Compression.
>
> All of the changes in this release can be found:
> https://archive.apache.org/dist/kafka/0.8.2-beta/RELEASE_NOTES.html
>
> Apache Kafka is high-throughput, publish-subscribe messaging system
> rethought of as a distributed commit log.
>
> ** Fast => A single Kafka broker can handle hundreds of megabytes of reads
> and
> writes per second from thousands of clients.
>
> ** Scalable => Kafka is designed to allow a single cluster to serve as the
> central data backbone
> for a large organization. It can be elastically and transparently expanded
> without downtime.
> Data streams are partitioned and spread over a cluster of machines to
> allow data streams
> larger than the capability of any single machine and to allow clusters of
> co-ordinated consumers.
>
> ** Durable => Messages are persisted on disk and replicated within the
> cluster to prevent
> data loss. Each broker can handle terabytes of messages without
> performance impact.
>
> ** Distributed by Design => Kafka has a modern cluster-centric design that
> offers
> strong durability and fault-tolerance guarantees.
>
> You can download the release from: http://kafka.apache.org/downloads.html
>
> We welcome your help and feedback. For more information on how to
> report problems, and to get involved, visit the project website at
> http://kafka.apache.org/
>
>


Re: [ANNOUNCEMENT] Apache Kafka 0.8.2-beta Released

2014-10-27 Thread Gwen Shapira
Strange. I'm seeing it.

Browser cache?

On Mon, Oct 27, 2014 at 5:59 PM, Jay Kreps  wrote:
> I actually don't see the beta release on that download page:
> http://kafka.apache.org/downloads.html
>
> -Jay
>
> On Mon, Oct 27, 2014 at 5:50 PM, Joe Stein  wrote:
>
>> The Apache Kafka community is pleased to announce the beta release for
>> Apache Kafka 0.8.2.
>>
>> The 0.8.2-beta release introduces many new features, improvements and
>> fixes including:
>>  - A new Java producer for ease of implementation and enhanced performance.
>>  - Delete topic support.
>>  - Per topic configuration of preference for consistency over availability.
>>  - Scala 2.11 support and dropping support for Scala 2.8.
>>  - LZ4 Compression.
>>
>> All of the changes in this release can be found:
>> https://archive.apache.org/dist/kafka/0.8.2-beta/RELEASE_NOTES.html
>>
>> Apache Kafka is high-throughput, publish-subscribe messaging system
>> rethought of as a distributed commit log.
>>
>> ** Fast => A single Kafka broker can handle hundreds of megabytes of reads
>> and
>> writes per second from thousands of clients.
>>
>> ** Scalable => Kafka is designed to allow a single cluster to serve as the
>> central data backbone
>> for a large organization. It can be elastically and transparently expanded
>> without downtime.
>> Data streams are partitioned and spread over a cluster of machines to
>> allow data streams
>> larger than the capability of any single machine and to allow clusters of
>> co-ordinated consumers.
>>
>> ** Durable => Messages are persisted on disk and replicated within the
>> cluster to prevent
>> data loss. Each broker can handle terabytes of messages without
>> performance impact.
>>
>> ** Distributed by Design => Kafka has a modern cluster-centric design that
>> offers
>> strong durability and fault-tolerance guarantees.
>>
>> You can download the release from: http://kafka.apache.org/downloads.html
>>
>> We welcome your help and feedback. For more information on how to
>> report problems, and to get involved, visit the project website at
>> http://kafka.apache.org/
>>
>>


Re: Review Request 26755: Patch for KAFKA-1706

2014-10-27 Thread Joel Koshy

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26755/#review58725
---


Another thing I forgot to mention in the earlier review: we definitely should 
have a unit test for this. You will need to allow passing in the Time interface 
and use MockTime in the test.
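
For illustration only, a minimal sketch of what injecting the clock could look 
like (the real kafka.utils.Time/MockTime and the queue's constructor may differ; 
the stub class below is just a stand-in):

// Simplified stand-ins for kafka.utils.Time and kafka.utils.MockTime.
trait Time { def milliseconds: Long }

class MockTime(private var nowMs: Long = 0L) extends Time {
  def milliseconds: Long = nowMs
  def sleep(ms: Long): Unit = nowMs += ms  // advance the fake clock, no real waiting
}

// Hypothetical constructor: the point is only that the clock is injected, not hard-coded.
class ByteBoundedBlockingQueueStub[T](maxBytes: Long, time: Time)

object TimeInjectionSketch extends App {
  val mockTime = new MockTime()
  val queue = new ByteBoundedBlockingQueueStub[String](1024L, mockTime)
  mockTime.sleep(500)             // deterministically advance time in the test
  println(mockTime.milliseconds)  // 500
}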


core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala


Unused



core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala


if



core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala


No need for the return; you can add on line 63:
else {
  false
}

(and remove the false at the very end)

Equivalent, but a little cleaner to look at



core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala


Again, this is obviously stylistic, but in small methods like this there is 
little need to return from the middle.

Can you restructure it to something like:

if (...)
  false
else {
  ...
  success
}
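
To make the suggestion concrete, here is a rough sketch of an offer() with a 
single exit point (hypothetical fields and simplified logic, not the actual patch):

import java.util.concurrent.LinkedBlockingQueue
import java.util.concurrent.atomic.AtomicLong

class ByteBoundedQueueSketch[T](maxMessages: Int, maxBytes: Long, sizeOf: T => Long) {
  private val queue = new LinkedBlockingQueue[T](maxMessages)
  private val currentBytes = new AtomicLong(0L)

  def offer(item: T): Boolean = {
    val itemBytes = sizeOf(item)
    // Note: the byte check and the enqueue are not atomic here; fine for a sketch.
    if (currentBytes.get + itemBytes > maxBytes)
      false                              // over the byte bound: reject
    else {
      val success = queue.offer(item)    // may still fail on the message-count bound
      if (success) currentBytes.addAndGet(itemBytes)
      success                            // single exit point at the end
    }
  }
}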



core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala


Same here


- Joel Koshy


On Oct. 27, 2014, 6:50 a.m., Jiangjie Qin wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/26755/
> ---
> 
> (Updated Oct. 27, 2014, 6:50 a.m.)
> 
> 
> Review request for kafka.
> 
> 
> Bugs: KAFKA-1706
> https://issues.apache.org/jira/browse/KAFKA-1706
> 
> 
> Repository: kafka
> 
> 
> Description
> ---
> 
> changed arguments name
> 
> 
> correct typo.
> 
> 
> Incorporated Joel's comments. Also fixed negative queue size problem.
> 
> 
> Diffs
> -
> 
>   core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala PRE-CREATION 
> 
> Diff: https://reviews.apache.org/r/26755/diff/
> 
> 
> Testing
> ---
> 
> 
> Thanks,
> 
> Jiangjie Qin
> 
>



Re: [ANNOUNCEMENT] Apache Kafka 0.8.2-beta Released

2014-10-27 Thread Jay Kreps
Yeah it must be a caching thing because others in the same office do see it
(but not all). And ctrl-shift-r doesn't seem to help. Nevermind :-)

-Jay

On Mon, Oct 27, 2014 at 6:00 PM, Gwen Shapira  wrote:

> Strange. I'm seeing it.
>
> Browser cache?
>
> On Mon, Oct 27, 2014 at 5:59 PM, Jay Kreps  wrote:
> > I actually don't see the beta release on that download page:
> > http://kafka.apache.org/downloads.html
> >
> > -Jay
> >
> > On Mon, Oct 27, 2014 at 5:50 PM, Joe Stein  wrote:
> >
> >> The Apache Kafka community is pleased to announce the beta release for
> >> Apache Kafka 0.8.2.
> >>
> >> The 0.8.2-beta release introduces many new features, improvements and
> >> fixes including:
> >>  - A new Java producer for ease of implementation and enhanced
> performance.
> >>  - Delete topic support.
> >>  - Per topic configuration of preference for consistency over
> availability.
> >>  - Scala 2.11 support and dropping support for Scala 2.8.
> >>  - LZ4 Compression.
> >>
> >> All of the changes in this release can be found:
> >> https://archive.apache.org/dist/kafka/0.8.2-beta/RELEASE_NOTES.html
> >>
> >> Apache Kafka is high-throughput, publish-subscribe messaging system
> >> rethought of as a distributed commit log.
> >>
> >> ** Fast => A single Kafka broker can handle hundreds of megabytes of
> reads
> >> and
> >> writes per second from thousands of clients.
> >>
> >> ** Scalable => Kafka is designed to allow a single cluster to serve as
> the
> >> central data backbone
> >> for a large organization. It can be elastically and transparently
> expanded
> >> without downtime.
> >> Data streams are partitioned and spread over a cluster of machines to
> >> allow data streams
> >> larger than the capability of any single machine and to allow clusters
> of
> >> co-ordinated consumers.
> >>
> >> ** Durable => Messages are persisted on disk and replicated within the
> >> cluster to prevent
> >> data loss. Each broker can handle terabytes of messages without
> >> performance impact.
> >>
> >> ** Distributed by Design => Kafka has a modern cluster-centric design
> that
> >> offers
> >> strong durability and fault-tolerance guarantees.
> >>
> >> You can download the release from:
> http://kafka.apache.org/downloads.html
> >>
> >> We welcome your help and feedback. For more information on how to
> >> report problems, and to get involved, visit the project website at
> >> http://kafka.apache.org/
> >>
> >>
>


Re: Review Request 27256: Fix KAFKA-1735

2014-10-27 Thread Neha Narkhede

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/27256/#review58744
---



clients/src/main/java/org/apache/kafka/common/record/MemoryRecords.java


Would it be possible to add a unit test for this?


- Neha Narkhede


On Oct. 27, 2014, 11:59 p.m., Guozhang Wang wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/27256/
> ---
> 
> (Updated Oct. 27, 2014, 11:59 p.m.)
> 
> 
> Review request for kafka.
> 
> 
> Bugs: KAFKA-1735
> https://issues.apache.org/jira/browse/KAFKA-1735
> 
> 
> Repository: kafka
> 
> 
> Description
> ---
> 
> Handle partial reads from compressed stream
> 
> 
> Diffs
> -
> 
>   clients/src/main/java/org/apache/kafka/common/record/MemoryRecords.java 
> 040e5b91005edb8f015afdfa76fd94e0bf3cb4ca 
> 
> Diff: https://reviews.apache.org/r/27256/diff/
> 
> 
> Testing
> ---
> 
> 
> Thanks,
> 
> Guozhang Wang
> 
>
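
Background note on the description above: a single read() from a decompression 
stream (for example GZIPInputStream) may return fewer bytes than requested, so 
correct code has to loop until it has the bytes it asked for or hits end of 
stream. A minimal sketch of such a read-fully loop (illustrative only, not the 
actual MemoryRecords change):

import java.io.InputStream

object ReadFullySketch {
  // Keep reading until `length` bytes have been filled or EOF is reached.
  def readFully(in: InputStream, buf: Array[Byte], offset: Int, length: Int): Int = {
    var total = 0
    var eof = false
    while (total < length && !eof) {
      val n = in.read(buf, offset + total, length - total)
      if (n < 0) eof = true else total += n
    }
    total  // may be less than length only if EOF was reached
  }
}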



[jira] [Commented] (KAFKA-1731) add config/jmx changes in 0.8.2 doc

2014-10-27 Thread Gwen Shapira (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186179#comment-14186179
 ] 

Gwen Shapira commented on KAFKA-1731:
-

Thanks :)

No comments, it looks good. 

> add config/jmx changes in 0.8.2 doc
> ---
>
> Key: KAFKA-1731
> URL: https://issues.apache.org/jira/browse/KAFKA-1731
> Project: Kafka
>  Issue Type: Sub-task
>Reporter: Jun Rao
>Assignee: Jun Rao
> Fix For: 0.8.2
>
> Attachments: config-jmx_082.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: [ANNOUNCEMENT] Apache Kafka 0.8.2-beta Released

2014-10-27 Thread Jun Rao
Joe,

Thanks for driving the release.

Jun

On Mon, Oct 27, 2014 at 5:50 PM, Joe Stein  wrote:

> The Apache Kafka community is pleased to announce the beta release for
> Apache Kafka 0.8.2.
>
> The 0.8.2-beta release introduces many new features, improvements and
> fixes including:
>  - A new Java producer for ease of implementation and enhanced performance.
>  - Delete topic support.
>  - Per topic configuration of preference for consistency over availability.
>  - Scala 2.11 support and dropping support for Scala 2.8.
>  - LZ4 Compression.
>
> All of the changes in this release can be found:
> https://archive.apache.org/dist/kafka/0.8.2-beta/RELEASE_NOTES.html
>
> Apache Kafka is high-throughput, publish-subscribe messaging system
> rethought of as a distributed commit log.
>
> ** Fast => A single Kafka broker can handle hundreds of megabytes of reads
> and
> writes per second from thousands of clients.
>
> ** Scalable => Kafka is designed to allow a single cluster to serve as the
> central data backbone
> for a large organization. It can be elastically and transparently expanded
> without downtime.
> Data streams are partitioned and spread over a cluster of machines to
> allow data streams
> larger than the capability of any single machine and to allow clusters of
> co-ordinated consumers.
>
> ** Durable => Messages are persisted on disk and replicated within the
> cluster to prevent
> data loss. Each broker can handle terabytes of messages without
> performance impact.
>
> ** Distributed by Design => Kafka has a modern cluster-centric design that
> offers
> strong durability and fault-tolerance guarantees.
>
> You can download the release from: http://kafka.apache.org/downloads.html
>
> We welcome your help and feedback. For more information on how to
> report problems, and to get involved, visit the project website at
> http://kafka.apache.org/
>
>


Re: Review Request 26373: Patch for KAFKA-1647

2014-10-27 Thread Joel Koshy

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26373/#review58748
---

Ship it!


Looks good to me. Can you make these final edits and upload another RB?


core/src/main/scala/kafka/server/ReplicaManager.scala


The .format needs to be on this line. Can you fix it and upload a new patch?



core/src/main/scala/kafka/server/ReplicaManager.scala


Small edits:

// Create the local replica even if the leader is unavailable. This is 
required to ensure that we include the partition's high watermark in the 
checkpoint file (see KAFKA-1647)

Also, I'm not sure if we need to explicitly reference the jira in comments 
since people can just git annotate.


- Joel Koshy


On Oct. 28, 2014, 12:20 a.m., Jiangjie Qin wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/26373/
> ---
> 
> (Updated Oct. 28, 2014, 12:20 a.m.)
> 
> 
> Review request for kafka.
> 
> 
> Bugs: KAFKA-1647
> https://issues.apache.org/jira/browse/KAFKA-1647
> 
> 
> Repository: kafka
> 
> 
> Description
> ---
> 
> Addressed Joel's comments.
> 
> 
> the version 2 code seems to be submitted by mistake... This should be the 
> code for review that addressed Joel's comments.
> 
> 
> Addressed Jun's comments. Will do tests to verify if it works.
> 
> 
> Addressed Joel's comments; we do not need to check whether the leader exists 
> or not when adding the fetcher.
> 
> 
> Diffs
> -
> 
>   core/src/main/scala/kafka/server/ReplicaManager.scala 
> 78b7514cc109547c562e635824684fad581af653 
> 
> Diff: https://reviews.apache.org/r/26373/diff/
> 
> 
> Testing
> ---
> 
> Followed Joel's testing steps. I was able to reproduce the problem without the 
> patch, and the WARN message goes away after applying the patch.
> 
> 
> Thanks,
> 
> Jiangjie Qin
> 
>



Re: Review Request 26755: Patch for KAFKA-1706

2014-10-27 Thread Jiangjie Qin

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26755/
---

(Updated Oct. 28, 2014, 1:34 a.m.)


Review request for kafka.


Bugs: KAFKA-1706
https://issues.apache.org/jira/browse/KAFKA-1706


Repository: kafka


Description (updated)
---

changed arguments name


correct typo.


Incorporated Joel's comments. Also fixed negative queue size problem.


Incorporated Joel's comments.


Diffs (updated)
-

  core/src/main/scala/kafka/utils/ByteBoundedBlockingQueue.scala PRE-CREATION 

Diff: https://reviews.apache.org/r/26755/diff/


Testing
---


Thanks,

Jiangjie Qin



[jira] [Updated] (KAFKA-1706) Adding a byte bounded blocking queue to util.

2014-10-27 Thread Jiangjie Qin (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jiangjie Qin updated KAFKA-1706:

Attachment: KAFKA-1706_2014-10-27_18:34:37.patch

> Adding a byte bounded blocking queue to util.
> -
>
> Key: KAFKA-1706
> URL: https://issues.apache.org/jira/browse/KAFKA-1706
> Project: Kafka
>  Issue Type: Improvement
>Reporter: Jiangjie Qin
>Assignee: Jiangjie Qin
> Attachments: KAFKA-1706.patch, KAFKA-1706_2014-10-15_09:26:26.patch, 
> KAFKA-1706_2014-10-15_09:28:01.patch, KAFKA-1706_2014-10-26_23:47:31.patch, 
> KAFKA-1706_2014-10-26_23:50:07.patch, KAFKA-1706_2014-10-27_18:34:37.patch
>
>
> We saw many out of memory issues in Mirror Maker. To enhance memory 
> management we want to introduce a ByteBoundedBlockingQueue that has limit on 
> both number of messages and number of bytes in it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-1706) Adding a byte bounded blocking queue to util.

2014-10-27 Thread Jiangjie Qin (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186196#comment-14186196
 ] 

Jiangjie Qin commented on KAFKA-1706:
-

Updated reviewboard https://reviews.apache.org/r/26755/diff/
 against branch origin/trunk

> Adding a byte bounded blocking queue to util.
> -
>
> Key: KAFKA-1706
> URL: https://issues.apache.org/jira/browse/KAFKA-1706
> Project: Kafka
>  Issue Type: Improvement
>Reporter: Jiangjie Qin
>Assignee: Jiangjie Qin
> Attachments: KAFKA-1706.patch, KAFKA-1706_2014-10-15_09:26:26.patch, 
> KAFKA-1706_2014-10-15_09:28:01.patch, KAFKA-1706_2014-10-26_23:47:31.patch, 
> KAFKA-1706_2014-10-26_23:50:07.patch, KAFKA-1706_2014-10-27_18:34:37.patch
>
>
> We saw many out of memory issues in Mirror Maker. To enhance memory 
> management we want to introduce a ByteBoundedBlockingQueue that has limit on 
> both number of messages and number of bytes in it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


RE: [ANNOUNCEMENT] Apache Kafka 0.8.2-beta Released

2014-10-27 Thread Libo Yu
Congrats! When do you think the final 0.8.2 will be released?

> To: annou...@apache.org; us...@kafka.apache.org; dev@kafka.apache.org
> Subject: [ANNOUNCEMENT] Apache Kafka 0.8.2-beta Released
> Date: Tue, 28 Oct 2014 00:50:35 +
> From: joest...@apache.org
> 
> The Apache Kafka community is pleased to announce the beta release for Apache 
> Kafka 0.8.2.
> 
> The 0.8.2-beta release introduces many new features, improvements and fixes 
> including:
>  - A new Java producer for ease of implementation and enhanced performance.
>  - Delete topic support.
>  - Per topic configuration of preference for consistency over availability.
>  - Scala 2.11 support and dropping support for Scala 2.8.
>  - LZ4 Compression.
> 
> All of the changes in this release can be found: 
> https://archive.apache.org/dist/kafka/0.8.2-beta/RELEASE_NOTES.html
> 
> Apache Kafka is high-throughput, publish-subscribe messaging system rethought 
> of as a distributed commit log.
> 
> ** Fast => A single Kafka broker can handle hundreds of megabytes of reads 
> and 
> writes per second from thousands of clients.
> 
> ** Scalable => Kafka is designed to allow a single cluster to serve as the 
> central data backbone 
> for a large organization. It can be elastically and transparently expanded 
> without downtime. 
> Data streams are partitioned and spread over a cluster of machines to allow 
> data streams 
> larger than the capability of any single machine and to allow clusters of 
> co-ordinated consumers.
> 
> ** Durable => Messages are persisted on disk and replicated within the 
> cluster to prevent 
> data loss. Each broker can handle terabytes of messages without performance 
> impact.
> 
> ** Distributed by Design => Kafka has a modern cluster-centric design that 
> offers 
> strong durability and fault-tolerance guarantees.
> 
> You can download the release from: http://kafka.apache.org/downloads.html
> 
> We welcome your help and feedback. For more information on how to
> report problems, and to get involved, visit the project website at 
> http://kafka.apache.org/
> 
  

[jira] [Updated] (KAFKA-1732) DumpLogSegments tool fails when path has a '.'

2014-10-27 Thread Joe Stein (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joe Stein updated KAFKA-1732:
-
Fix Version/s: 0.8.2

> DumpLogSegments tool fails when path has a '.'
> --
>
> Key: KAFKA-1732
> URL: https://issues.apache.org/jira/browse/KAFKA-1732
> Project: Kafka
>  Issue Type: Bug
>  Components: tools
>Affects Versions: 0.8.1.1
>Reporter: Ewen Cheslack-Postava
>Assignee: Ewen Cheslack-Postava
>Priority: Minor
> Fix For: 0.8.2
>
> Attachments: KAFKA-1732.patch
>
>
> Using DumpLogSegments in a directory that has a '.' that isn't part of the 
> file extension causes an exception:
> {code}
> 16:48 $ time /Users/ewencp/kafka.git/bin/kafka-run-class.sh 
> kafka.tools.DumpLogSegments  --file 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
>  --verify-index-only
> Dumping 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
> Exception in thread "main" java.io.FileNotFoundException: 
> /Users/ewencp/kafka.log (No such file or directory)
>   at java.io.FileInputStream.open(Native Method)
>   at java.io.FileInputStream.<init>(FileInputStream.java:146)
>   at kafka.utils.Utils$.openChannel(Utils.scala:162)
>   at kafka.log.FileMessageSet.<init>(FileMessageSet.scala:74)
>   at 
> kafka.tools.DumpLogSegments$.kafka$tools$DumpLogSegments$$dumpIndex(DumpLogSegments.scala:109)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:80)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:73)
>   at 
> scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
>   at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
>   at kafka.tools.DumpLogSegments$.main(DumpLogSegments.scala:73)
>   at kafka.tools.DumpLogSegments.main(DumpLogSegments.scala)
> {code}
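
The FileNotFoundException for /Users/ewencp/kafka.log suggests the path is being 
split at the first '.' in the full path rather than at the file extension; a 
safer derivation might look like this (illustrative sketch only, not the actual 
KAFKA-1732 patch):

object LogFileFromIndexSketch {
  // Given ".../00016895.index", derive the sibling ".../00016895.log".
  // Split on the last '.' so earlier dots in the path (e.g. "kafka.git") are ignored.
  def logFileFor(indexPath: String): String = {
    val dot = indexPath.lastIndexOf('.')
    require(dot >= 0 && indexPath.substring(dot) == ".index", "not an .index file: " + indexPath)
    indexPath.substring(0, dot) + ".log"
  }
}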



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-1732) DumpLogSegments tool fails when path has a '.'

2014-10-27 Thread Neha Narkhede (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186334#comment-14186334
 ] 

Neha Narkhede commented on KAFKA-1732:
--

Thanks [~charmalloc]. Missed updating the version myself.

> DumpLogSegments tool fails when path has a '.'
> --
>
> Key: KAFKA-1732
> URL: https://issues.apache.org/jira/browse/KAFKA-1732
> Project: Kafka
>  Issue Type: Bug
>  Components: tools
>Affects Versions: 0.8.1.1
>Reporter: Ewen Cheslack-Postava
>Assignee: Ewen Cheslack-Postava
>Priority: Minor
> Fix For: 0.8.2
>
> Attachments: KAFKA-1732.patch
>
>
> Using DumpLogSegments in a directory that has a '.' that isn't part of the 
> file extension causes an exception:
> {code}
> 16:48 $ time /Users/ewencp/kafka.git/bin/kafka-run-class.sh 
> kafka.tools.DumpLogSegments  --file 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
>  --verify-index-only
> Dumping 
> /Users/ewencp/kafka.git/system_test/replication_testsuite/testcase_1/logs/broker-3/kafka_server_3_logs/test_1-1/00016895.index
> Exception in thread "main" java.io.FileNotFoundException: 
> /Users/ewencp/kafka.log (No such file or directory)
>   at java.io.FileInputStream.open(Native Method)
>   at java.io.FileInputStream.<init>(FileInputStream.java:146)
>   at kafka.utils.Utils$.openChannel(Utils.scala:162)
>   at kafka.log.FileMessageSet.<init>(FileMessageSet.scala:74)
>   at 
> kafka.tools.DumpLogSegments$.kafka$tools$DumpLogSegments$$dumpIndex(DumpLogSegments.scala:109)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:80)
>   at 
> kafka.tools.DumpLogSegments$$anonfun$main$1.apply(DumpLogSegments.scala:73)
>   at 
> scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
>   at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
>   at kafka.tools.DumpLogSegments$.main(DumpLogSegments.scala:73)
>   at kafka.tools.DumpLogSegments.main(DumpLogSegments.scala)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KAFKA-1710) [New Java Producer Potential Deadlock] Producer Deadlock when all messages is being sent to single partition

2014-10-27 Thread Bhavesh Mistry (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186350#comment-14186350
 ] 

Bhavesh Mistry commented on KAFKA-1710:
---

[~jkreps],

I understand that the current code base adds bytes to shared memory and does the 
compression on the application thread.  The old producer seems to do all of this 
in a background thread.  So what changed to move this into the foreground?  Also, 
if you had to re-engineer this code, how would you remove the synchronization and 
move everything into the background, so that application threads spend more time 
runnable and the cost of enqueueing becomes very small?

I am really interested in solving this problem for my application, so I just 
wanted to know your suggestions/ideas: how would you solve this?

Thanks for all your help so far !!  

Thanks,

Bhavesh 

> [New Java Producer Potential Deadlock] Producer Deadlock when all messages is 
> being sent to single partition
> 
>
> Key: KAFKA-1710
> URL: https://issues.apache.org/jira/browse/KAFKA-1710
> Project: Kafka
>  Issue Type: Bug
>  Components: producer 
> Environment: Development
>Reporter: Bhavesh Mistry
>Assignee: Ewen Cheslack-Postava
>Priority: Critical
>  Labels: performance
> Attachments: Screen Shot 2014-10-13 at 10.19.04 AM.png, Screen Shot 
> 2014-10-15 at 9.09.06 PM.png, Screen Shot 2014-10-15 at 9.14.15 PM.png, 
> TestNetworkDownProducer.java, th1.dump, th10.dump, th11.dump, th12.dump, 
> th13.dump, th14.dump, th15.dump, th2.dump, th3.dump, th4.dump, th5.dump, 
> th6.dump, th7.dump, th8.dump, th9.dump
>
>
> Hi Kafka Dev Team,
> When I run the test to send message to single partition for 3 minutes or so 
> on, I have encounter deadlock (please see the screen attached) and thread 
> contention from YourKit profiling.  
> Use Case:
> 1)  Aggregating messages into same partition for metric counting. 
> 2)  Replicate Old Producer behavior for sticking to partition for 3 minutes.
> Here is output:
> Frozen threads found (potential deadlock)
>  
> It seems that the following threads have not changed their stack for more 
> than 10 seconds.
> These threads are possibly (but not necessarily!) in a deadlock or hung.
>  
> pool-1-thread-128 <--- Frozen for at least 2m
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor$Worker) 
> ThreadPoolExecutor.java:1145
> java.util.concurrent.ThreadPoolExecutor$Worker.run() 
> ThreadPoolExecutor.java:615
> java.lang.Thread.run() Thread.java:744
> pool-1-thread-159 <--- Frozen for at least 2m 1 sec
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor$Worker) 
> ThreadPoolExecutor.java:1145
> java.util.concurrent.ThreadPoolExecutor$Worker.run() 
> ThreadPoolExecutor.java:615
> java.lang.Thread.run() Thread.java:744
> pool-1-thread-55 <--- Frozen for at least 2m
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor$Worker) 
> ThreadPoolExecutor.java:1145
> java.util.concurrent.ThreadPoolExecutor$Worker.run() 
> ThreadPoolExecutor.java:615
> java.lang.Thread.run() Thread.java:744
> Thanks,
> Bhavesh 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (KAFKA-1710) [New Java Producer Potential Deadlock] Producer Deadlock when all messages is being sent to single partition

2014-10-27 Thread Bhavesh Mistry (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186350#comment-14186350
 ] 

Bhavesh Mistry edited comment on KAFKA-1710 at 10/28/14 4:40 AM:
-

[~jkreps],

I understand the current code base is adding bytes to shared memory and doing 
compression (on application thread).  The older consumer seems to do all this 
in back-ground thread.  So What changed to have this in fore-ground ?  Also, if 
you had to re-engineer this code, How would you  re-engineer to remove 
Synchronization and move everything in background so more runable state is give 
to Application Thread and cost of enqueue will very less.  (Of Course at cost 
of memory).  

I am really interested in solving this problem for my application.  So I just 
wanted to know your suggestions/ideas, how would you solve this ?

Thanks for all your help so far !!  

Thanks,

Bhavesh 


was (Author: bmis13):
[~jkreps],

I understand the current code base is adding bytes to shared memory and doing 
compression (on application thread).  The older consumer seems to do all this 
in back-ground thread.  So What changed to have this in fore-ground ?  Also, if 
you had to re-engineer this code, How would you  re-engineer to remove 
Synchronization and move everything in background so more runable state is give 
to Application Thread and cost of enqueue will very less.  

I am really interested in solving this problem for my application.  So I just 
wanted to know your suggestions/ideas, how would you solve this ?

Thanks for all your help so far !!  

Thanks,

Bhavesh 

> [New Java Producer Potential Deadlock] Producer Deadlock when all messages is 
> being sent to single partition
> 
>
> Key: KAFKA-1710
> URL: https://issues.apache.org/jira/browse/KAFKA-1710
> Project: Kafka
>  Issue Type: Bug
>  Components: producer 
> Environment: Development
>Reporter: Bhavesh Mistry
>Assignee: Ewen Cheslack-Postava
>Priority: Critical
>  Labels: performance
> Attachments: Screen Shot 2014-10-13 at 10.19.04 AM.png, Screen Shot 
> 2014-10-15 at 9.09.06 PM.png, Screen Shot 2014-10-15 at 9.14.15 PM.png, 
> TestNetworkDownProducer.java, th1.dump, th10.dump, th11.dump, th12.dump, 
> th13.dump, th14.dump, th15.dump, th2.dump, th3.dump, th4.dump, th5.dump, 
> th6.dump, th7.dump, th8.dump, th9.dump
>
>
> Hi Kafka Dev Team,
> When I run the test to send message to single partition for 3 minutes or so 
> on, I have encounter deadlock (please see the screen attached) and thread 
> contention from YourKit profiling.  
> Use Case:
> 1)  Aggregating messages into same partition for metric counting. 
> 2)  Replicate Old Producer behavior for sticking to partition for 3 minutes.
> Here is output:
> Frozen threads found (potential deadlock)
>  
> It seems that the following threads have not changed their stack for more 
> than 10 seconds.
> These threads are possibly (but not necessarily!) in a deadlock or hung.
>  
> pool-1-thread-128 <--- Frozen for at least 2m
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor$Worker) 
> ThreadPoolExecutor.java:1145
> java.util.concurrent.ThreadPoolExecutor$Worker.run() 
> ThreadPoolExecutor.java:615
> java.lang.Thread.run() Thread.java:744
> pool-1-thread-159 <--- Frozen for at least 2m 1 sec
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor$Worker) 
> ThreadPoolExecutor.java:1145
> java.util.concurrent.ThreadPoolExecutor$Worker.run() 
> ThreadPoolExecutor.java:615
> java.lang.Thread.run() Thread.java:744
> pool-1-thread-55 <--- Frozen for at least 2m
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurre

[jira] [Comment Edited] (KAFKA-1710) [New Java Producer Potential Deadlock] Producer Deadlock when all messages is being sent to single partition

2014-10-27 Thread Bhavesh Mistry (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14186350#comment-14186350
 ] 

Bhavesh Mistry edited comment on KAFKA-1710 at 10/28/14 4:58 AM:
-

[~jkreps],

I understand that the current code base adds bytes to shared memory and does the 
compression on the application thread.  The old producer seems to do all of this 
in a background thread.  So what changed to move this into the foreground?  Also, 
if you had to re-engineer this code, how would you remove the synchronization and 
move everything into the background, so that application threads spend more time 
runnable and the cost of enqueueing becomes very small (of course at the cost of 
memory)?

I am really interested in solving this problem for my application, so I just 
wanted to know your suggestions/ideas: how would you solve this?

Thanks for all your help so far!!  The only thing I can think of is an 
*AsynKafkaProducer*, as mentioned in previous comments, where [~ewencp] pointed 
out that the problem will then be the threads that enqueue messages, at the cost 
of memory, thread context switching, etc...

Thanks,

Bhavesh 
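
One generic way to do the "move everything to the background" idea mentioned 
above (an illustrative wrapper only, not an API that exists in Kafka): 
application threads enqueue onto a plain bounded queue and a single dedicated 
thread calls the real producer, trading extra memory and a context switch for 
less contention on the shared accumulator:

import java.util.concurrent.{ArrayBlockingQueue, TimeUnit}

// Hypothetical wrapper; `send` would typically delegate to KafkaProducer.send(...).
class AsyncSendWrapper[R <: AnyRef](send: R => Unit, capacity: Int = 10000) {
  private val pending = new ArrayBlockingQueue[R](capacity)
  @volatile private var running = true

  private val drainer = new Thread(new Runnable {
    override def run(): Unit = {
      // Drain until close() is called and the queue is empty.
      while (running || !pending.isEmpty) {
        val r = pending.poll(100, TimeUnit.MILLISECONDS)
        if (r != null) send(r)
      }
    }
  }, "async-send-drainer")
  drainer.setDaemon(true)
  drainer.start()

  // Cheap, non-blocking call for application threads; returns false when the buffer is full.
  def enqueue(record: R): Boolean = pending.offer(record)

  def close(): Unit = { running = false; drainer.join() }
}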


was (Author: bmis13):
[~jkreps],

I understand the current code base is adding bytes to shared memory and doing 
compression (on application thread).  The older consumer seems to do all this 
in back-ground thread.  So What changed to have this in fore-ground ?  Also, if 
you had to re-engineer this code, How would you  re-engineer to remove 
Synchronization and move everything in background so more runable state is give 
to Application Thread and cost of enqueue will very less.  (Of Course at cost 
of memory).  

I am really interested in solving this problem for my application.  So I just 
wanted to know your suggestions/ideas, how would you solve this ?

Thanks for all your help so far !!  

Thanks,

Bhavesh 

> [New Java Producer Potential Deadlock] Producer Deadlock when all messages is 
> being sent to single partition
> 
>
> Key: KAFKA-1710
> URL: https://issues.apache.org/jira/browse/KAFKA-1710
> Project: Kafka
>  Issue Type: Bug
>  Components: producer 
> Environment: Development
>Reporter: Bhavesh Mistry
>Assignee: Ewen Cheslack-Postava
>Priority: Critical
>  Labels: performance
> Attachments: Screen Shot 2014-10-13 at 10.19.04 AM.png, Screen Shot 
> 2014-10-15 at 9.09.06 PM.png, Screen Shot 2014-10-15 at 9.14.15 PM.png, 
> TestNetworkDownProducer.java, th1.dump, th10.dump, th11.dump, th12.dump, 
> th13.dump, th14.dump, th15.dump, th2.dump, th3.dump, th4.dump, th5.dump, 
> th6.dump, th7.dump, th8.dump, th9.dump
>
>
> Hi Kafka Dev Team,
> When I run the test to send message to single partition for 3 minutes or so 
> on, I have encounter deadlock (please see the screen attached) and thread 
> contention from YourKit profiling.  
> Use Case:
> 1)  Aggregating messages into same partition for metric counting. 
> 2)  Replicate Old Producer behavior for sticking to partition for 3 minutes.
> Here is output:
> Frozen threads found (potential deadlock)
>  
> It seems that the following threads have not changed their stack for more 
> than 10 seconds.
> These threads are possibly (but not necessarily!) in a deadlock or hung.
>  
> pool-1-thread-128 <--- Frozen for at least 2m
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor$Worker) 
> ThreadPoolExecutor.java:1145
> java.util.concurrent.ThreadPoolExecutor$Worker.run() 
> ThreadPoolExecutor.java:615
> java.lang.Thread.run() Thread.java:744
> pool-1-thread-159 <--- Frozen for at least 2m 1 sec
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionType, Callback) RecordAccumulator.java:139
> org.apache.kafka.clients.producer.KafkaProducer.send(ProducerRecord, 
> Callback) KafkaProducer.java:237
> org.kafka.test.TestNetworkDownProducer$MyProducer.run() 
> TestNetworkDownProducer.java:84
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor$Worker) 
> ThreadPoolExecutor.java:1145
> java.util.concurrent.ThreadPoolExecutor$Worker.run() 
> ThreadPoolExecutor.java:615
> java.lang.Thread.run() Thread.java:744
> pool-1-thread-55 <--- Frozen for at least 2m
> org.apache.kafka.clients.producer.internals.RecordAccumulator.append(TopicPartition,
>  byte[], byte[], CompressionTyp