Storm MBeans
Hi all, I know Kafka automatically exposes MBeans over JMX, but Storm doesn't seem to. I wonder if anyone has experience using JConsole to read Storm's built-in metrics through MBeans, or whether I will have to write a separate metrics consumer to register the metrics as MBeans. Is there such source code available? Thanks, AL
install kafka and hadoop on the same cluster?
Hi all, I have a 9-node cluster where I have already installed Cloudera Hadoop/Spark, and now I want to install Kafka on this cluster too. Is it a good idea to install Kafka on each of the 9 nodes? If so, are there any potential risks? I am also thinking of installing Cassandra on each of these nodes, so basically all components would sit on the same nodes. Would that be OK? Thanks, AL
gradle building error
Hi all, I am getting the following error when building Kafka:

* Where:
Build file '/usr/local/kafka/build.gradle' line: 164

* What went wrong:
A problem occurred evaluating root project 'kafka'.
> Could not find property 'ScalaPlugin' on project ':clients'.

I tried searching online but couldn't find a solution for this. Has anyone had a similar issue before? Thanks, SL
Re: gradle building error
Hi Guozhang, I re-installed Gradle and it works now, thanks a lot. SL > On Dec 9, 2015, at 3:47 PM, Guozhang Wang <wangg...@gmail.com> wrote: > > Sa, > > Which command line did you use under what path? > > Guozhang > > On Wed, Dec 9, 2015 at 1:57 PM, Sa Li <sal...@gmail.com> wrote: > >> Hi, All >> >> I am having such error to build Kafka, >> >> * Where: >> Build file '/usr/local/kafka/build.gradle' line: 164 >> >> * What went wrong: >> A problem occurred evaluating root project 'kafka'. >>> Could not find property 'ScalaPlugin' on project ':clients'. >> >> >> I try to search online, but can't even find a solution for this, anyone had >> similar issues before? >> >> thanks >> >> SL >> > > > > -- > -- Guozhang
kafka-python question
Hi all, I have a question about the kafka-python producer. Here is the record I have:

id (uuid) | sensor_id (character) | timestamp | period (int) | current (numeric) | date_received | factor (bigint)
75da661c-bd5c-40e3-8691-9034f34262e3 | ff0057 | 2013-03-21 11:44:00-07 | 60 | 0.1200 | 2013-03-26 14:40:51.829-07 | 7485985

I am reading data from a database and publishing it to Kafka, but I am getting a serialization error on the timestamp and decimal fields; I can't just publish each record as a list. I am thinking of converting each record to a JSON object first. Before I do that, does anyone know a more straightforward way to publish records directly, so that a Kafka consumer can read each record as a list or dictionary? Thanks, AL
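A minimal sketch of the JSON route, assuming kafka-python's KafkaProducer and its value_serializer parameter (the topic name and broker address below are made up): a default encoder handles the uuid, Decimal, and datetime fields that json cannot serialize on its own.

```python
import json
import uuid
import datetime
from decimal import Decimal

def json_default(obj):
    """Fallback encoder for types json can't serialize natively."""
    if isinstance(obj, uuid.UUID):
        return str(obj)
    if isinstance(obj, Decimal):
        return float(obj)  # or str(obj) to avoid any precision loss
    if isinstance(obj, datetime.datetime):
        return obj.isoformat()
    raise TypeError("unserializable type: %r" % type(obj))

def encode_record(record):
    """Turn a DB row (as a dict) into UTF-8 JSON bytes for Kafka."""
    return json.dumps(record, default=json_default).encode("utf-8")

# Example row shaped like the one in the post:
record = {
    "id": uuid.UUID("75da661c-bd5c-40e3-8691-9034f34262e3"),
    "sensor_id": "ff0057",
    "period": 60,
    "current": Decimal("0.1200"),
    "factor": 7485985,
}
payload = encode_record(record)

# With kafka-python, the encoder can be wired in directly
# (broker address and topic name are illustrative):
# from kafka import KafkaProducer
# producer = KafkaProducer(bootstrap_servers="broker:9092",
#                          value_serializer=encode_record)
# producer.send("sensor-readings", record)
```

On the consumer side, json.loads on the message value gives back a dictionary, which answers the "read each record as a dict" part; there is no built-in way to ship a Python list/dict without some serialization step.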
connection refused between two VMs
Hi all, I have set up Kafka clusters on physical servers before. Currently I set up two VMs and fire up one broker on each VM (brokers 0 and 2). I created a topic test-rep-1:

Topic: test-rep-1  PartitionCount: 2  ReplicationFactor: 1  Configs:
  Topic: test-rep-1  Partition: 0  Leader: 0  Replicas: 0  Isr: 0
  Topic: test-rep-1  Partition: 1  Leader: 2  Replicas: 2  Isr: 2

However, when I ran the producer test with this command:

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test-rep-1 5000 1000 -1 acks=1 bootstrap.servers=sa-vm1:9092 buffer.memory=67108864 batch.size=8196

I kept getting connection refused errors like:

[2015-05-14 15:05:13,961] WARN Error in I/O with sa-vm2/10.43.34.143 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
        at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
        at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
        at org.apache.kafka.common.network.Selector.poll(Selector.java:238)
        at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:192)
        at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:191)
        at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:135)
        at java.lang.Thread.run(Thread.java:745)

(the same warning and stack trace repeat at 15:05:13,972). Any clue how to solve the connection issue between the VMs? Thanks, AL
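Before digging into Kafka configs, it is worth confirming plain TCP reachability from the producer VM to each broker, since "connection refused" usually means nothing is listening on that host:port or a firewall is rejecting the connection. A small sketch; the hostnames in the comment are the ones from the post, used only as illustration:

```python
import socket

def can_connect(host, port, timeout=3.0):
    """Return True if a plain TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

# Illustrative check against the two VM brokers:
# for host in ("sa-vm1", "sa-vm2"):
#     print(host, can_connect(host, 9092))
```

If this fails for sa-vm2, it points at an iptables/firewall rule, the broker not actually running there, or the broker binding/advertising a different address in server.properties, rather than at the producer itself.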
still too many open files errors for kafka web console
Hi all, I'd like to use kafka-web-console to monitor offsets/topics. It is easy to use; however, it freezes or dies far too frequently. I don't think it's a problem at the OS level; it seems to be a problem at the application level. I've already raised the open file limit to 98000 for all users and set time_waits to 30s instead of the default 5 minutes. From what I can see in the logs, it starts with Play:

[error] play - Cannot invoke the action, eventually got an error: java.lang.RuntimeException: Exception while executing statement : IO Exception: java.io.IOException: Too many open files; /etc/kafka-web-console/play; SQL statement: delete from offsetPoints where (offsetPoints.offsetHistoryId = ?) [90031-172] errorCode: 90031, sqlState: 90031
Caused by: java.lang.RuntimeException: Exception while executing statement : IO Exception: java.io.IOException: Too many open files; /etc/kafka-web-console/play; SQL statement: delete from offsetPoints where (offsetPoints.offsetHistoryId = ?) [90031-172] errorCode: 90031, sqlState: 90031

then this seems to cause socket connection errors:

Caused by: java.io.IOException: Too many open files
        at java.io.UnixFileSystem.createFileExclusively(Native Method) ~[na:1.7.0_75]
        at java.io.File.createNewFile(File.java:1006) ~[na:1.7.0_75]
        at org.h2.store.fs.FilePathDisk.createTempFile(FilePathDisk.java:367) ~[h2.jar:1.3.172]
        at org.h2.store.fs.FileUtils.createTempFile(FileUtils.java:329) ~[h2.jar:1.3.172]
        at org.h2.engine.Database.createTempFile(Database.java:1529) ~[h2.jar:1.3.172]
        at org.h2.result.RowList.writeAllRows(RowList.java:90) ~[h2.jar:1.3.172]
[debug] application - Getting partition leaders for topic topic-exist-test
[debug] application - Getting partition leaders for topic topic-rep-3-test
[debug] application - Getting partition leaders for topic PofApiTest
[debug] application - Getting partition leaders for topic PofApiTest-2
[debug] application - Getting partition leaders for topic fileread
[debug] application - Getting partition leaders for topic pageview
[debug] application - Getting partition log sizes for topic topic-exist-test from partition leaders 10.100.71.42:9092 (x8)
[warn] application - Could not connect to partition leader 10.100.71.42:9092. Error message: Failed to open a socket. (repeated 8 times)
[debug] application - Getting partition offsets for topic topic-exist-test
[debug] application - Getting partition log sizes for topic topic-exist-test from partition leaders harmful-jar:9092, exemplary-birds:9092, voluminous-mass:9092
[warn] application - Could not connect to partition leader voluminous-mass:9092. Error message: Failed to open a socket.
[warn] application - Could not connect to partition leader exemplary-birds:9092. Error message: Failed to open a socket.
[warn] application - Could not connect to partition leader harmful-jar:9092. Error message: Failed to open a socket.
(the same three warnings repeat several times)
[debug] application - Getting partition offsets for topic PofApiTest
[debug] application - Getting partition log sizes for topic topic-rep-3-test from partition leaders exemplary-birds:9092, voluminous-mass:9092, harmful-jar:9092, exemplary-birds:9092, voluminous-mass:9092,
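To tell whether the console process itself is leaking descriptors (rather than the limit simply being too low), it helps to compare the per-process soft limit with the number of descriptors actually in use. A small sketch for the current process; /proc is Linux-specific, and for the console's own PID the same count comes from lsof -p <pid>:

```python
import os
import resource

def fd_status():
    """Return (open_fd_count, soft_limit) for the current process.

    /proc/<pid>/fd exists only on Linux; report -1 elsewhere.
    """
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    fd_dir = "/proc/%d/fd" % os.getpid()
    open_fds = len(os.listdir(fd_dir)) if os.path.isdir(fd_dir) else -1
    return open_fds, soft
```

If the in-use count climbs steadily toward the limit while the console is idle, raising ulimit only delays the crash; the leak is in the application's socket/temp-file handling.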
rebalancing leadership quite often
Hi all, my dev cluster has three nodes (1, 2, 3), but I've often seen that node 1 just does not stay leader. I have run preferred-replica-election many times; every time I run it, node 1 becomes leader for some partitions, but it loses leadership after a while and those partitions' leadership transfers to the other brokers, meaning node 1 will not be used by consumers. I thought that would only happen if broker 1 crashed or stopped, but it didn't, and I still see the leadership shift. Any ideas about this? thanks -- Alec Li
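If the goal is for leadership to return to broker 1 automatically, the controller can run the preferred-replica election itself; these broker settings (real server.properties keys in 0.8.x, values shown only as an example) control that behavior. Note that repeatedly *losing* leadership usually means broker 1 is falling out of the ISR, e.g. from ZooKeeper session timeouts or long GC pauses, which is worth checking in the controller and server logs first.

```properties
# server.properties (values illustrative)
# Let the controller move leadership back to the preferred replica
auto.leader.rebalance.enable=true
# How often the controller checks for leader imbalance
leader.imbalance.check.interval.seconds=300
# Rebalance once a broker's non-preferred leadership exceeds this percentage
leader.imbalance.per.broker.percentage=10
```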
Re: Number of Consumers Connected
Guozhang, sorry for leaving this topic for a while. I am still not clear on how to commit the offset to ZK from the command line. I tried this:

bin/kafka-console-consumer.sh --zookeeper 10.100.71.33:2181 --topic pipe-test-2 --from-beginning --property pipe

It seems to generate a console-consumer-001 node in ZK, but when I did that with other topics, nothing appeared in ZK (I can't read anything from the consumer group in kafka-web-console), see:

[zk: localhost:2181(CONNECTED) 2] ls /consumers/web-console-consumer-38650
[offsets, owners, ids]
[zk: localhost:2181(CONNECTED) 3] ls /consumers/web-console-consumer-38650/offsets
[PofApiTest-1]
[zk: localhost:2181(CONNECTED) 4] ls /consumers/web-console-consumer-38650/offsets/PofApiTest-1
[3, 2, 1, 0, 7, 6, 5, 4]
[zk: localhost:2181(CONNECTED) 5] ls /consumers/web-console-consumer-38650/offsets/PofApiTest-1/3
[]

Any ideas? Thanks, AL

On Tue, Jan 20, 2015 at 9:57 PM, Guozhang Wang wangg...@gmail.com wrote:

It seems not the latest version of Kafka, which version are you using?

On Tue, Jan 20, 2015 at 9:46 AM, Sa Li sal...@gmail.com wrote:

Guozhang, thank you very much for the reply, here I print out the kafka-console-consumer.sh help:

root@voluminous-mass:/srv/kafka# bin/kafka-console-consumer.sh
Missing required argument [zookeeper]
Option                                  Description
------                                  -----------
--autocommit.interval.ms <Integer: ms>  The time interval at which to save the current offset in ms (default: 6)
--blacklist <blacklist>                 Blacklist of topics to exclude from consumption.
--consumer-timeout-ms <Integer: prop>   consumer throws timeout exception after waiting this much of time without incoming messages (default: -1)
--csv-reporter-enabled                  If set, the CSV metrics reporter will be enabled
--fetch-size <Integer: size>            The amount of data to fetch in a single request. (default: 1048576)
--formatter <class>                     The name of a class to use for formatting kafka messages for display. (default: kafka.consumer.DefaultMessageFormatter)
--from-beginning                        If the consumer does not already have an established offset to consume from, start with the earliest message present in the log rather than the latest message.
--group <gid>                           The group id to consume on. (default: console-consumer-85664)
--max-messages <Integer: num_messages>  The maximum number of messages to consume before exiting. If not set, consumption is continual.
--max-wait-ms <Integer: ms>             The max amount of time each fetch request waits. (default: 100)
--metrics-dir <metrics dictory>         If csv-reporter-enable is set, and this parameter isset, the csv metrics will be outputed here
--min-fetch-bytes <Integer: bytes>      The min number of bytes each fetch request waits for. (default: 1)
--property <prop>
--refresh-leader-backoff-ms <Integer: ms>  Backoff time before refreshing metadata (default: 200)
--skip-message-on-error                 If there is an error when processing a message, skip it instead of halt.
--socket-buffer-size <Integer: size>    The size of the tcp RECV size. (default: 2097152)
--socket-timeout-ms <Integer: ms>       The socket timeout used for the connection to the broker (default: 3)
--topic <topic>                         The topic id to consume on.
--whitelist <whitelist>                 Whitelist of topics to include for consumption.
--zookeeper <urls>                      REQUIRED: The connection string for the zookeeper connection in the form host:port. Multiple URLS can be given to allow fail-over.
kafka-web-console goes down regularly
Hi all, I am currently using kafka-web-console to monitor the Kafka system, but it goes down regularly, so I have to restart it every few hours, which is kind of annoying. I downloaded two versions:

https://github.com/claudemamo/kafka-web-console
http://mungeol-heo.blogspot.ca/2014/12/kafka-web-console.html

I start it with "play start" (using port 9000) or "play start -Dhttp.port=8080"; either version works fine at the beginning, but goes down after a few hours. I am thinking of using upstart to bring it back up automatically, but that should not be necessary. Any idea how to fix the problem, or did I do something wrong? thanks -- Alec Li
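Supervising the process does not fix the underlying crash, but as a stopgap an upstart job can restart the console whenever it dies. A sketch, with the install path and port as assumptions:

```
# /etc/init/kafka-web-console.conf  (path and port are illustrative)
description "kafka-web-console"
start on runlevel [2345]
stop on runlevel [016]
respawn
respawn limit 10 60
chdir /opt/kafka-web-console
exec play start -Dhttp.port=8080
```

The respawn limit stops upstart from looping forever if the console dies immediately on startup.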
Re: Number of Consumers Connected
Hi Guozhang, thank you very much for the reply. As you mentioned, I downloaded the latest version:

https://www.apache.org/dyn/closer.cgi?path=/kafka/0.8.2-beta/kafka-0.8.2-beta-src.tgz

I untarred the build and here is what I see:

root@DO-mq-dev:/home/stuser/kafka-0.8.2-beta-src/bin# kafka-console-consumer.sh
The console consumer is a tool that reads data from Kafka and outputs it to standard output.
Option                                  Description
------                                  -----------
--blacklist <blacklist>                 Blacklist of topics to exclude from consumption.
--consumer.config <config file>         Consumer config properties file.
--csv-reporter-enabled                  If set, the CSV metrics reporter will be enabled
--delete-consumer-offsets               If specified, the consumer path in zookeeper is deleted when starting up
--formatter <class>                     The name of a class to use for formatting kafka messages for display. (default: kafka.tools.DefaultMessageFormatter)
--from-beginning                        If the consumer does not already have an established offset to consume from, start with the earliest message present in the log rather than the latest message.
--max-messages <Integer: num_messages>  The maximum number of messages to consume before exiting. If not set, consumption is continual.
--metrics-dir <metrics dictory>         If csv-reporter-enable is set, and this parameter isset, the csv metrics will be outputed here
--property <prop>
--skip-message-on-error                 If there is an error when processing a message, skip it instead of halt.
--topic <topic>                         The topic id to consume on.
--whitelist <whitelist>                 Whitelist of topics to include for consumption.
--zookeeper <urls>                      REQUIRED: The connection string for the zookeeper connection in the form host:port. Multiple URLS can be given to allow fail-over.

Again, I am still not able to see a description of --property. Did I download the wrong version? Thanks, AL

On Tue, Feb 3, 2015 at 4:29 PM, Guozhang Wang wangg...@gmail.com wrote:

Hello Sa, could you try the latest 0.8.2 release, whose console consumer tool has been polished a bit with clearer properties? Guozhang

On Tue, Feb 3, 2015 at 10:32 AM, Sa Li sal...@gmail.com wrote:

Guozhang, sorry for leaving this topic for a while. I am still not clear how to commit the offset to ZK from the command line. I tried:

bin/kafka-console-consumer.sh --zookeeper 10.100.71.33:2181 --topic pipe-test-2 --from-beginning --property pipe

It seems to generate a console-consumer-001 node in ZK, but when I did that with other topics, nothing appeared in ZK (I can't read anything from the consumer group in kafka-web-console), see:

[zk: localhost:2181(CONNECTED) 2] ls /consumers/web-console-consumer-38650
[offsets, owners, ids]
[zk: localhost:2181(CONNECTED) 3] ls /consumers/web-console-consumer-38650/offsets
[PofApiTest-1]
[zk: localhost:2181(CONNECTED) 4] ls /consumers/web-console-consumer-38650/offsets/PofApiTest-1
[3, 2, 1, 0, 7, 6, 5, 4]
[zk: localhost:2181(CONNECTED) 5] ls /consumers/web-console-consumer-38650/offsets/PofApiTest-1/3
[]

Any ideas? Thanks, AL

On Tue, Jan 20, 2015 at 9:57 PM, Guozhang Wang wangg...@gmail.com wrote:

It seems not the latest version of Kafka, which version are you using?

On Tue, Jan 20, 2015 at 9:46 AM, Sa Li sal...@gmail.com wrote:

Guozhang, thank you very much for the reply, here I print out the kafka-console-consumer.sh help:

root@voluminous-mass:/srv/kafka# bin/kafka-console-consumer.sh
Missing required argument [zookeeper]
Option                                  Description
------                                  -----------
--autocommit.interval.ms <Integer: ms>  The time interval at which to save the current offset in ms (default: 6)
--blacklist <blacklist>                 Blacklist of topics to exclude from consumption.
--consumer-timeout-ms <Integer: prop>   consumer throws timeout
java.nio.channels.ClosedChannelException
Hi all, I send messages from one VM to production, but I am getting this error:

[2015-01-30 18:43:44,810] WARN Failed to send producer request with correlation id 126 to broker 101 with data for partitions [test-rep-three,5],[test-rep-three,2] (kafka.producer.async.DefaultEventHandler)
java.nio.channels.ClosedChannelException
        at kafka.network.BlockingChannel.send(BlockingChannel.scala:100)
        at kafka.producer.SyncProducer.liftedTree1$1(SyncProducer.scala:73)
        at kafka.producer.SyncProducer.kafka$producer$SyncProducer$$doSend(SyncProducer.scala:72)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(SyncProducer.scala:103)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103)
        at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply$mcV$sp(SyncProducer.scala:102)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102)
        at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
        at kafka.producer.SyncProducer.send(SyncProducer.scala:101)
        at kafka.producer.async.DefaultEventHandler.kafka$producer$async$DefaultEventHandler$$send(DefaultEventHandler.scala:256)
        at kafka.producer.async.DefaultEventHandler$$anonfun$dispatchSerializedData$2.apply(DefaultEventHandler.scala:107)
        at kafka.producer.async.DefaultEventHandler$$anonfun$dispatchSerializedData$2.apply(DefaultEventHandler.scala:99)
        at scala.collection.TraversableLike$WithFilter$$anonfun$foreach$1.apply(TraversableLike.scala:772)
        at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98)
        at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98)
        at scala.collection.mutable.HashTable$class.foreachEntry(HashTable.scala:226)
        at scala.collection.mutable.HashMap.foreachEntry(HashMap.scala:39)
        at scala.collection.mutable.HashMap.foreach(HashMap.scala:98)

I actually had this kind of error before on another VM; it works fine on that VM now, but when I start a new VM and build Kafka on it, the error comes back. I really can't recall what I did to fix this problem before. Any ideas? BTW, when I telnet the broker it says connected. Thanks, AL -- Alec Li
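A ClosedChannelException from the old sync producer, even when telnet to the bootstrap broker works, often means the broker registered an unreachable address in ZooKeeper: the client connects fine for metadata, then fails when it dials the advertised host. In 0.8.x the relevant server.properties keys are below (the address shown is only an example); whatever is set here must resolve and be reachable from every producer VM.

```properties
# server.properties on each broker (address illustrative)
# What the broker registers in ZooKeeper and returns in metadata;
# clients connect to this, not to the bootstrap address.
advertised.host.name=10.100.71.41
advertised.port=9092
```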
Re: kafka production server test
Thank you for the reply, Guozhang. Right now I can get it to work out of the box: run testcase_1 on the VM and access production. However, from my point of view, we would really like to test the existing configs on production, which means, for example, that replica_basic_test.py should not start ZooKeeper/Kafka, since I want to test the already-started brokers. I am thinking of commenting out this part:

self.log_message("starting zookeepers")
kafka_system_test_utils.start_zookeepers(self.systemTestEnv, self.testcaseEnv)
self.anonLogger.info("sleeping for 2s")
time.sleep(2)
self.log_message("starting brokers")
kafka_system_test_utils.start_brokers(self.systemTestEnv, self.testcaseEnv)
self.anonLogger.info("sleeping for 5s")
time.sleep(5)

Now I plan to modify the properties files in /system_test/replication_testsuite/config/, and cluster_config.json and testcase_1_properties.json in /system_test/replication_testsuite/testcase_1/, to make these config files exactly the same as what we have on production. Will that work, or do I need to change some other dependencies to get it working? Thanks, AL

On Mon, Jan 26, 2015 at 12:16 PM, Guozhang Wang wangg...@gmail.com wrote:

Sa, I believe your questions have mostly been answered by Ewen, and sorry for getting to this late. As you noticed, the current system test's out-of-the-box experience is not very good, and we are proposing ways to improve that situation:
KAFKA-1748 https://issues.apache.org/jira/browse/KAFKA-1748
KAFKA-1589 https://issues.apache.org/jira/browse/KAFKA-1589
And we are adding some more test cases at the same time:
KAFKA-1888 https://issues.apache.org/jira/browse/KAFKA-1888
So if you have new observations while using the package, or if you are willing to contribute to those tickets, you are most welcome. Guozhang

On Thu, Jan 22, 2015 at 3:02 PM, Sa Li sal...@gmail.com wrote:

Hi Guozhang, can I run this package remotely to test another server? That is, I run this package on dev but test the Kafka system on production? thanks AL

On Thu, Jan 22, 2015 at 2:55 PM, Sa Li sal...@gmail.com wrote:

Hi Guozhang, good to know such a package, will try it now. :-) thanks

On Thu, Jan 22, 2015 at 2:40 PM, Guozhang Wang wangg...@gmail.com wrote:

Hi Sa, have you looked into the system test package? It contains a suite of tests on different failure modes of Kafka brokers. Guozhang

On Thu, Jan 22, 2015 at 12:00 PM, Sa Li sal...@gmail.com wrote:

Hi all, we are about to deliver a Kafka production server. I have been working on different tests, like the performance test from LinkedIn. This is a 3-node cluster with a 5-node ZooKeeper ensemble. I assume there are lots of tests I need to do, like network, node failure, flush time, etc. Is there a complete guide for testing Kafka production servers? thanks -- Alec Li

-- Guozhang

-- Alec Li
Kafka System test
Hi all, from my last thread (Subject: kafka production server test), Guozhang kindly pointed me to the system test package that comes with the Kafka source build, which is a really cool package. I took a look at this package; things are clear if I run it on localhost: I don't need to change anything. cluster_config.json defines the entities, and the system test reads testcase__properties.json to override the properties in cluster_config.json. For example, cluster_config.json defaults the hostname to localhost with three brokers, so I assume it will create 3 brokers on localhost and run the test.

Currently I installed the package on a vagrant VM, and I would like to run the system test on the VM and remotely access production to test the production cluster. The production cluster has 3 nodes, and the Kafka production cluster sits on top of a 5-node ZooKeeper ensemble. My question is how to effectively change the properties in the vagrant system test package.

1. Change cluster_config.json, like:

{
  "entity_id": "0",
  "hostname": "10.100.70.28,10.100.70.29,10.100.70.30,10.100.70.31,10.100.70.32",
  "role": "zookeeper",
  "cluster_name": "target",
  "kafka_home": "/etc/kafka",
  "java_home": "/usr/lib/jvm/java-7-openjdk-amd64/jre",
  "jmx_port": "9990"
},
{
  "entity_id": "1",
  "hostname": "10.100.70.28",
  "role": "broker",
  "cluster_name": "target",
  "kafka_home": "/etc/kafka",
  "java_home": "/usr/lib/jvm/java-7-openjdk-amd64/jre",
  "jmx_port": "9991"
},

Because I want to test remote servers, I need to change the cluster_name to "target", right?

2. In the directory ./replication_testsuite/config/, for all the properties files, do I need to change them all to be the same as the properties on the production servers?

3. In ./replication_testsuite/testcase_/, it seems I need to make corresponding changes as well to stay consistent with the ./config/ properties; e.g., log.dir: /tmp/kafka_server_1_logs would be changed to the log.dir in my production server.properties. Is that right?

Hope someone who has done the system test on a remote server can share some experience. thanks AL -- Alec Li
Re: Kafka System test
Also, I found that ./kafka/system_test/cluster_config.json is duplicated in each directory ./kafka/system_test/replication_testsuite/testcase_/. When I change ./kafka/system_test/cluster_config.json, do I need to overwrite each ./kafka/system_test/replication_testsuite/testcase_/cluster_config.json as well? Thanks, AL

On Fri, Jan 23, 2015 at 1:39 PM, Sa Li sal...@gmail.com wrote:

Thanks for the reply, Ewen. Pertaining to your statement "... hostname setting being a list instead of a single host", are you referring to entity_id 1 or 0?

entity_id: 0, hostname: 10.100.70.28,10.100.70.29,10.100.70.30,10.100.70.31,10.100.70.32
entity_id: 1, hostname: 10.100.70.28

I thought the zookeeper role has multiple hosts, so I listed all the IPs of the ensemble, while entity 1 is about only 1 broker (my design for the production cluster is to fire up one broker per host, so 3 nodes with 3 brokers), so I specified only one hostname IP there. How do I change it? Thanks, AL

On Fri, Jan 23, 2015 at 1:22 PM, Ewen Cheslack-Postava e...@confluent.io wrote:

1. Except for that hostname setting being a list instead of a single host, the changes look reasonable. That is where you want to customize settings for your setup.

2 & 3. Yes, you'll want to update those files as well. The top-level ones provide defaults; the ones in specific test directories provide overrides for that specific test. But they aren't combined in any way, i.e. the more specific one is just taken as a whole rather than acting like a diff, so you do have to update both.

You might want to take a look at https://issues.apache.org/jira/browse/KAFKA-1748. Currently if you want to run all tests it's a pain to change the hosts they're running on, since it requires manually editing all those files. The patch gets rid of cluster_config.json and provides a couple of different ways of configuring the cluster -- run everything on localhost, get cluster info from a single json file, or get the ssh info from Vagrant.
On Fri, Jan 23, 2015 at 11:50 AM, Sa Li sal...@gmail.com wrote:

Hi, All From my last ticket (Subject: kafka production server test), Guozhang kindly point me the system test package come with kafka source build which is really cool package. I took a look at this package, things are clear if I run it on localhost, I don't need to change anything, say, cluster_config.json defines entities, and system test reads testcase__properties.json to override the properties in cluster_config.json. For example, cluster_config.json defaults hostname as localhost, and three brokers, I assume it will create 3 brokers in localhost and continue the test. Currently I install the package on a vagrant VM, and like to run the system test on VM and remotely access production to test production cluster. The production cluster has 3 nodes, on top of a 5-node zookeeper ensemble. My questions is how to effectively change the properties on the vagrant system test package.

1. change on cluster_config.json, like:

{ entity_id: 0, hostname: 10.100.70.28,10.100.70.29,10.100.70.30,10.100.70.31,10.100.70.32, role: zookeeper, cluster_name: target, kafka_home: /etc/kafka, java_home: /usr/lib/jvm/java-7-openjdk-amd64/jre, jmx_port: 9990 },
{ entity_id: 1, hostname: 10.100.70.28, role: broker, cluster_name: target, kafka_home: /etc/kafka, java_home: /usr/lib/jvm/java-7-openjdk-amd64/jre, jmx_port: 9991 },

Here because I want to test remote servers, so I need to change the cluster_name as target, right?

2. In directory ./replication_testsuite/config/, for all the properties files, do I need to change them all to be the same as the properties on production servers?

3. In ./replication_testsuite/testcase_/, seems I need to make corresponding changes as well to keep consistent with ./config/ properties, such as log.dir: /tmp/kafka_server_1_logs will be changed to the log.dir in my production server.properties, is that right?

Hope someone who has done the system test on remote server can share some experience, thanks AL

-- Alec Li

--
Thanks, Ewen

-- Alec Li
Re: Kafka System test
Thanks for the reply, Ewen. Pertaining to your statement "... hostname setting being a list instead of a single host", are you referring to entity_id 1 or 0?

entity_id: 0, hostname: 10.100.70.28,10.100.70.29,10.100.70.30,10.100.70.31,10.100.70.32
entity_id: 1, hostname: 10.100.70.28

I thought the zookeeper role has multiple hosts, so I listed all the IPs of the ensemble, while entity 1 is about only 1 broker (my design for the production cluster is to fire up one broker per host, so 3 nodes with 3 brokers), so I specified only one hostname IP there. How do I change it? Thanks, AL

On Fri, Jan 23, 2015 at 1:22 PM, Ewen Cheslack-Postava e...@confluent.io wrote:

1. Except for that hostname setting being a list instead of a single host, the changes look reasonable. That is where you want to customize settings for your setup.

2 & 3. Yes, you'll want to update those files as well. The top-level ones provide defaults; the ones in specific test directories provide overrides for that specific test. But they aren't combined in any way, i.e. the more specific one is just taken as a whole rather than acting like a diff, so you do have to update both.

You might want to take a look at https://issues.apache.org/jira/browse/KAFKA-1748. Currently if you want to run all tests it's a pain to change the hosts they're running on, since it requires manually editing all those files. The patch gets rid of cluster_config.json and provides a couple of different ways of configuring the cluster -- run everything on localhost, get cluster info from a single json file, or get the ssh info from Vagrant.

On Fri, Jan 23, 2015 at 11:50 AM, Sa Li sal...@gmail.com wrote:

Hi, All From my last ticket (Subject: kafka production server test), Guozhang kindly point me the system test package come with kafka source build which is really cool package. I took a look at this package, things are clear if I run it on localhost, I don't need to change anything, say, cluster_config.json defines entities, and system test reads testcase__properties.json to override the properties in cluster_config.json. For example, cluster_config.json defaults hostname as localhost, and three brokers, I assume it will create 3 brokers in localhost and continue the test. Currently I install the package on a vagrant VM, and like to run the system test on VM and remotely access production to test production cluster. The production cluster has 3 nodes, on top of a 5-node zookeeper ensemble. My questions is how to effectively change the properties on the vagrant system test package.

1. change on cluster_config.json, like:

{ entity_id: 0, hostname: 10.100.70.28,10.100.70.29,10.100.70.30,10.100.70.31,10.100.70.32, role: zookeeper, cluster_name: target, kafka_home: /etc/kafka, java_home: /usr/lib/jvm/java-7-openjdk-amd64/jre, jmx_port: 9990 },
{ entity_id: 1, hostname: 10.100.70.28, role: broker, cluster_name: target, kafka_home: /etc/kafka, java_home: /usr/lib/jvm/java-7-openjdk-amd64/jre, jmx_port: 9991 },

Here because I want to test remote servers, so I need to change the cluster_name as target, right?

2. In directory ./replication_testsuite/config/, for all the properties files, do I need to change them all to be the same as the properties on production servers?

3. In ./replication_testsuite/testcase_/, seems I need to make corresponding changes as well to keep consistent with ./config/ properties, such as log.dir: /tmp/kafka_server_1_logs will be changed to the log.dir in my production server.properties, is that right?

Hope someone who has done the system test on remote server can share some experience, thanks AL

-- Alec Li

--
Thanks, Ewen

-- Alec Li
Re: kafka production server test
Hi, Guozhang

Can I use this package to remotely test another server? That is, can I run the package on dev while testing the Kafka system on production?

thanks

AL

On Thu, Jan 22, 2015 at 2:55 PM, Sa Li sal...@gmail.com wrote:

Hi, Guozhang, Good to know about such a package, will try it now. :-) thanks

On Thu, Jan 22, 2015 at 2:40 PM, Guozhang Wang wangg...@gmail.com wrote:

Hi Sa, Have you looked into the system test package? It contains a suite of tests on different failure modes of Kafka brokers. Guozhang

On Thu, Jan 22, 2015 at 12:00 PM, Sa Li sal...@gmail.com wrote:

Hi, All We are about to deliver a Kafka production server, and I have been working on different tests, like the performance test from LinkedIn. This is a 3-node cluster with a 5-node ZooKeeper ensemble. I assume there are lots of tests I need to do: network, node failure, flush time, etc. Is there a complete guide for testing Kafka production servers? thanks -- Alec Li

-- -- Guozhang -- Alec Li -- Alec Li
Re: kafka production server test
Hi, Guozhang,

Good to know about such a package, will try it now. :-) thanks

On Thu, Jan 22, 2015 at 2:40 PM, Guozhang Wang wangg...@gmail.com wrote:

Hi Sa, Have you looked into the system test package? It contains a suite of tests on different failure modes of Kafka brokers. Guozhang

On Thu, Jan 22, 2015 at 12:00 PM, Sa Li sal...@gmail.com wrote:

Hi, All We are about to deliver a Kafka production server, and I have been working on different tests, like the performance test from LinkedIn. This is a 3-node cluster with a 5-node ZooKeeper ensemble. I assume there are lots of tests I need to do: network, node failure, flush time, etc. Is there a complete guide for testing Kafka production servers? thanks -- Alec Li

-- -- Guozhang -- Alec Li
kafka production server test
Hi, All

We are about to deliver a Kafka production server, and I have been working on different tests, like the performance test from LinkedIn. This is a 3-node cluster with a 5-node ZooKeeper ensemble. I assume there are lots of tests I need to do: network, node failure, flush time, etc. Is there a complete guide for testing Kafka production servers?

thanks

-- Alec Li
Re: Number of Consumers Connected
Guozhang

Thank you very much for the reply. Here I print out the kafka-console-consumer.sh help:

root@voluminous-mass:/srv/kafka# bin/kafka-console-consumer.sh
Missing required argument [zookeeper]
Option                                     Description
------                                     -----------
--autocommit.interval.ms <Integer: ms>     The time interval at which to save the current offset in ms (default: 6)
--blacklist <blacklist>                    Blacklist of topics to exclude from consumption.
--consumer-timeout-ms <Integer: prop>      consumer throws timeout exception after waiting this much of time without incoming messages (default: -1)
--csv-reporter-enabled                     If set, the CSV metrics reporter will be enabled
--fetch-size <Integer: size>               The amount of data to fetch in a single request. (default: 1048576)
--formatter <class>                        The name of a class to use for formatting kafka messages for display. (default: kafka.consumer.DefaultMessageFormatter)
--from-beginning                           If the consumer does not already have an established offset to consume from, start with the earliest message present in the log rather than the latest message.
--group <gid>                              The group id to consume on. (default: console-consumer-85664)
--max-messages <Integer: num_messages>     The maximum number of messages to consume before exiting. If not set, consumption is continual.
--max-wait-ms <Integer: ms>                The max amount of time each fetch request waits. (default: 100)
--metrics-dir <metrics directory>          If csv-reporter-enabled is set, and this parameter is set, the csv metrics will be output here
--min-fetch-bytes <Integer: bytes>         The min number of bytes each fetch request waits for. (default: 1)
--property <prop>
--refresh-leader-backoff-ms <Integer: ms>  Backoff time before refreshing metadata (default: 200)
--skip-message-on-error                    If there is an error when processing a message, skip it instead of halting.
--socket-buffer-size <Integer: size>       The size of the tcp RECV buffer. (default: 2097152)
--socket-timeout-ms <Integer: ms>          The socket timeout used for the connection to the broker (default: 3)
--topic <topic>                            The topic id to consume on.
--whitelist <whitelist>                    Whitelist of topics to include for consumption.
--zookeeper <urls>                         REQUIRED: The connection string for the zookeeper connection in the form host:port. Multiple URLS can be given to allow fail-over.

The --property option comes with no description; is there an example of how to use it?

thanks

AL

On Mon, Jan 19, 2015 at 6:30 PM, Guozhang Wang wangg...@gmail.com wrote:

There is a property config you can set via bin/kafka-console-consumer.sh to commit offsets to ZK; you can use bin/kafka-console-consumer.sh --help to list all the properties. Guozhang

On Mon, Jan 19, 2015 at 5:15 PM, Sa Li sal...@gmail.com wrote:

Guozhang, Currently we are in the stage of testing the producer. Our C# producer sends data to the brokers, and we use the bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance command to produce the messages. We don't have a coded consumer to commit offsets; we use the bin/kafka-console-consumer.sh --zookeeper command to consume. Is there a command we can use on the command line to create the zk path?

thanks

AL

On Mon, Jan 19, 2015 at 4:14 PM, Guozhang Wang wangg...@gmail.com wrote:

Sa, Did your consumer ever commit offsets to Kafka? If not then no corresponding ZK path will be created. Guozhang

On Mon, Jan 19, 2015 at 3:58 PM, Sa Li sal...@gmail.com wrote:

Hi, I use such tool Consumer Offset Checker
Re: Number of Consumers Connected
Hi,

I use the ConsumerOffsetChecker tool, which displays the Consumer Group, Topic, Partitions, Offset, logSize, Lag, and Owner for the specified set of topics and consumer group:

bin/kafka-run-class.sh kafka.tools.ConsumerOffsetChecker

To find the consumer group, in zkCli.sh:

[zk: localhost:2181(CONNECTED) 3] ls /
[transactional, admin, zookeeper, consumers, config, controller, storm, brokers, controller_epoch]
[zk: localhost:2181(CONNECTED) 4] ls /consumers
[web-console-consumer-99295, web-console-consumer-37853, web-console-consumer-30841, perf-consumer-92283, perf-consumer-21631, perf-consumer-95281, perf-consumer-59296, web-console-consumer-52126, web-console-consumer-89137, perf-consumer-72484, perf-consumer-80363, web-console-consumer-47543, web-console-consumer-22509, perf-consumer-16954, perf-consumer-53957, perf-consumer-39448, web-console-consumer-17021, perf-consumer-88693, web-console-consumer-48744, web-console-consumer-82543, perf-consumer-89565, web-console-consumer-97959, perf-consumer-40427, web-console-consumer-95350, web-console-consumer-26473, web-console-consumer-79384, web-console-consumer-8, perf-consumer-91681, web-console-consumer-36136, web-console-consumer-86924, perf-consumer-24510, perf-consumer-5888, perf-consumer-73534, perf-consumer-92985, perf-consumer-7675, perf-consumer-52306, perf-consumer-87352, web-console-consumer-30400]
[zk: localhost:2181(CONNECTED) 5]

I then run:

root@exemplary-birds:/srv/kafka# bin/kafka-run-class.sh kafka.tools.ConsumerOffsetChecker --topic PofApiTest-1 --group web-console-consumer-48744
Group Topic Pid Offset logSize Lag Owner
Exception in thread main org.I0Itec.zkclient.exception.ZkNoNodeException: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /consumers/web-console-consumer-48744/offsets/PofApiTest-1/0 at org.I0Itec.zkclient.exception.ZkException.create(ZkException.java:47) at org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:685) at 
org.I0Itec.zkclient.ZkClient.readData(ZkClient.java:766) at org.I0Itec.zkclient.ZkClient.readData(ZkClient.java:761) at kafka.utils.ZkUtils$.readData(ZkUtils.scala:461) at kafka.tools.ConsumerOffsetChecker$.kafka$tools$ConsumerOffsetChecker$$processPartition(ConsumerOffsetChecker.scala:59) at kafka.tools.ConsumerOffsetChecker$$anonfun$kafka$tools$ConsumerOffsetChecker$$processTopic$1.apply$mcVI$sp(ConsumerOffsetChecker.scala:89) at kafka.tools.ConsumerOffsetChecker$$anonfun$kafka$tools$ConsumerOffsetChecker$$processTopic$1.apply(ConsumerOffsetChecker.scala:89) at kafka.tools.ConsumerOffsetChecker$$anonfun$kafka$tools$ConsumerOffsetChecker$$processTopic$1.apply(ConsumerOffsetChecker.scala:89) at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59) at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47) at kafka.tools.ConsumerOffsetChecker$.kafka$tools$ConsumerOffsetChecker$$processTopic(ConsumerOffsetChecker.scala:88) at kafka.tools.ConsumerOffsetChecker$$anonfun$main$3.apply(ConsumerOffsetChecker.scala:153) at kafka.tools.ConsumerOffsetChecker$$anonfun$main$3.apply(ConsumerOffsetChecker.scala:153) at scala.collection.immutable.List.foreach(List.scala:318) at kafka.tools.ConsumerOffsetChecker$.main(ConsumerOffsetChecker.scala:152) at kafka.tools.ConsumerOffsetChecker.main(ConsumerOffsetChecker.scala) Caused by: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /consumers/web-console-consumer-48744/offsets/PofApiTest-1/0 at org.apache.zookeeper.KeeperException.create(KeeperException.java:102) at org.apache.zookeeper.KeeperException.create(KeeperException.java:42) at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:927) at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:956) at org.I0Itec.zkclient.ZkConnection.readData(ZkConnection.java:103) at org.I0Itec.zkclient.ZkClient$9.call(ZkClient.java:770) at org.I0Itec.zkclient.ZkClient$9.call(ZkClient.java:766) at 
org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:675) ... 15 more

So the consumer groups are confusing. I didn't specify a consumer group id in the producer, and the only place I know to configure a group is consumer.properties:

#consumer group id
group.id=test-consumer-group

Any hints? Thanks AL

On Mon, Dec 15, 2014 at 6:46 PM, nitin sharma kumarsharma.ni...@gmail.com wrote: got it ... thanks a lot. Regards, Nitin Kumar Sharma.

On Mon, Dec 15, 2014 at 9:26 PM, Gwen Shapira gshap...@cloudera.com wrote: Hi Nitin, Go to where you installed zookeeper and run: bin/zkCli.sh -server 127.0.0.1:2181

On Mon, Dec 15, 2014 at 6:09 PM, nitin sharma kumarsharma.ni...@gmail.com wrote: Thanks Neha and Gwen for your responses..
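Tying the thread together: the console consumer invents a random group id (console-consumer-NNNNN, as in the ls /consumers output above) unless one is given, and ConsumerOffsetChecker only finds groups that have actually committed offsets. A command sketch, assuming a 0.8.x install reachable at localhost and using the topic name from the thread (the group name my-test-group is just an example):

```shell
# Consume with an explicit group id instead of an auto-generated one;
# autocommit (on by default) will create /consumers/my-test-group/offsets/... in ZK.
bin/kafka-console-consumer.sh --zookeeper localhost:2181 \
  --topic PofApiTest-1 --group my-test-group --from-beginning

# After offsets have been committed, the checker can report lag for that group:
bin/kafka-run-class.sh kafka.tools.ConsumerOffsetChecker \
  --topic PofApiTest-1 --group my-test-group
```

Checking a randomly generated console-consumer group, as in the thread, fails with NoNode whenever that short-lived consumer never committed before exiting.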
Re: kafka-web-console error
Continuing this kafka-web-console thread, I followed this page: http://mungeol-heo.blogspot.ca/2014/12/kafka-web-console.html and ran the command: play start -Dhttp.port=8080

It works well for a while, but then I get this error:

at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

Caused by: java.sql.SQLException: Timed out waiting for a free available connection. at com.jolbox.bonecp.DefaultConnectionStrategy.getConnectionInternal(DefaultConnectionStrategy.java:88) at com.jolbox.bonecp.AbstractConnectionStrategy.getConnection(AbstractConnectionStrategy.java:90) at com.jolbox.bonecp.BoneCP.getConnection(BoneCP.java:553) at com.jolbox.bonecp.BoneCPDataSource.getConnection(BoneCPDataSource.java:131) at play.api.db.DBApi$class.getConnection(DB.scala:67) at play.api.db.BoneCPApi.getConnection(DB.scala:276) at play.api.db.DB$$anonfun$getConnection$1.apply(DB.scala:133) at play.api.db.DB$$anonfun$getConnection$1.apply(DB.scala:133) at scala.Option.map(Option.scala:145) at play.api.db.DB$.getConnection(DB.scala:133) at Global$.Global$$getSession(Global.scala:58) at Global$$anonfun$initiateDb$1.apply(Global.scala:47) at Global$$anonfun$initiateDb$1.apply(Global.scala:47) at org.squeryl.SessionFactory$.newSession(Session.scala:95) at org.squeryl.dsl.QueryDsl$class.inTransaction(QueryDsl.scala:100) at org.squeryl.PrimitiveTypeMode$.inTransaction(PrimitiveTypeMode.scala:40) at models.Setting$.findByKey(Setting.scala:46) at actors.OffsetHistoryManager.actors$OffsetHistoryManager$$schedule(OffsetHistoryManager.scala:102) at actors.OffsetHistoryManager.preStart(OffsetHistoryManager.scala:55) at akka.actor.Actor$class.postRestart(Actor.scala:532) at actors.OffsetHistoryManager.postRestart(OffsetHistoryManager.scala:41) at 
akka.actor.dungeon.FaultHandling$class.finishRecreate(FaultHandling.scala:229) ... 11 more

Any hints? thanks AL

On Fri, Jan 2, 2015 at 5:07 PM, Joe Stein joe.st...@stealth.ly wrote:

The Kafka project doesn't have an official web console, so you might need to open an issue on the GitHub page of the web console project you are using; it may not be closing connections and may be using up all available resources regardless of what you have set. If it is not a bug in the client you are using, you may have to increase this value for the operating system. You can give http://www.cyberciti.biz/faq/howto-linux-get-list-of-open-files/ a try to ascertain which is the problem, and/or do a ulimit -n on your machine and see if the value = 1024, which is likely the default for your OS.

/*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop /

On Fri, Jan 2, 2015 at 7:41 PM, Sa Li sal...@gmail.com wrote:

Hi, all I am running kafka-web-console, and I periodically get such an error, which brings the UI down: ! 
@6kldaf9lj - Internal server error, for (GET) [/assets/images/zookeeper_small.gif] - play.api.Application$$anon$1: Execution exception[[FileNotFoundException: /vagrant/kafka-web-console-master/target/scala-2.10/classes/public/images/zookeeper_small.gif (Too many open files)]] at play.api.Application$class.handleError(Application.scala:293) ~[play_2.10-2.2.1.jar:2.2.1] at play.api.DefaultApplication.handleError(Application.scala:399) [play_2.10-2.2.1.jar:2.2.1] at play.core.server.netty.PlayDefaultUpstreamHandler$$anonfun$12$$anonfun$apply$1.applyOrElse(PlayDefaultUpstreamHandler.scala:165) [play_2.10-2.2.1.jar:2.2.1] at play.core.server.netty.PlayDefaultUpstreamHandler$$anonfun$12$$anonfun$apply$1.applyOrElse(PlayDefaultUpstreamHandler.scala:162) [play_2.10-2.2.1.jar:2.2.1] at scala.runtime.AbstractPartialFunction.apply(AbstractPartialFunction.scala:33) [scala-library-2.10.2.jar:na] at scala.util.Failure$$anonfun$recover$1.apply(Try.scala:185) [scala-library-2.10.2.jar:na] Caused by: java.io.FileNotFoundException: /vagrant/kafka-web-console-master/target/scala-2.10/classes/public/images/zookeeper_small.gif (Too many open files) at java.io.FileInputStream.open(Native Method) ~[na:1.7.0_65] at java.io.FileInputStream.init(FileInputStream.java:146) ~[na:1.7.0_65] at java.io.FileInputStream.init(FileInputStream.java:101) ~[na:1.7.0_65
Re: connection error among nodes
Hello, Jun

I run this command:

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test-rep-three 100 3000 -1 acks=-1 bootstrap.servers=10.100.98.100:9092,10.100.98.101:9092,10.100.98.102:9092 buffer.memory=67108864 batch.size=8196

This is the perf test for producing data to different hosts. I have these questions:

1. If I use bootstrap.servers=10.100.98.100:9092,10.100.98.101:9092,10.100.98.102:9092, does this mean I send data to each of the servers through port 9092? And if I set bootstrap.servers=10.100.98.100:9092 only, will the producer send data to .100 while the data replicates through other connections? Here is the netstat output I saw:

netstat -plantue | egrep -i '.98.101|.98.102'
tcp  0 0   10.100.98.100:37512 10.100.98.102:22    ESTABLISHED 1004 7516316 5862/ssh
tcp6 0 0   10.100.98.100:9092  10.100.98.102:56819 ESTABLISHED 0 371522  3852/java
tcp6 0 0   10.100.98.100:3888  10.100.98.101:40052 ESTABLISHED 0 27514   1793/java
tcp6 0 202 10.100.98.100:53592 10.100.98.101:9092  ESTABLISHED 0 497715  3852/java
tcp6 0 0   10.100.98.100:3888  10.100.98.102:32837 ESTABLISHED 0 21701   1793/java
tcp6 0 0   10.100.98.100:9092  10.100.98.101:51053 ESTABLISHED 0 371526  3852/java
tcp6 0 270 10.100.98.100:53591 10.100.98.101:9092  ESTABLISHED 0 497713  3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.102:57554 ESTABLISHED 0 7491796 3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.101:51055 ESTABLISHED 0 601277  3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.102:56824 ESTABLISHED 0 614795  3852/java
tcp6 0 0   10.100.98.100:48226 10.100.98.102:9092  ESTABLISHED 0 3659949 3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.101:51054 ESTABLISHED 0 601275  3852/java
tcp6 0 0   10.100.98.100:48225 10.100.98.102:9092  ESTABLISHED 0 3803556 3852/java
tcp6 0 0   10.100.98.100:53593 10.100.98.101:9092  ESTABLISHED 0 638462  3852/java
tcp6 0 236 10.100.98.100:48228 10.100.98.102:9092  ESTABLISHED 0 3936260 3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.102:56827 ESTABLISHED 0 601276  3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.101:51052 ESTABLISHED 0 614796  3852/java
tcp6 0 230 10.100.98.100:53594 10.100.98.101:9092  ESTABLISHED 0 637547  3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.102:56826 ESTABLISHED 0 601274  3852/java
tcp6 0 0   10.100.98.100:2181  10.100.98.102:34162 ESTABLISHED 0 7491795 1793/java
tcp6 0 230 10.100.98.100:48227 10.100.98.102:9092  ESTABLISHED 0 3121735 3852/java
tcp6 0 0   10.100.98.100:9092  10.100.98.102:56825 ESTABLISHED 0 497716  3852/java

2. I still see the server-disconnect error during the perf test, but it goes back to normal and prints results like:

11212 records sent, 2239.7 records/sec (6.41 MB/sec), 7287.1 ms avg latency, 15005.0 max latency.
11620 records sent, 2313.8 records/sec (6.62 MB/sec), 7248.9 ms avg latency, 14807.0 max latency.
11522 records sent, 2293.4 records/sec (6.56 MB/sec), 7110.9 ms avg latency, 14551.0 max latency.
11058 records sent, 2200.2 records/sec (6.29 MB/sec), 7176.8 ms avg latency, 14774.0 max latency.

By monitoring the connections between nodes, we didn't see any disconnections, but we are not sure whether the Kafka servers actually disconnected. How can we check?

thanks AL

On Sun, Jan 18, 2015 at 10:21 AM, Jun Rao j...@confluent.io wrote: Any issue with the network? Thanks, Jun

On Wed, Jan 7, 2015 at 1:59 PM, Sa Li sal...@gmail.com wrote: One thing bothers me: sometimes the errors won't pop up, sometimes they do. Why?

On Wed, Jan 7, 2015 at 1:49 PM, Sa Li sal...@gmail.com wrote:

Hi, Experts

Our cluster is a 3-node cluster. I simply tested the producer locally:

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test-rep-three 100 3000 -1 acks=1 bootstrap.servers=10.100.98.100:9092 buffer.memory=67108864 batch.size=8196

But I got such an error. I do think this is a critical issue; it temporarily loses the connection and then gets it back. What is the reason for this?
[2015-01-07 21:44:14,180] WARN Error in I/O with voluminous-mass.master/ 10.100.98.101 (org.apache.kafka.common.network.Selector) java.net.ConnectException: Connection refused at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method
java.io.IOException: Too many open files error
Hi, all

We are testing our production Kafka and getting this error:

[2015-01-15 19:03:45,057] ERROR Error in acceptor (kafka.network.Acceptor) java.io.IOException: Too many open files at sun.nio.ch.ServerSocketChannelImpl.accept0(Native Method) at sun.nio.ch.ServerSocketChannelImpl.accept(ServerSocketChannelImpl.java:241) at kafka.network.Acceptor.accept(SocketServer.scala:200) at kafka.network.Acceptor.run(SocketServer.scala:154) at java.lang.Thread.run(Thread.java:745)

I noticed some other developers had similar issues; one suggestion was:

Without knowing the intricacies of Kafka, I think the default open file descriptor limit is 1024 on unix. This can be changed by setting a higher ulimit value (typically 8192 but sometimes even 10 ). Before modifying the ulimit I would recommend you check the number of sockets stuck in TIME_WAIT mode. In this case, it looks like the broker has too many open sockets. This could be because you have a rogue client connecting and disconnecting repeatedly. You might have to reduce the TIME_WAIT state to 30 seconds or lower.

We increased the open file handles by inserting

kafka - nofile 10

in /etc/security/limits.conf. Is that the right way to change the open file descriptor limit? In addition, it says to reduce TIME_WAIT; where do I change that setting? Or is there any other solution for this issue?

thanks

-- Alec Li
Re: java.io.IOException: Too many open files error
Thanks for the reply. I have changed the configuration and am running to see if any errors come up.

SL

On Thu, Jan 15, 2015 at 3:34 PM, István lecc...@gmail.com wrote:

Hi Sa Li, Depending on your system, that configuration entry needs to be modified. The first parameter after the insert is the username you use to run Kafka. It might be your own username or something else; in the following example it is called kafkauser. On top of that I also like to use soft and hard limits: when you hit the soft limit the system will log a meaningful message in dmesg so you can see what is happening.

kafkauser soft nofile 8
kafkauser hard nofile 10

Hope that helps, Istvan

On Thu, Jan 15, 2015 at 12:30 PM, Sa Li sal...@gmail.com wrote:

Hi, all We test our production kafka, and getting such error [2015-01-15 19:03:45,057] ERROR Error in acceptor (kafka.network.Acceptor) java.io.IOException: Too many open files at sun.nio.ch.ServerSocketChannelImpl.accept0(Native Method) at sun.nio.ch.ServerSocketChannelImpl.accept(ServerSocketChannelImpl.java:241) at kafka.network.Acceptor.accept(SocketServer.scala:200) at kafka.network.Acceptor.run(SocketServer.scala:154) at java.lang.Thread.run(Thread.java:745) I noticed some other developers had similar issues, one suggestion was Without knowing the intricacies of Kafka, i think the default open file descriptors is 1024 on unix. This can be changed by setting a higher ulimit value (typically 8192 but sometimes even 10 ). Before modifying the ulimit I would recommend you check the number of sockets stuck in TIME_WAIT mode. In this case, it looks like the broker has too many open sockets. This could be because you have a rogue client connecting and disconnecting repeatedly. You might have to reduce the TIME_WAIT state to 30 seconds or lower. We increase the open file handles by doing this: insert kafka - nofile 10 in /etc/security/limits.conf Is that right to change the open file descriptors? In addition, it says to reduce the TIME_WAIT, where do I change this setting? Or any other solution for this issue? thanks -- Alec Li

-- the sun shines for all -- Alec Li
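To cross-check the limits discussed above, you can inspect what a broker started from your shell would actually inherit. A minimal sketch (the pgrep pattern is a hypothetical example; the /proc path is Linux-specific):

```shell
# Limits of the current shell (what a broker started from here inherits):
ulimit -Sn   # soft open-file limit, the one the "Too many open files" error hits
ulimit -Hn   # hard open-file limit, the ceiling the soft limit can be raised to

# Limits of an already-running broker process (example lookup, Linux only):
# BROKER_PID=$(pgrep -f kafka.Kafka)
# grep 'Max open files' /proc/$BROKER_PID/limits

# Sockets stuck in TIME_WAIT, per the suggestion above (where netstat exists):
# netstat -ant | grep -c TIME_WAIT
```

Note that limits.conf entries only take effect for new login sessions (via pam_limits), so the broker must be restarted from a fresh session to pick them up; checking /proc/<pid>/limits confirms whether it actually did.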
metric-kafka problems
Hello, all

I'd like to use the metrics-kafka tool, which seems attractive for reporting Kafka metrics and graphing them with Graphite; however, I am having trouble making it work. In https://github.com/stealthly/metrics-kafka, it says: In the main metrics-kafka folder 1) sudo ./bootstrap.sh 2) ./gradlew test 3) sudo ./shutdown.sh

When I run ./bootstrap.sh, this is what I got:

root@DO-mq-dev:/home/stuser/jmx/metrics-kafka# ././bootstrap.sh
/dev/stdin: line 1: syntax error near unexpected token `newline'
/dev/stdin: line 1: `!DOCTYPE html'
/dev/stdin: line 1: syntax error near unexpected token `newline'
/dev/stdin: line 1: `!DOCTYPE html'
e348a98a5afb8b89b94fce51b125e8a2045d9834268ec64c3e38cb7b165ef642
2015/01/09 16:49:21 Error response from daemon: Could not find entity for broker1

And this is how I vagrant up:

root@DO-mq-dev:/home/stuser/jmx/metrics-kafka# vagrant up
/usr/share/vagrant/plugins/provisioners/docker/plugin.rb:13:in `require_relative': /usr/share/vagrant/plugins/provisioners/docker/config.rb:23: syntax error, unexpected tPOW (SyntaxError) def run(name, **options) ^ /usr/share/vagrant/plugins/provisioners/docker/config.rb:43: syntax error, unexpected keyword_end, expecting $end from /usr/share/vagrant/plugins/provisioners/docker/plugin.rb:13:in `block in class:Plugin' from /usr/lib/ruby/vendor_ruby/vagrant/registry.rb:27:in `call' from /usr/lib/ruby/vendor_ruby/vagrant/registry.rb:27:in `get' from /usr/share/vagrant/plugins/kernel_v2/config/vm_provisioner.rb:34:in `initialize' from /usr/share/vagrant/plugins/kernel_v2/config/vm.rb:223:in `new' from /usr/share/vagrant/plugins/kernel_v2/config/vm.rb:223:in `provision' from /home/stuser/jmx/metrics-kafka/Vagrantfile:29:in `block (2 levels) in top (required)' from /usr/lib/ruby/vendor_ruby/vagrant/config/v2/loader.rb:37:in `call' from /usr/lib/ruby/vendor_ruby/vagrant/config/v2/loader.rb:37:in `load' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:104:in `block (2 levels) in load' from 
/usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:98:in `each' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:98:in `block in load' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:95:in `each' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:95:in `load' from /usr/lib/ruby/vendor_ruby/vagrant/environment.rb:335:in `machine' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:142:in `block in with_target_vms' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:175:in `call' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:175:in `block in with_target_vms' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:174:in `map' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:174:in `with_target_vms' from /usr/share/vagrant/plugins/commands/up/command.rb:56:in `block in execute' from /usr/lib/ruby/vendor_ruby/vagrant/environment.rb:210:in `block (2 levels) in batch' from /usr/lib/ruby/vendor_ruby/vagrant/environment.rb:208:in `tap' from /usr/lib/ruby/vendor_ruby/vagrant/environment.rb:208:in `block in batch' from internal:prelude:10:in `synchronize' from /usr/lib/ruby/vendor_ruby/vagrant/environment.rb:207:in `batch' from /usr/share/vagrant/plugins/commands/up/command.rb:55:in `execute' from /usr/lib/ruby/vendor_ruby/vagrant/cli.rb:38:in `execute' from /usr/lib/ruby/vendor_ruby/vagrant/environment.rb:484:in `cli' from /usr/bin/vagrant:127:in `main' Any idea to make it work? thanks -- Alec Li
Re: metric-kafka problems
Thank you very much, Joe. I will try all of them and keep this thread posted.

On Jan 9, 2015 5:55 PM, Joe Stein joe.st...@stealth.ly wrote:

Hi, https://github.com/stealthly/metrics-kafka is a project to be used as an example of how to use Kafka as a central point to send all of your metrics for your entire infrastructure. The consumers integrate so as to abstract the load and coupling of services, so systems can just send their stats to Kafka and then you can do whatever you want with them from there (often multiple things). We also built a Yammer Metrics Reporter (which is what Kafka uses to send its metrics) for Kafka itself, so brokers can send their stats into a Kafka topic to be used downstream (typically another cluster). The issue you reported was caused by changes by GitHub, and I just pushed fixes for them so things are working again.

If you are not looking for that type of solution and just want to see and chart broker metrics, then I would suggest taking a look at https://github.com/airbnb/kafka-statsd-metrics2 and pointing it to https://github.com/kamon-io/docker-grafana-graphite. I find this a very quick out-of-the-box way to see what is going on with a broker when no stats reporter is already in place.

If you want a Kafka metrics reporter for just Graphite, check out https://github.com/damienclaveau/kafka-graphite; for just Ganglia, https://github.com/criteo/kafka-ganglia; for just Riemann, https://github.com/TheLadders/KafkaRiemannMetricsReporter; and/or you can also use a service like SPM https://apps.sematext.com/spm-reports/mainPage.do?selectedApplication=4293 or DataDog https://www.datadoghq.com/.

Hope this helps, thanks! 
/*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop / On Fri, Jan 9, 2015 at 7:51 PM, Sa Li sal...@gmail.com wrote: Hello, all I like to use the tool metrics-kafka which seems to be attractive to report kafka metric and use graphite to graph metrics, however I am having trouble to make it work. In https://github.com/stealthly/metrics-kafka, it says: In the main metrics-kafka folder 1) sudo ./bootstrap.sh 2) ./gradlew test 3) sudo ./shutdown.sh When I run ./bootstrap, see this is what I got root@DO-mq-dev:/home/stuser/jmx/metrics-kafka# ././bootstrap.sh /dev/stdin: line 1: syntax error near unexpected token `newline' /dev/stdin: line 1: `!DOCTYPE html' /dev/stdin: line 1: syntax error near unexpected token `newline' /dev/stdin: line 1: `!DOCTYPE html' e348a98a5afb8b89b94fce51b125e8a2045d9834268ec64c3e38cb7b165ef642 2015/01/09 16:49:21 Error response from daemon: Could not find entity for broker1 And this is how I vagrant up: root@DO-mq-dev:/home/stuser/jmx/metrics-kafka# vagrant up /usr/share/vagrant/plugins/provisioners/docker/plugin.rb:13:in `require_relative': /usr/share/vagrant/plugins/provisioners/docker/config.rb:23: syntax error, unexpected tPOW (SyntaxError) def run(name, **options) ^ /usr/share/vagrant/plugins/provisioners/docker/config.rb:43: syntax error, unexpected keyword_end, expecting $end from /usr/share/vagrant/plugins/provisioners/docker/plugin.rb:13:in `block in class:Plugin' from /usr/lib/ruby/vendor_ruby/vagrant/registry.rb:27:in `call' from /usr/lib/ruby/vendor_ruby/vagrant/registry.rb:27:in `get' from /usr/share/vagrant/plugins/kernel_v2/config/vm_provisioner.rb:34:in `initialize' from /usr/share/vagrant/plugins/kernel_v2/config/vm.rb:223:in `new' from /usr/share/vagrant/plugins/kernel_v2/config/vm.rb:223:in `provision' from /home/stuser/jmx/metrics-kafka/Vagrantfile:29:in `block (2 levels) in top (required)' from 
/usr/lib/ruby/vendor_ruby/vagrant/config/v2/loader.rb:37:in `call' from /usr/lib/ruby/vendor_ruby/vagrant/config/v2/loader.rb:37:in `load' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:104:in `block (2 levels) in load' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:98:in `each' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:98:in `block in load' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:95:in `each' from /usr/lib/ruby/vendor_ruby/vagrant/config/loader.rb:95:in `load' from /usr/lib/ruby/vendor_ruby/vagrant/environment.rb:335:in `machine' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:142:in `block in with_target_vms' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:175:in `call' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:175:in `block in with_target_vms' from /usr/lib/ruby/vendor_ruby/vagrant/plugin/v2/command.rb:174:in `map
Re: zookeeper monitoring
Hi,

I went through zkServer.sh and made changes to /etc/zookeeper/conf/environment:

ZOOMAIN=-Dcom.sun.management.jmxremote=true -Dcom.sun.management.jmxremote.local.only=false -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.ssl=false com.sun.management.jmxremote.port=2 org.apache.zookeeper.server.quorum.QuorumPeerMain

which was originally

ZOOMAIN=org.apache.zookeeper.server.quorum.QuorumPeerMain

When I run it, I see:

root@DO-mq-dev:/etc/zookeeper/bin# ./zkServer.sh start
JMX enabled by default
Using config: /etc/zookeeper/conf/zoo.cfg
Starting zookeeper ... STARTED

But when I try to connect with jconsole to 10.100.70.128:2, it fails to connect. Is there a way to confirm jmxremote port = 2?

thanks AL

On Thu, Jan 8, 2015 at 4:02 PM, Sa Li sal...@gmail.com wrote:

Hi, all I've just figured out the monitoring of Kafka with jconsole, and I want to do the same thing for ZooKeeper. The ZooKeeper site says: The class *org.apache.zookeeper.server.quorum.QuorumPeerMain* will start a JMX manageable ZooKeeper server. This class registers the proper MBeans during initialization to support JMX monitoring and management of the instance. See *bin/zkServer.sh* for one example of starting ZooKeeper using QuorumPeerMain.

I found that when I type:

root@pof-kstorm-dev1:/etc/kafka# zkServer.sh start
JMX enabled by default
Using config: /etc/zookeeper/conf/zoo.cfg
Starting zookeeper ... STARTED

it seems JMX is enabled by default. Checking zkServer.sh:

ZOOMAIN=-Dcom.sun.management.jmxremote -Dcom.sun.management.jmxremote.local.only=$JMXLOCALONLY org.apache.zookeeper.server.quorum.QuorumPeerMain

Here, given -Dcom.sun.management.jmxremote.local.only=$JMXLOCALONLY, should I change a JMX port here, or what is the default JMX port number for ZooKeeper?

thanks -- Alec Li -- Alec Li
Re: zookeeper monitoring
Worked, thanks.

On Fri, Jan 9, 2015 at 10:37 AM, Sa Li sal...@gmail.com wrote: [...]

-- Alec Li
Re: kafka monitoring
Thank you very much for all the replies. I am able to connect jconsole now, by setting the env JMX_PORT before starting the server. However, when I connect I found there is a port conflict with kafka-run-class.sh:

Error: Exception thrown by the agent : java.rmi.server.ExportException: Port already in use: ; nested exception is: java.net.BindException: Address already in use

I can of course reset JMX_PORT to another number, but I am curious, do I have to? thanks AL

On Thu, Jan 8, 2015 at 11:57 AM, Gene Robichaux gene.robich...@match.com wrote: Is there a firewall between your DEV and PROD environments? If so you will need to open access on all ports, not just the JMX port. It gets complicated with JMX. Gene Robichaux, Manager, Database Operations, Match.com, 8300 Douglas Avenue, Suite 800, Dallas, TX 75225

-----Original Message----- From: Sa Li [mailto:sal...@gmail.com] Sent: Thursday, January 08, 2015 1:09 PM To: users@kafka.apache.org Subject: kafka monitoring

Hello, All. I understand many of you are using jmxtrans along with graphite/ganglia to pull out metrics. According to https://kafka.apache.org/081/ops.html: "The easiest way to see the available metrics is to fire up jconsole and point it at a running kafka client or server; this will allow browsing all metrics with JMX." I tried to fire up jconsole on Windows, attempting to access our dev and production clusters, which are running fine. Here is the main node of my dev: 10.100.75.128, broker port 9092, zk port 2181. Jconsole shows:

New Connection
Remote Process:
Usage: hostname:port OR service:jmx:protocol:sap
Username: Password:

Sorry about my naivety; I tried to connect based on the above IP but just can't connect. Do I need to do something on the dev server to make it work? thanks -- Alec Li
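The clash likely happens because kafka-run-class.sh applies the exported JMX_PORT to every JVM it launches, so a CLI tool started on the broker host tries to bind the same port the broker already holds. One practical workaround is to pick a port that is not in the busy list before starting. A small sketch, not part of Kafka itself; the 9990-9999 range is an arbitrary choice, and in practice you would feed it the ports from `ss -tln` or `netstat -tln`:

```shell
# Pick the first free TCP port in a range, given a whitespace-separated
# list of ports already in use.
pick_free_port() {
  used="$1"
  for p in $(seq 9990 9999); do
    case " $used " in
      *" $p "*) ;;              # port is busy, try the next one
      *) echo "$p"; return 0;;  # first port not in the busy list
    esac
  done
  return 1                      # whole range busy
}

# Sample: 9990 and 9991 are taken, so 9992 is chosen.
pick_free_port "9990 9991"
```

Hypothetical usage on a broker host: `JMX_PORT=$(pick_free_port "$(ss -tln | awk 'NR>1 {sub(/.*:/, "", $4); print $4}')") bin/kafka-console-consumer.sh ...` so each tool gets its own JMX port.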
Re: kafka monitoring
In addition, I found all the attributes in the jconsole MBeans tab are useful, but they are not graphed. So again, if I want real-time graphing, is jmxtrans + graphite the solution? thanks AL

On Thu, Jan 8, 2015 at 1:35 PM, Sa Li sal...@gmail.com wrote: [...]

-- Alec Li
Re: NotLeaderForPartitionException while doing performance test
26u IPv6 213145 0t0 TCP *:2181 (LISTEN)
java 22152 root 27u IPv6 211541 0t0 TCP exemplary-birds.master:3888 (LISTEN)
java 22152 root 28u IPv6 443527 0t0 TCP exemplary-birds.master:3888->complicated-laugh.master:43940 (ESTABLISHED)
java 22152 root 29u IPv6 23347 0t0 TCP exemplary-birds.master:43797->harmful-jar.master:2888 (ESTABLISHED)
java 22152 root 30u IPv6 204517 0t0 TCP exemplary-birds.master:3888->harmful-jar.master:50791 (ESTABLISHED)
java 22152 root 31u IPv6 4278513 0t0 TCP exemplary-birds.master:3888->voluminous-mass.master:50452 (ESTABLISHED)
java 22152 root 32u IPv6 4345845 0t0 TCP exemplary-birds.master:2181->harmful-jar.master:45048 (ESTABLISHED)
java 22152 root 33u IPv6 443552 0t0 TCP exemplary-birds.master:3888->beloved-judge.master:56370 (ESTABLISHED)
java 22152 root 35u IPv6 4364514 0t0 TCP exemplary-birds.master:2181->voluminous-mass.master:60600 (ESTABLISHED)
ssh 24632 sa 3u IPv4 4289852 0t0 TCP exemplary-birds.master:60510->harmful-jar.master:ssh (ESTABLISHED)
ssh 24645 sa 3u IPv4 4289867 0t0 TCP exemplary-birds.master:33295->voluminous-mass.master:ssh (ESTABLISHED)

I didn't see anything wrong with it, but it seems the connection was temporarily closed. Anyone have a similar issue? thanks

On Wed, Jan 7, 2015 at 10:32 PM, Jaikiran Pai jai.forums2...@gmail.com wrote: There are different ways to find the connection count, and each one depends on the operating system being used. lsof -i is one option, for example, on *nix systems. -Jaikiran

On Thursday 08 January 2015 11:40 AM, Sa Li wrote: [...]
Re: NotLeaderForPartitionException while doing performance test
Yes, it is a weird hostname ;) — that is what our system guys named it. How do I take note of and measure the connections open to 10.100.98.102? Thanks AL

On Jan 7, 2015 9:42 PM, Jaikiran Pai jai.forums2...@gmail.com wrote:

On Thursday 08 January 2015 01:51 AM, Sa Li wrote: see this type of error again, back to normal in a few secs: [2015-01-07 20:19:49,744] WARN Error in I/O with harmful-jar.master/10.100.98.102 [...]

That's a really weird hostname, the "harmful-jar.master". Is that really your hostname? You mention that this happens during performance testing. Have you taken note of how many connections are open to that 10.100.98.102 IP when this Connection refused exception happens? -Jaikiran
NotLeaderForPartitionException while doing performance test
Hi, All, I am doing a performance test with:

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test-rep-three 5 100 -1 acks=1 bootstrap.servers=10.100.98.100:9092,10.100.98.101:9092,10.100.98.102:9092 buffer.memory=67108864 batch.size=8196

where the topic test-rep-three is described as follows:

bin/kafka-topics.sh --describe --zookeeper 10.100.98.101:2181 --topic test-rep-three
Topic: test-rep-three  PartitionCount: 8  ReplicationFactor: 3  Configs:
  Topic: test-rep-three  Partition: 0  Leader: 100  Replicas: 100,102,101  Isr: 102,101,100
  Topic: test-rep-three  Partition: 1  Leader: 101  Replicas: 101,100,102  Isr: 102,101,100
  Topic: test-rep-three  Partition: 2  Leader: 102  Replicas: 102,101,100  Isr: 101,102,100
  Topic: test-rep-three  Partition: 3  Leader: 100  Replicas: 100,101,102  Isr: 101,100,102
  Topic: test-rep-three  Partition: 4  Leader: 101  Replicas: 101,102,100  Isr: 102,100,101
  Topic: test-rep-three  Partition: 5  Leader: 102  Replicas: 102,100,101  Isr: 100,102,101
  Topic: test-rep-three  Partition: 6  Leader: 102  Replicas: 100,102,101  Isr: 102,101,100
  Topic: test-rep-three  Partition: 7  Leader: 101  Replicas: 101,100,102  Isr: 101,100,102

It produces messages and runs for a while, but periodically throws exceptions like:

org.apache.kafka.common.errors.NotLeaderForPartitionException: This server is not the leader for that topic-partition.
org.apache.kafka.common.errors.NotLeaderForPartitionException: This server is not the leader for that topic-partition.
(same line repeated 11 times in total)

141292 records sent, 28258.4 records/sec (80.85 MB/sec), 551.2 ms avg latency, 1494.0 max latency.
142526 records sent, 28505.2 records/sec (81.55 MB/sec), 580.8 ms avg latency, 1513.0 max latency.
146564 records sent, 29312.8 records/sec (83.86 MB/sec), 557.9 ms avg latency, 1431.0 max latency.
146755 records sent, 29351.0 records/sec (83.97 MB/sec), 556.7 ms avg latency, 1480.0 max latency.
147963 records sent, 29592.6 records/sec (84.67 MB/sec), 556.7 ms avg latency, 1546.0 max latency.
146931 records sent, 29386.2 records/sec (84.07 MB/sec), 550.9 ms avg latency, 1715.0 max latency.
146947 records sent, 29389.4 records/sec (84.08 MB/sec), 555.1 ms avg latency, 1750.0 max latency.
146422 records sent, 29284.4 records/sec (83.78 MB/sec), 557.9 ms avg latency, 1818.0 max latency.
147516 records sent, 29503.2 records/sec (84.41 MB/sec), 555.6 ms avg latency, 1806.0 max latency.
147877 records sent, 29575.4 records/sec (84.62 MB/sec), 552.1 ms avg latency, 1821.0 max latency.
147201 records sent, 29440.2 records/sec (84.23 MB/sec), 554.5 ms avg latency, 1826.0 max latency.
148317 records sent, 29663.4 records/sec (84.87 MB/sec), 558.1 ms avg latency, 1792.0 max latency.
147756 records sent, 29551.2 records/sec (84.55 MB/sec), 550.9 ms avg latency, 1806.0 max latency.

Then it goes back to a normal processing state. Is that because of a rebalance? thanks -- Alec Li
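NotLeaderForPartitionException is a retriable error: it appears when a partition's leadership moves while the producer still holds stale metadata, after which the client refreshes and carries on. One way to gauge the impact is to pull the records/sec figures out of the tool's progress lines and look for dips around the exception bursts. A sketch over sample lines copied from the runs in this thread:

```shell
# Extract records/sec from ProducerPerformance progress lines to
# spot throughput dips around leader changes. $4 is the rate field:
# "<n> records sent, <rate> records/sec (...)".
perf_log='141292 records sent, 28258.4 records/sec (80.85 MB/sec), 551.2 ms avg latency, 1494.0 max latency.
160403 records sent, 32080.6 records/sec (91.78 MB/sec), 507.0 ms avg latency, 2418.0 max latency.
100315 records sent, 19995.0 records/sec (57.21 MB/sec), 774.8 ms avg latency, 3858.0 max latency.'

rates=$(printf '%s\n' "$perf_log" | awk '{print $4}')
printf '%s\n' "$rates"
```

In practice you would pipe the live tool output through the same awk one-liner instead of a hard-coded sample.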
Re: NotLeaderForPartitionException while doing performance test
see this type of error again, back to normal in a few secs:

[2015-01-07 20:19:49,744] WARN Error in I/O with harmful-jar.master/10.100.98.102 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
        at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
        at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
        at org.apache.kafka.common.network.Selector.poll(Selector.java:232)
        at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:191)
        at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:184)
        at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:115)
        at java.lang.Thread.run(Thread.java:745)
[2015-01-07 20:19:49,754] WARN Error in I/O with harmful-jar.master/10.100.98.102 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
        (same stack trace)
[2015-01-07 20:19:49,764] WARN Error in I/O with harmful-jar.master/10.100.98.102 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
        (same stack trace)

160403 records sent, 32080.6 records/sec (91.78 MB/sec), 507.0 ms avg latency, 2418.0 max latency.
109882 records sent, 21976.4 records/sec (62.87 MB/sec), 672.7 ms avg latency, 3529.0 max latency.
100315 records sent, 19995.0 records/sec (57.21 MB/sec), 774.8 ms avg latency, 3858.0 max latency.

On Wed, Jan 7, 2015 at 12:07 PM, Sa Li sal...@gmail.com wrote: [...]
connection error among nodes
Hi, Experts. Ours is a 3-node cluster, and I simply test the producer locally:

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test-rep-three 100 3000 -1 acks=1 bootstrap.servers=10.100.98.100:9092 buffer.memory=67108864 batch.size=8196

But I got the error below. I do think this is a critical issue: it just temporarily loses the connection and then gets it back. What is the reason for this?

[2015-01-07 21:44:14,180] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
        at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
        at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
        at org.apache.kafka.common.network.Selector.poll(Selector.java:232)
        at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:191)
        at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:184)
        at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:115)
        at java.lang.Thread.run(Thread.java:745)
[2015-01-07 21:44:14,190] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
        (same stack trace, repeated every ~10 ms through 21:44:14,240)
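Connection refused (as opposed to a timeout) means nothing was listening on the broker port at that instant, which usually points at a broker process being down or restarting on that node. When the warnings come in bursts, a per-host tally helps localize which broker is flapping. A sketch over sample lines mirroring the log above; in practice you would feed the real producer log in instead of the hard-coded sample:

```shell
# Tally "Error in I/O" warnings per host from producer log output.
# $8 in each WARN line is the "<host>/<ip>" token.
warn_log='[2015-01-07 21:44:14,180] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
[2015-01-07 21:44:14,190] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
[2015-01-07 20:19:49,744] WARN Error in I/O with harmful-jar.master/10.100.98.102 (org.apache.kafka.common.network.Selector)'

counts=$(printf '%s\n' "$warn_log" |
  awk '/Error in I\/O with/ {print $8}' | sort | uniq -c | sort -rn)
printf '%s\n' "$counts"
```

If one host dominates the tally, check whether its broker process is actually up (`ps`, broker logs) during the burst rather than suspecting the network first.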
Re: NotLeaderForPartitionException while doing performance test
I checked the topic config; the ISR changes dynamically:

root@voluminous-mass:/srv/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.101:2181 --topic test-rep-three
Topic: test-rep-three  PartitionCount: 8  ReplicationFactor: 3  Configs:
  Topic: test-rep-three  Partition: 0  Leader: 100  Replicas: 100,102,101  Isr: 100
  Topic: test-rep-three  Partition: 1  Leader: 100  Replicas: 101,100,102  Isr: 100,101,102
  Topic: test-rep-three  Partition: 2  Leader: 102  Replicas: 102,101,100  Isr: 101,102
  Topic: test-rep-three  Partition: 3  Leader: 100  Replicas: 100,101,102  Isr: 100
  Topic: test-rep-three  Partition: 4  Leader: 100  Replicas: 101,102,100  Isr: 100
  Topic: test-rep-three  Partition: 5  Leader: 102  Replicas: 102,100,101  Isr: 100,102,101
  Topic: test-rep-three  Partition: 6  Leader: 100  Replicas: 100,102,101  Isr: 100,102,101
  Topic: test-rep-three  Partition: 7  Leader: 100  Replicas: 101,100,102  Isr: 100

root@voluminous-mass:/srv/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.101:2181 --topic test-rep-three
Topic: test-rep-three  PartitionCount: 8  ReplicationFactor: 3  Configs:
  Topic: test-rep-three  Partition: 0  Leader: 100  Replicas: 100,102,101  Isr: 102,100,101
  Topic: test-rep-three  Partition: 1  Leader: 101  Replicas: 101,100,102  Isr: 101,102,100
  Topic: test-rep-three  Partition: 2  Leader: 102  Replicas: 102,101,100  Isr: 101,102
  Topic: test-rep-three  Partition: 3  Leader: 100  Replicas: 100,101,102  Isr: 101,100,102
  Topic: test-rep-three  Partition: 4  Leader: 101  Replicas: 101,102,100  Isr: 101,102,100
  Topic: test-rep-three  Partition: 5  Leader: 102  Replicas: 102,100,101  Isr: 102,101,100
  Topic: test-rep-three  Partition: 6  Leader: 102  Replicas: 100,102,101  Isr: 102,101
  Topic: test-rep-three  Partition: 7  Leader: 101  Replicas: 101,100,102  Isr: 101,100,102

Why does that happen? thanks

On Wed, Jan 7, 2015 at 12:21 PM, Sa Li sal...@gmail.com wrote: [...]

-- Alec Li
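A shrinking ISR like this usually means follower replicas fell behind (long GC pauses, an overloaded broker, or a broker bouncing) and were dropped from the ISR, then caught up and rejoined; leadership can move along with it, which matches the NotLeaderForPartitionException bursts. One way to watch for it is to flag partitions whose ISR is smaller than the replica list in the --describe output (newer kafka-topics.sh versions also accept an --under-replicated-partitions option). A sketch over sample lines taken from the output above:

```shell
# Flag partitions whose ISR is smaller than the replica set in
# `kafka-topics.sh --describe` output. A shrunken ISR means a
# replica fell behind or its broker dropped out. Field layout:
# Topic: <t> Partition: <p> Leader: <l> Replicas: <r,..> Isr: <i,..>
describe='Topic: test-rep-three Partition: 0 Leader: 100 Replicas: 100,102,101 Isr: 100
Topic: test-rep-three Partition: 1 Leader: 100 Replicas: 101,100,102 Isr: 100,101,102
Topic: test-rep-three Partition: 2 Leader: 102 Replicas: 102,101,100 Isr: 101,102'

under=$(printf '%s\n' "$describe" | awk '
  { nrep = split($8, r, ","); nisr = split($10, i, ",")
    if (nisr < nrep) print "partition", $4, "isr", $10 }')
printf '%s\n' "$under"
```

In practice, pipe the live describe output through the awk program and run it in a loop to watch ISR churn during the performance test.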
Re: connection error among nodes
What bothers me is that sometimes the errors don't pop up and sometimes they do. Why?

On Wed, Jan 7, 2015 at 1:49 PM, Sa Li <sal...@gmail.com> wrote:

Hi, Experts

Ours is a 3-node cluster, and I am simply testing the producer locally:

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test-rep-three 100 3000 -1 acks=1 bootstrap.servers=10.100.98.100:9092 buffer.memory=67108864 batch.size=8196

But I get the errors below. I do think this is a critical issue: it temporarily loses the connection and then gets it back. What is the reason for this?

[2015-01-07 21:44:14,180] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
  at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
  at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
  at org.apache.kafka.common.network.Selector.poll(Selector.java:232)
  at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:191)
  at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:184)
  at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:115)
  at java.lang.Thread.run(Thread.java:745)
[2015-01-07 21:44:14,190] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
  (identical stack trace)
[2015-01-07 21:44:14,200] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
  (identical stack trace)
[2015-01-07 21:44:14,210] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
  (identical stack trace)
[2015-01-07 21:44:14,220] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
  (identical stack trace)
[2015-01-07 21:44:14,230] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
  (identical stack trace)
[2015-01-07 21:44:14,240] WARN Error in I/O with voluminous-mass.master/10.100.98.101 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection
question about jmxtrans to get kafka metrics
Hi, All

I installed jmxtrans and Graphite, hoping to graph metrics from Kafka. But first, when I start jmxtrans (using the example graphite.json), I get these errors:

./jmxtrans.sh start graphite.json

[07 Jan 2015 17:55:58] [ServerScheduler_Worker-4] 180214 DEBUG (com.googlecode.jmxtrans.jobs.ServerJob:31) - + Started server job: Server [host=w2, port=1099, url=service:jmx:rmi:///jndi/rmi://w2:1099/jmxrmi, cronExpression=null, numQueryThreads=null]
[07 Jan 2015 17:55:58] [ServerScheduler_Worker-4] 180217 ERROR (com.googlecode.jmxtrans.jobs.ServerJob:39) - Error
java.io.IOException: Failed to retrieve RMIServer stub: javax.naming.ConfigurationException [Root exception is java.rmi.UnknownHostException: Unknown host: w2; nested exception is: java.net.UnknownHostException: w2]
  at javax.management.remote.rmi.RMIConnector.connect(RMIConnector.java:369)
  at javax.management.remote.JMXConnectorFactory.connect(JMXConnectorFactory.java:268)
  at com.googlecode.jmxtrans.util.JmxUtils.getServerConnection(JmxUtils.java:351)
  at com.googlecode.jmxtrans.util.JmxConnectionFactory.makeObject(JmxConnectionFactory.java:31)
  at org.apache.commons.pool.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:1212)
  at com.googlecode.jmxtrans.jobs.ServerJob.execute(ServerJob.java:36)
  at org.quartz.core.JobRunShell.run(JobRunShell.java:216)
  at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:549)
Caused by: javax.naming.ConfigurationException [Root exception is java.rmi.UnknownHostException: Unknown host: w2; nested exception is: java.net.UnknownHostException: w2]
  at com.sun.jndi.rmi.registry.RegistryContext.lookup(RegistryContext.java:118)
  at com.sun.jndi.toolkit.url.GenericURLContext.lookup(GenericURLContext.java:203)
  at javax.naming.InitialContext.lookup(InitialContext.java:411)
  at javax.management.remote.rmi.RMIConnector.findRMIServerJNDI(RMIConnector.java:1929)
  at javax.management.remote.rmi.RMIConnector.findRMIServer(RMIConnector.java:1896)
  at javax.management.remote.rmi.RMIConnector.connect(RMIConnector.java:286)
  ... 7 more
Caused by: java.rmi.UnknownHostException: Unknown host: w2; nested exception is: java.net.UnknownHostException: w2
  at sun.rmi.transport.tcp.TCPEndpoint.newSocket(TCPEndpoint.java:616)
  at sun.rmi.transport.tcp.TCPChannel.createConnection(TCPChannel.java:216)
  at sun.rmi.transport.tcp.TCPChannel.newConnection(TCPChannel.java:202)
  at sun.rmi.server.UnicastRef.newCall(UnicastRef.java:341)
  at sun.rmi.registry.RegistryImpl_Stub.lookup(Unknown Source)
  at com.sun.jndi.rmi.registry.RegistryContext.lookup(RegistryContext.java:114)
  ... 12 more
Caused by: java.net.UnknownHostException: w2
  at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:178)
  at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
  at java.net.Socket.connect(Socket.java:579)
  at java.net.Socket.connect(Socket.java:528)
  at java.net.Socket.<init>(Socket.java:425)
  at java.net.Socket.<init>(Socket.java:208)
  at sun.rmi.transport.proxy.RMIDirectSocketFactory.createSocket(RMIDirectSocketFactory.java:40)
  at sun.rmi.transport.proxy.RMIMasterSocketFactory.createSocket(RMIMasterSocketFactory.java:147)
  at sun.rmi.transport.tcp.TCPEndpoint.newSocket(TCPEndpoint.java:613)
  ... 17 more

The graphite.json:

{
  "servers": [
    {
      "port": 1099,
      "host": "w2",
      "queries": [
        {
          "obj": "java.lang:type=Memory",
          "attr": ["HeapMemoryUsage", "NonHeapMemoryUsage"],
          "outputWriters": [
            {
              "@class": "com.googlecode.jmxtrans.model.output.GraphiteWriter",
              "settings": {
                "port": 2003,
                "host": "10.100.70.128"
              }
            }
          ]
        }
      ]
    }
  ]
}

Can anyone help me diagnose this problem?

thanks

-- Alec Li
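The root cause in the trace above is `java.net.UnknownHostException: w2`: the host name `w2` from graphite.json does not resolve on the machine running jmxtrans, so the JMX/RMI connection never even starts. A quick way to confirm is a plain resolution check before blaming JMX; a minimal stdlib-only sketch (host names here are just examples):

```python
import socket

def check_resolves(host: str) -> bool:
    """Return True if `host` resolves to an IP address on this machine."""
    try:
        ip = socket.gethostbyname(host)
        print(f"{host} -> {ip}")
        return True
    except socket.gaierror:
        print(f"{host} does not resolve; add it to /etc/hosts or use an IP/FQDN")
        return False

# "localhost" should always resolve; "w2" only resolves if /etc/hosts or DNS knows it.
check_resolves("localhost")
```

If the check fails for `w2`, either add a `w2` entry to /etc/hosts on the jmxtrans host or put the broker's IP or fully-qualified name in graphite.json.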
no space left error
Hi, All

I am running a performance test on our new Kafka production server. After sending some messages (even fake messages via bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance), connection errors appear and the brokers shut down. After that I see errors such as:

-su: cannot create temp file for here-document: No space left on device

How can I fix this? I am concerned this will happen once we start publishing real messages to Kafka. Should I create a cron job to regularly clean certain directories?

thanks

-- Alec Li
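The "No space left on device" in the here-document error comes from the shell and the JVM's temp directory filling up, not from Kafka's own code, so the first thing to check is free space on the filesystems involved (the log directory and /tmp). A small stdlib sketch for that check (the paths are examples):

```python
import shutil

def free_gb(path: str) -> float:
    """Free space on the filesystem containing `path`, in GiB."""
    usage = shutil.disk_usage(path)
    return usage.free / (1024 ** 3)

# Check the shell/JVM temp dir and the root filesystem.
for p in ("/tmp", "/"):
    print(f"{p}: {free_gb(p):.1f} GiB free")
```

If /tmp is the full one, the JVM warning in the later messages already hints at the workaround: point `-Djava.io.tmpdir=` at a roomier location.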
Re: no space left error
Continuing this issue: when I restart the server with

bin/kafka-server-start.sh config/server.properties

it fails to start:

[2015-01-06 20:00:55,441] FATAL Fatal error during KafkaServerStable startup. Prepare to shutdown (kafka.server.KafkaServerStartable)
java.lang.InternalError: a fault occurred in a recent unsafe memory access operation in compiled Java code
  at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:331)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:188)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:165)
  at kafka.utils.IteratorTemplate.maybeComputeNext(IteratorTemplate.scala:66)
  at kafka.utils.IteratorTemplate.hasNext(IteratorTemplate.scala:58)
  at kafka.log.LogSegment.recover(LogSegment.scala:165)
  at kafka.log.Log.recoverLog(Log.scala:179)
  at kafka.log.Log.loadSegments(Log.scala:155)
  at kafka.log.Log.<init>(Log.scala:64)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:118)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:113)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:113)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:105)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.WrappedArray.foreach(WrappedArray.scala:34)
  at kafka.log.LogManager.loadLogs(LogManager.scala:105)
  at kafka.log.LogManager.<init>(LogManager.scala:57)
  at kafka.server.KafkaServer.createLogManager(KafkaServer.scala:275)
  at kafka.server.KafkaServer.startup(KafkaServer.scala:72)
  at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:34)
  at kafka.Kafka$.main(Kafka.scala:46)
  at kafka.Kafka.main(Kafka.scala)
[2015-01-06 20:00:55,443] INFO [Kafka Server 100], shutting down (kafka.server.KafkaServer)
[2015-01-06 20:00:55,444] INFO Terminate ZkClient event thread. (org.I0Itec.zkclient.ZkEventThread)
[2015-01-06 20:00:55,446] INFO Session: 0x684a5ed9da3a1a0f closed (org.apache.zookeeper.ZooKeeper)
[2015-01-06 20:00:55,446] INFO EventThread shut down (org.apache.zookeeper.ClientCnxn)
[2015-01-06 20:00:55,447] INFO [Kafka Server 100], shut down completed (kafka.server.KafkaServer)
[2015-01-06 20:00:55,447] INFO [Kafka Server 100], shutting down (kafka.server.KafkaServer)

Any ideas?

On Tue, Jan 6, 2015 at 12:00 PM, Sa Li <sal...@gmail.com> wrote:

The complete error message:

-su: cannot create temp file for here-document: No space left on device
OpenJDK 64-Bit Server VM warning: Insufficient space for shared memory file: /tmp/hsperfdata_root/19721
Try using the -Djava.io.tmpdir= option to select an alternate temp location.
[2015-01-06 19:50:49,244] FATAL (kafka.Kafka$)
java.io.FileNotFoundException: conf (No such file or directory)
  at java.io.FileInputStream.open(Native Method)
  at java.io.FileInputStream.<init>(FileInputStream.java:146)
  at java.io.FileInputStream.<init>(FileInputStream.java:101)
  at kafka.utils.Utils$.loadProps(Utils.scala:144)
  at kafka.Kafka$.main(Kafka.scala:34)
  at kafka.Kafka.main(Kafka.scala)

On Tue, Jan 6, 2015 at 11:58 AM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I am running a performance test on our new Kafka production server. After sending some messages (even fake messages via bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance), connection errors appear and the brokers shut down. After that I see errors such as:

-su: cannot create temp file for here-document: No space left on device

How can I fix this? I am concerned this will happen once we start publishing real messages to Kafka. Should I create a cron job to regularly clean certain directories?

thanks

-- Alec Li

-- Alec Li

-- Alec Li
Re: no space left error
Thanks for the reply. The disk is not full:

root@exemplary-birds:~# df -h
Filesystem  Size  Used  Avail  Use%  Mounted on
/dev/sda2   133G  3.4G  123G     3%  /
none        4.0K     0  4.0K     0%  /sys/fs/cgroup
udev         32G  4.0K   32G     1%  /dev
tmpfs       6.3G  764K  6.3G     1%  /run
none        5.0M     0  5.0M     0%  /run/lock
none         32G     0   32G     0%  /run/shm
none        100M     0  100M     0%  /run/user
/dev/sdb1    14T   15G   14T     1%  /srv

Neither is the memory:

root@exemplary-birds:~# free
                 total      used      free  shared  buffers   cached
Mem:          65963372   9698380  56264992     776   170668  7863812
-/+ buffers/cache:       1663900  64299472
Swap:           997372         0    997372

thanks

On Tue, Jan 6, 2015 at 12:10 PM, David Birdsong <david.birds...@gmail.com> wrote:

I'm keen to hear how to work one's way out of a filled partition, since I've run into this many times after having tuned retention bytes or retention (time?) incorrectly. The proper path to resolving this isn't obvious from my many harried searches through the documentation. I often end up stopping the particular broker, picking an unlucky topic/partition, deleting it, modifying any topics that consumed too much space by lowering their retention bytes, and restarting.

On Tue, Jan 6, 2015 at 12:02 PM, Sa Li <sal...@gmail.com> wrote:

Continuing this issue: when I restart the server with

bin/kafka-server-start.sh config/server.properties

it fails to start:

[2015-01-06 20:00:55,441] FATAL Fatal error during KafkaServerStable startup. Prepare to shutdown (kafka.server.KafkaServerStartable)
java.lang.InternalError: a fault occurred in a recent unsafe memory access operation in compiled Java code
  at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:331)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:188)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:165)
  at kafka.utils.IteratorTemplate.maybeComputeNext(IteratorTemplate.scala:66)
  at kafka.utils.IteratorTemplate.hasNext(IteratorTemplate.scala:58)
  at kafka.log.LogSegment.recover(LogSegment.scala:165)
  at kafka.log.Log.recoverLog(Log.scala:179)
  at kafka.log.Log.loadSegments(Log.scala:155)
  at kafka.log.Log.<init>(Log.scala:64)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:118)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:113)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:113)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:105)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.WrappedArray.foreach(WrappedArray.scala:34)
  at kafka.log.LogManager.loadLogs(LogManager.scala:105)
  at kafka.log.LogManager.<init>(LogManager.scala:57)
  at kafka.server.KafkaServer.createLogManager(KafkaServer.scala:275)
  at kafka.server.KafkaServer.startup(KafkaServer.scala:72)
  at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:34)
  at kafka.Kafka$.main(Kafka.scala:46)
  at kafka.Kafka.main(Kafka.scala)
[2015-01-06 20:00:55,443] INFO [Kafka Server 100], shutting down (kafka.server.KafkaServer)
[2015-01-06 20:00:55,444] INFO Terminate ZkClient event thread.

Any ideas?

On Tue, Jan 6, 2015 at 12:00 PM, Sa Li <sal...@gmail.com> wrote:

The complete error message:

-su: cannot create temp file for here-document: No space left on device
OpenJDK 64-Bit Server VM warning: Insufficient space for shared memory file: /tmp/hsperfdata_root/19721
Try using the -Djava.io.tmpdir= option to select an alternate temp location.
[2015-01-06 19:50:49,244] FATAL (kafka.Kafka$)
java.io.FileNotFoundException: conf (No such file or directory)
  at java.io.FileInputStream.open(Native Method)
  at java.io.FileInputStream.<init>(FileInputStream.java:146)
  at java.io.FileInputStream.<init>(FileInputStream.java:101
some connection errors happen in performance test
Hi, All

I am running a performance test on Kafka with the command

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test-rep-three 500 100 -1 acks=1 bootstrap.servers=10.100.10.101:9092 buffer.memory=67108864 batch.size=8196

We send 50 billion records to the brokers; it was mostly OK, but errors like this pop out periodically:

[2015-01-06 19:38:32,127] WARN Error in I/O with exemplary-birds.master/127.0.1.1 (org.apache.kafka.common.network.Selector)
java.net.ConnectException: Connection refused
  at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
  at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
  at org.apache.kafka.common.network.Selector.poll(Selector.java:232)
  at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:191)
  at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:184)
  at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:115)
  at java.lang.Thread.run(Thread.java:745)

1950 records sent, 224.4 records/sec (0.02 MB/sec), 611.4 ms avg latency, 9259.0 max latency.
2899650 records sent, 579930.0 records/sec (55.31 MB/sec), 2399.5 ms avg latency, 9505.0 max latency.
3170219 records sent, 634043.8 records/sec (60.47 MB/sec), 568.7 ms avg latency, 1201.0 max latency.

And the errors seem to be happening more often. Our Kafka cluster is a three-node cluster.

thanks

-- Alec Li
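"Connection refused" means the TCP connect reached a machine but nothing was listening on that port at that moment; notably, the trace above shows the broker host name resolving to 127.0.1.1, a loopback-style address, which on Debian/Ubuntu often comes from the default /etc/hosts entry rather than the host's real IP. A simple reachability probe can separate "broker briefly down" from "wrong address"; a stdlib sketch (the broker address below is just this thread's example):

```python
import socket

def port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout`."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # refused, timed out, or unresolvable
        return False

# Probe each broker address the producer is configured with.
for hp in ("10.100.10.101:9092",):
    host, port = hp.split(":")
    state = "reachable" if port_open(host, int(port)) else "refused/unreachable"
    print(hp, state)
```

Running this in a loop during the test would show whether the refusals line up with a broker restarting or with a bad hostname-to-IP mapping.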
config for consumer and producer
Hi, All

I am testing and making changes to server.properties, and I wonder whether I also need to change values in the consumer and producer properties files. Here is consumer.properties:

zookeeper.connect=10.100.98.100:2181,10.100.98.101:2181,10.100.98.102:2181
# timeout in ms for connecting to zookeeper
zookeeper.connection.timeout.ms=100
# consumer group id
group.id=test-consumer-group
# consumer timeout
#consumer.timeout.ms=5000

I use the defaults for most parameters. group.id is defined as "A string that uniquely identifies the group of consumer processes to which this consumer belongs. By setting the same group id, multiple processes indicate that they are all part of the same consumer group." Do I need to define many consumer groups here?

As for the producer: we are not using a Java client; a C# client sends messages to Kafka, so the producer properties shouldn't matter (except when I run producer tests locally), right?

producer.type=sync
compression.codec=none

Thanks

-- Alec Li
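On the group.id question: you only need multiple groups if multiple independent applications should each receive a full copy of the stream. Consumers sharing one group.id split a topic's partitions among themselves; different group ids each consume everything. A toy sketch of that partition-splitting idea (illustrative only, not Kafka's actual assignor code; names are made up):

```python
from collections import defaultdict

def assign(partitions: int, consumers: list[str]) -> dict[str, list[int]]:
    """Round-robin a topic's partitions across the consumers in one group."""
    out = defaultdict(list)
    for p in range(partitions):
        out[consumers[p % len(consumers)]].append(p)
    return dict(out)

# 8 partitions (num.partitions=8 elsewhere in this archive) shared by 2 consumers:
print(assign(8, ["c1", "c2"]))
```

With one consumer in the group it would own all 8 partitions; a second group with its own group.id would independently read all 8 as well.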
java.io.IOException: Connection reset by peer
Hi, All

I am running a C# producer to send messages to Kafka (a 3-node cluster), but I get these errors:

[2015-01-06 16:09:51,143] ERROR Closing socket for /10.100.70.128 because of error (kafka.network.Processor)
java.io.IOException: Connection reset by peer
  at sun.nio.ch.FileDispatcherImpl.read0(Native Method)
  at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
  at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223)
  at sun.nio.ch.IOUtil.read(IOUtil.java:197)
  at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
  at kafka.utils.Utils$.read(Utils.scala:380)
  at kafka.network.BoundedByteBufferReceive.readFrom(BoundedByteBufferReceive.scala:54)
  at kafka.network.Processor.read(SocketServer.scala:444)
  at kafka.network.Processor.run(SocketServer.scala:340)
  at java.lang.Thread.run(Thread.java:745)
[2015-01-06 16:09:51,144] ERROR Closing socket for /10.100.70.128 because of error (kafka.network.Processor)
java.io.IOException: Connection reset by peer
  (identical stack trace)
[2015-01-06 16:09:56,138] INFO Closing socket connection to /10.100.70.28. (kafka.network.Processor)
[2015-01-06 16:10:07,685] INFO Closing socket connection to /10.100.70.28. (kafka.network.Processor)
[2015-01-06 16:10:31,423] INFO Closing socket connection to /10.100.70.28. (kafka.network.Processor)
[2015-01-06 16:11:08,077] INFO Closing socket connection to /10.100.70.28. (kafka.network.Processor)
[2015-01-06 16:11:43,990] INFO Closing socket connection to /10.100.70.28. (kafka.network.Processor)
[2015-01-06 16:12:24,168] INFO Closing socket connection to /10.100.70.128. (kafka.network.Processor)

But I do see messages arriving on the brokers. Any ideas?

thanks

-- Alec Li
Re: messages lost
Hi, experts

Again, we are still losing data: we send 5000 records but find only 4500 records on the brokers. We did set required.acks to -1 to make sure all brokers ack; that only added long latency, it did not cure the data loss.

thanks

On Mon, Jan 5, 2015 at 9:55 AM, Xiaoyu Wang <xw...@rocketfuel.com> wrote:

@Sa, required.acks is a producer-side configuration. Setting it to -1 means requiring acks from all brokers.

On Fri, Jan 2, 2015 at 1:51 PM, Sa Li <sal...@gmail.com> wrote:

Thanks a lot, Tim. This is the brokers' config:

--
broker.id=1
port=9092
host.name=10.100.70.128
num.network.threads=4
num.io.threads=8
socket.send.buffer.bytes=1048576
socket.receive.buffer.bytes=1048576
socket.request.max.bytes=104857600
auto.leader.rebalance.enable=true
auto.create.topics.enable=true
default.replication.factor=3
log.dirs=/tmp/kafka-logs-1
num.partitions=8
log.flush.interval.messages=1
log.flush.interval.ms=1000
log.retention.hours=168
log.segment.bytes=536870912
log.cleanup.interval.mins=1
zookeeper.connect=10.100.70.128:2181,10.100.70.28:2181,10.100.70.29:2181
zookeeper.connection.timeout.ms=100
---

We actually played around with request.required.acks in the producer config: -1 causes long latency, and 1 is the setting under which messages are lost. But I am not sure whether this is the reason we lose records.

thanks

AL

On Fri, Jan 2, 2015 at 9:59 AM, Timothy Chen <tnac...@gmail.com> wrote:

What's your configured required.acks? And also, are you waiting for all your messages to be acknowledged? The new producer returns futures, but you still need to wait for the futures to complete.

Tim

On Fri, Jan 2, 2015 at 9:54 AM, Sa Li <sal...@gmail.com> wrote:

Hi, all

We are sending messages from a producer: we send 10 records but see only 99573 records for that topic (we confirm this by consuming the topic and checking the log size in the Kafka web console). Any ideas on the message loss? What could cause it?

thanks

-- Alec Li

-- Alec Li

-- Alec Li
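Tim's point in the quoted thread is easy to miss: with the new producer, each send returns a future, and records not yet acknowledged are silently dropped if the process exits without waiting on those futures, regardless of the acks setting. The shape of the fix, sketched with Python's stdlib futures rather than a real Kafka client (`send_record` is a stand-in, not an actual producer API):

```python
from concurrent.futures import ThreadPoolExecutor

def send_record(record: str) -> str:
    """Stand-in for an async producer send; a real client returns a future per send."""
    return f"acked:{record}"

records = [f"msg-{i}" for i in range(100)]
with ThreadPoolExecutor(max_workers=4) as pool:
    futures = [pool.submit(send_record, r) for r in records]
    # The crucial step: block until every send is acknowledged (or raises)
    # before declaring the batch done -- otherwise in-flight records are lost.
    acked = [f.result() for f in futures]

print(f"{len(acked)} of {len(records)} records acknowledged")
```

With a real Kafka producer the same principle applies: keep the returned futures and call their blocking get/result (or flush) before shutting down, and treat a raised exception as a record that needs retrying.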
Re: no space left error
BTW, I found that /kafka/logs is also getting bigger and bigger, e.g. controller.log and state-change.log. Should I launch a cron job to clean them up regularly, or is there a way to have them deleted automatically?

thanks

AL

On Tue, Jan 6, 2015 at 2:01 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

We fixed the problem; I'd like to share what it was in case someone comes across a similar issue. We added a data drive /dev/sdb1 on each node but specified the wrong path in server.properties, which means the data was written to the wrong drive, /dev/sda2, and quickly ate up all the space on sda2; we have now changed the path. sdb1 has 15 TB, which lets us store data for a while, and data will be deleted after 1-2 weeks as the config specifies.

But I am curious about David's comment: "... after having tuned retention bytes or retention (time?) incorrectly ..." How do you set log.retention.bytes? I set log.retention.hours=336 (2 weeks); should I leave log.retention.bytes at its default of -1, or set some other amount?

thanks

AL

On Tue, Jan 6, 2015 at 12:43 PM, Sa Li <sal...@gmail.com> wrote:

Thanks for the reply. The disk is not full:

root@exemplary-birds:~# df -h
Filesystem  Size  Used  Avail  Use%  Mounted on
/dev/sda2   133G  3.4G  123G     3%  /
none        4.0K     0  4.0K     0%  /sys/fs/cgroup
udev         32G  4.0K   32G     1%  /dev
tmpfs       6.3G  764K  6.3G     1%  /run
none        5.0M     0  5.0M     0%  /run/lock
none         32G     0   32G     0%  /run/shm
none        100M     0  100M     0%  /run/user
/dev/sdb1    14T   15G   14T     1%  /srv

Neither is the memory:

root@exemplary-birds:~# free
                 total      used      free  shared  buffers   cached
Mem:          65963372   9698380  56264992     776   170668  7863812
-/+ buffers/cache:       1663900  64299472
Swap:           997372         0    997372

thanks

On Tue, Jan 6, 2015 at 12:10 PM, David Birdsong <david.birds...@gmail.com> wrote:

I'm keen to hear how to work one's way out of a filled partition, since I've run into this many times after having tuned retention bytes or retention (time?) incorrectly. The proper path to resolving this isn't obvious from my many harried searches through the documentation. I often end up stopping the particular broker, picking an unlucky topic/partition, deleting it, modifying any topics that consumed too much space by lowering their retention bytes, and restarting.

On Tue, Jan 6, 2015 at 12:02 PM, Sa Li <sal...@gmail.com> wrote:

Continuing this issue: when I restart the server with

bin/kafka-server-start.sh config/server.properties

it fails to start:

[2015-01-06 20:00:55,441] FATAL Fatal error during KafkaServerStable startup. Prepare to shutdown (kafka.server.KafkaServerStartable)
java.lang.InternalError: a fault occurred in a recent unsafe memory access operation in compiled Java code
  at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:331)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:188)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:165)
  at kafka.utils.IteratorTemplate.maybeComputeNext(IteratorTemplate.scala:66)
  at kafka.utils.IteratorTemplate.hasNext(IteratorTemplate.scala:58)
  at kafka.log.LogSegment.recover(LogSegment.scala:165)
  at kafka.log.Log.recoverLog(Log.scala:179)
  at kafka.log.Log.loadSegments(Log.scala:155)
  at kafka.log.Log.<init>(Log.scala:64)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:118)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:113)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:113)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:105)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.WrappedArray.foreach(WrappedArray.scala:34)
  at kafka.log.LogManager.loadLogs(LogManager.scala:105)
  at kafka.log.LogManager.<init>(LogManager.scala:57)
  at kafka.server.KafkaServer.createLogManager(KafkaServer.scala:275)
  at kafka.server.KafkaServer.startup(KafkaServer.scala:72)
  at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:34)
  at kafka.Kafka$.main(Kafka.scala:46)
  at kafka.Kafka.main(Kafka.scala)
[2015-01-06 20:00:55,443] INFO [Kafka Server 100], shutting down (kafka.server.KafkaServer)
[2015-01-06 20:00:55,444] INFO Terminate ZkClient event thread
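On the retention question above: log.retention.hours and log.retention.bytes are independent limits, and whichever is hit first triggers segment deletion; -1 for bytes means no size cap, which is safe only if time-based retention keeps usage below the disk size. A back-of-envelope check per partition (the rates below are made-up examples, not measurements from this cluster):

```python
def retention_bytes_needed(msgs_per_sec: float, msg_bytes: int, hours: int) -> int:
    """Bytes one partition accumulates before time-based retention starts deleting."""
    return int(msgs_per_sec * msg_bytes * 3600 * hours)

# e.g. 1000 msg/s of 500-byte messages kept for 336 h (2 weeks):
need = retention_bytes_needed(1000, 500, 336)
print(f"{need / 1024**4:.2f} TiB per partition")
```

If that figure times the number of partition replicas on a broker approaches the data drive's capacity, a size cap via log.retention.bytes (which is per partition) acts as a safety net before the disk fills.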
Re: no space left error
Hi, All

We fixed the problem; I'd like to share what it was in case someone comes across a similar issue. We added a data drive /dev/sdb1 on each node but specified the wrong path in server.properties, which means the data was written to the wrong drive, /dev/sda2, and quickly ate up all the space on sda2; we have now changed the path. sdb1 has 15 TB, which lets us store data for a while, and data will be deleted after 1-2 weeks as the config specifies.

But I am curious about David's comment: "... after having tuned retention bytes or retention (time?) incorrectly ..." How do you set log.retention.bytes? I set log.retention.hours=336 (2 weeks); should I leave log.retention.bytes at its default of -1, or set some other amount?

thanks

AL

On Tue, Jan 6, 2015 at 12:43 PM, Sa Li <sal...@gmail.com> wrote:

Thanks for the reply. The disk is not full:

root@exemplary-birds:~# df -h
Filesystem  Size  Used  Avail  Use%  Mounted on
/dev/sda2   133G  3.4G  123G     3%  /
none        4.0K     0  4.0K     0%  /sys/fs/cgroup
udev         32G  4.0K   32G     1%  /dev
tmpfs       6.3G  764K  6.3G     1%  /run
none        5.0M     0  5.0M     0%  /run/lock
none         32G     0   32G     0%  /run/shm
none        100M     0  100M     0%  /run/user
/dev/sdb1    14T   15G   14T     1%  /srv

Neither is the memory:

root@exemplary-birds:~# free
                 total      used      free  shared  buffers   cached
Mem:          65963372   9698380  56264992     776   170668  7863812
-/+ buffers/cache:       1663900  64299472
Swap:           997372         0    997372

thanks

On Tue, Jan 6, 2015 at 12:10 PM, David Birdsong <david.birds...@gmail.com> wrote:

I'm keen to hear how to work one's way out of a filled partition, since I've run into this many times after having tuned retention bytes or retention (time?) incorrectly. The proper path to resolving this isn't obvious from my many harried searches through the documentation. I often end up stopping the particular broker, picking an unlucky topic/partition, deleting it, modifying any topics that consumed too much space by lowering their retention bytes, and restarting.

On Tue, Jan 6, 2015 at 12:02 PM, Sa Li <sal...@gmail.com> wrote:

Continuing this issue: when I restart the server with

bin/kafka-server-start.sh config/server.properties

it fails to start:

[2015-01-06 20:00:55,441] FATAL Fatal error during KafkaServerStable startup. Prepare to shutdown (kafka.server.KafkaServerStartable)
java.lang.InternalError: a fault occurred in a recent unsafe memory access operation in compiled Java code
  at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57)
  at java.nio.ByteBuffer.allocate(ByteBuffer.java:331)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:188)
  at kafka.log.FileMessageSet$$anon$1.makeNext(FileMessageSet.scala:165)
  at kafka.utils.IteratorTemplate.maybeComputeNext(IteratorTemplate.scala:66)
  at kafka.utils.IteratorTemplate.hasNext(IteratorTemplate.scala:58)
  at kafka.log.LogSegment.recover(LogSegment.scala:165)
  at kafka.log.Log.recoverLog(Log.scala:179)
  at kafka.log.Log.loadSegments(Log.scala:155)
  at kafka.log.Log.<init>(Log.scala:64)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:118)
  at kafka.log.LogManager$$anonfun$loadLogs$1$$anonfun$apply$4.apply(LogManager.scala:113)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:105)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:113)
  at kafka.log.LogManager$$anonfun$loadLogs$1.apply(LogManager.scala:105)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.WrappedArray.foreach(WrappedArray.scala:34)
  at kafka.log.LogManager.loadLogs(LogManager.scala:105)
  at kafka.log.LogManager.<init>(LogManager.scala:57)
  at kafka.server.KafkaServer.createLogManager(KafkaServer.scala:275)
  at kafka.server.KafkaServer.startup(KafkaServer.scala:72)
  at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:34)
  at kafka.Kafka$.main(Kafka.scala:46)
  at kafka.Kafka.main(Kafka.scala)
[2015-01-06 20:00:55,443] INFO [Kafka Server 100], shutting down (kafka.server.KafkaServer)
[2015-01-06 20:00:55,444] INFO Terminate ZkClient event thread. (org.I0Itec.zkclient.ZkEventThread)
[2015-01-06 20:00:55,446] INFO Session: 0x684a5ed9da3a1a0f closed (org.apache.zookeeper.ZooKeeper)
[2015-01-06 20:00:55,446] INFO EventThread shut down (org.apache.zookeeper.ClientCnxn)
[2015-01-06 20:00:55,447] INFO [Kafka Server 100], shut down completed
Re: no space left error
The complete error message:

-su: cannot create temp file for here-document: No space left on device
OpenJDK 64-Bit Server VM warning: Insufficient space for shared memory file: /tmp/hsperfdata_root/19721
Try using the -Djava.io.tmpdir= option to select an alternate temp location.
[2015-01-06 19:50:49,244] FATAL (kafka.Kafka$)
java.io.FileNotFoundException: conf (No such file or directory)
        at java.io.FileInputStream.open(Native Method)
        at java.io.FileInputStream.<init>(FileInputStream.java:146)
        at java.io.FileInputStream.<init>(FileInputStream.java:101)
        at kafka.utils.Utils$.loadProps(Utils.scala:144)
        at kafka.Kafka$.main(Kafka.scala:34)
        at kafka.Kafka.main(Kafka.scala)

On Tue, Jan 6, 2015 at 11:58 AM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I am doing a performance test on our new kafka production server, but after sending some messages (even fake messages via bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance), it comes out with a connection error and shuts down the brokers. After that, I see such errors:

conf-su: cannot create temp file for here-document: No space left on device

How can I fix it? I am concerned this will happen again when we start to publish real messages to kafka; should I create a cron job to regularly clean certain directories?

thanks

-- Alec Li

-- Alec Li
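The errors in this thread all trace back to the filesystem holding /tmp filling up. As a minimal sketch (not part of Kafka; `check_free_space` is a hypothetical helper), you can verify free space on the log and temp directories before starting the broker:

```python
import shutil

def check_free_space(path, min_free_bytes):
    """Return (free_bytes, ok) for the filesystem holding `path`."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free
    return usage.free, usage.free >= min_free_bytes

# Example: require at least 1 GB free on /tmp before launching Kafka.
free, ok = check_free_space("/tmp", 1 * 1024**3)
if not ok:
    print("WARNING: less than 1 GB free on /tmp; Kafka may fail to start")
```

A cron job that deletes old files is a workaround; the more durable fix (discussed later in this archive) is pointing log.dirs and java.io.tmpdir at a partition with enough space.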
Re: messages lost
Thanks a lot, Tim. This is the broker config:

--
broker.id=1
port=9092
host.name=10.100.70.128
num.network.threads=4
num.io.threads=8
socket.send.buffer.bytes=1048576
socket.receive.buffer.bytes=1048576
socket.request.max.bytes=104857600
auto.leader.rebalance.enable=true
auto.create.topics.enable=true
default.replication.factor=3
log.dirs=/tmp/kafka-logs-1
num.partitions=8
log.flush.interval.messages=1
log.flush.interval.ms=1000
log.retention.hours=168
log.segment.bytes=536870912
log.cleanup.interval.mins=1
zookeeper.connect=10.100.70.128:2181,10.100.70.28:2181,10.100.70.29:2181
zookeeper.connection.timeout.ms=100
---

We actually played around with request.required.acks in the producer config: -1 causes long latency, and 1 is the setting that loses messages. But I am not sure this is the reason the records are lost.

thanks

AL

On Fri, Jan 2, 2015 at 9:59 AM, Timothy Chen <tnac...@gmail.com> wrote:

What's your configured required.acks? And also, are you waiting for all your messages to be acknowledged as well? The new producer returns futures back, but you still need to wait for the futures to complete.

Tim

On Fri, Jan 2, 2015 at 9:54 AM, Sa Li <sal...@gmail.com> wrote:

Hi, all

We are sending messages from a producer; we send 10 records, but we see only 99573 records for that topic. We confirmed this by consuming the topic and checking the log size in the kafka web console. Any ideas why messages are lost, and what could cause this?

thanks

-- Alec Li

-- Alec Li
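Tim's point is that the new producer is asynchronous: a send that has not been awaited can be silently dropped when the process exits. A minimal Python sketch of the "wait for every future" pattern, written against a duck-typed producer (`send_and_wait` is an illustrative helper, not Kafka API; with kafka-python you would pass a real KafkaProducer, whose `send()` also returns an object with `.get(timeout=...)`):

```python
def send_and_wait(producer, topic, records, timeout=30):
    """Send every record, then block until each future resolves.

    `producer.send(topic, value)` must return a future-like object
    exposing .get(timeout=...), as kafka-python's FutureRecordMetadata does.
    Returns the number of acknowledged records; .get() raises if a
    record was never acknowledged, so losses surface as exceptions.
    """
    futures = [producer.send(topic, value=r) for r in records]
    acked = 0
    for f in futures:
        f.get(timeout=timeout)  # raises on broker error or timeout
        acked += 1
    return acked
```

With a real kafka-python producer you would also call `producer.flush()` before closing, so buffered batches are pushed out rather than discarded.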
messages lost
Hi, all

We are sending messages from a producer; we send 10 records, but we see only 99573 records for that topic. We confirmed this by consuming the topic and checking the log size in the kafka web console. Any ideas why messages are lost, and what could cause this?

thanks

-- Alec Li
Re: kafka logs gone after reboot the server
One more question: when I set log.dirs on the different nodes in the cluster, should I give each a different name, say kafka-logs-1 associated with the broker id, or can I use the same directory name, like /var/log/kafka, on every node (assuming one broker per server)?

thanks

On Fri, Jan 2, 2015 at 2:20 PM, Sa Li <sal...@gmail.com> wrote:

Thanks a lot!

On Fri, Jan 2, 2015 at 12:15 PM, Jay Kreps <jay.kr...@gmail.com> wrote:

Nice catch Joe--several people have complained about this as a problem and we were a bit mystified as to what kind of bug could lead to all their logs getting deleted and re-replicated when they bounced the server. We assumed bounced meant restarted the app, but I think likely what is happening is what you describe--the logs were in /tmp and bouncing the server meant restarting.

-Jay

On Fri, Jan 2, 2015 at 11:02 AM, Joe Stein <joe.st...@stealth.ly> wrote:

That is because your logs are in /tmp, which you can change by setting log.dirs to something else.

Joe Stein
Founder, Principal Consultant
Big Data Open Source Security LLC
http://www.stealth.ly
Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop

On Fri, Jan 2, 2015 at 1:58 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I've just noticed one thing: when I experience some errors on the Kafka servers, I reboot the dev servers (not a good way). After reboot I get into zkCli and can see all the topics still exist, but when I get into the kafka log directory, I found all the data gone, see

root@DO-mq-dev:/tmp/kafka-logs-1/ui_test_topic_4-0# ll
total 8
drwxr-xr-x  2 root root     4096 Jan  2 09:39 ./
drwxr-xr-x 46 root root     4096 Jan  2 10:46 ../
-rw-r--r--  1 root root 10485760 Jan  2 09:39 .index
-rw-r--r--  1 root root        0 Jan  2 09:39 .log

I wonder: if for some reason the server goes down and is restarted, will all the data on the hard drive be gone?

thanks

-- Alec Li

-- Alec Li

-- Alec Li
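Following Joe's advice, the key change is moving log.dirs out of /tmp; the same path is fine on every node, since each broker only ever reads its own local disk. A sketch of the relevant server.properties lines (the path below is just an example):

```
# server.properties on each broker -- same log path can be used cluster-wide
broker.id=1                    # must be unique per broker
log.dirs=/var/lib/kafka-logs   # any persistent disk outside /tmp
```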
kafka-web-console error
Hi, all

I am running kafka-web-console; I periodically get the following error, which brings the UI down:

! @6kldaf9lj - Internal server error, for (GET) [/assets/images/zookeeper_small.gif] -
play.api.Application$$anon$1: Execution exception[[FileNotFoundException: /vagrant/kafka-web-console-master/target/scala-2.10/classes/public/images/zookeeper_small.gif (Too many open files)]]
        at play.api.Application$class.handleError(Application.scala:293) ~[play_2.10-2.2.1.jar:2.2.1]
        at play.api.DefaultApplication.handleError(Application.scala:399) [play_2.10-2.2.1.jar:2.2.1]
        at play.core.server.netty.PlayDefaultUpstreamHandler$$anonfun$12$$anonfun$apply$1.applyOrElse(PlayDefaultUpstreamHandler.scala:165) [play_2.10-2.2.1.jar:2.2.1]
        at play.core.server.netty.PlayDefaultUpstreamHandler$$anonfun$12$$anonfun$apply$1.applyOrElse(PlayDefaultUpstreamHandler.scala:162) [play_2.10-2.2.1.jar:2.2.1]
        at scala.runtime.AbstractPartialFunction.apply(AbstractPartialFunction.scala:33) [scala-library-2.10.2.jar:na]
        at scala.util.Failure$$anonfun$recover$1.apply(Try.scala:185) [scala-library-2.10.2.jar:na]
Caused by: java.io.FileNotFoundException: /vagrant/kafka-web-console-master/target/scala-2.10/classes/public/images/zookeeper_small.gif (Too many open files)
        at java.io.FileInputStream.open(Native Method) ~[na:1.7.0_65]
        at java.io.FileInputStream.<init>(FileInputStream.java:146) ~[na:1.7.0_65]
        at java.io.FileInputStream.<init>(FileInputStream.java:101) ~[na:1.7.0_65]
        at sun.net.www.protocol.file.FileURLConnection.connect(FileURLConnection.java:90) ~[na:1.7.0_65]
        at sun.net.www.protocol.file.FileURLConnection.getInputStream(FileURLConnection.java:188) ~[na:1.7.0_65]
        at java.net.URL.openStream(URL.java:1037) ~[na:1.7.0_65]
[debug] application - Getting partition offsets for topic ui_test_topic_6
[warn] application - Could not connect to partition leader exemplary-birds.master:9092. Error message: Failed to open a socket.
[debug] application - Getting partition offsets for topic ui_test_topic_5
[warn] application - Could not connect to partition leader exemplary-birds.master:9092. Error message: Failed to open a socket.
[warn] application - Could not connect to partition leader harmful-jar.master:9092. Error message: Failed to open a socket.
[warn] application - Could not connect to partition leader voluminous-mass.master:9092. Error message: Failed to open a socket.
(the same "Could not connect to partition leader ... Failed to open a socket" warning repeats for each of the three leaders on every partition)
[debug] application - Getting partition offsets for topic PofApiTest
[warn] application - Could not connect to partition leader harmful-jar.master:9092. Error message: Failed to open a socket.
[warn] application - Could not connect to partition leader voluminous-mass.master:9092. Error message: Failed to open a socket.
[warn] application - Could not connect to partition leader exemplary-birds.master:9092. Error message: Failed to open a socket.
[debug] application - Getting partition offsets for topic ui_test_topic_4

What does "Too many open files" mean — does it indicate insufficient local memory?

thanks

-- Alec Li
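"Too many open files" is not about memory: it means the process hit its file-descriptor limit (the same limit `ulimit -n` shows), with every open file and socket counting against it. A minimal Python sketch to inspect that limit on Linux/macOS:

```python
import resource

# RLIMIT_NOFILE is the per-process cap on open file descriptors.
# Soft limit: enforced now; hard limit: ceiling the soft limit may be raised to.
soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
print(f"open-file limit: soft={soft}, hard={hard}")

# A long-running broker or web console typically needs a much higher soft
# limit; raise it via `ulimit -n` / limits.conf (or resource.setrlimit)
# before starting the process.
```

Leaked descriptors (here, the console repeatedly opening sockets/files without closing them) exhaust the soft limit regardless of how much memory is free.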
Re: kafka logs gone after reboot the server
Thanks a lot!

On Fri, Jan 2, 2015 at 12:15 PM, Jay Kreps <jay.kr...@gmail.com> wrote:

Nice catch Joe--several people have complained about this as a problem and we were a bit mystified as to what kind of bug could lead to all their logs getting deleted and re-replicated when they bounced the server. We assumed bounced meant restarted the app, but I think likely what is happening is what you describe--the logs were in /tmp and bouncing the server meant restarting.

-Jay

On Fri, Jan 2, 2015 at 11:02 AM, Joe Stein <joe.st...@stealth.ly> wrote:

That is because your logs are in /tmp, which you can change by setting log.dirs to something else.

Joe Stein
Founder, Principal Consultant
Big Data Open Source Security LLC
http://www.stealth.ly
Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop

On Fri, Jan 2, 2015 at 1:58 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I've just noticed one thing: when I experience some errors on the Kafka servers, I reboot the dev servers (not a good way). After reboot I get into zkCli and can see all the topics still exist, but when I get into the kafka log directory, I found all the data gone, see

root@DO-mq-dev:/tmp/kafka-logs-1/ui_test_topic_4-0# ll
total 8
drwxr-xr-x  2 root root     4096 Jan  2 09:39 ./
drwxr-xr-x 46 root root     4096 Jan  2 10:46 ../
-rw-r--r--  1 root root 10485760 Jan  2 09:39 .index
-rw-r--r--  1 root root        0 Jan  2 09:39 .log

I wonder: if for some reason the server goes down and is restarted, will all the data on the hard drive be gone?

thanks

-- Alec Li

-- Alec Li
kafka logs gone after reboot the server
Hi, All

I've just noticed one thing: when I experience some errors on the Kafka servers, I reboot the dev servers (not a good way). After reboot I get into zkCli and can see all the topics still exist, but when I get into the kafka log directory, I found all the data gone, see

root@DO-mq-dev:/tmp/kafka-logs-1/ui_test_topic_4-0# ll
total 8
drwxr-xr-x  2 root root     4096 Jan  2 09:39 ./
drwxr-xr-x 46 root root     4096 Jan  2 10:46 ../
-rw-r--r--  1 root root 10485760 Jan  2 09:39 .index
-rw-r--r--  1 root root        0 Jan  2 09:39 .log

I wonder: if for some reason the server goes down and is restarted, will all the data on the hard drive be gone?

thanks

-- Alec Li
auto.create.topics.enable in config file
Hi, all

I added auto.create.topics.enable=true to the server.properties file, but I get this error when I start the kafka server:

java.lang.IllegalArgumentException: requirement failed: Unacceptable value for property 'auto.create.topics.enable', boolean values must be either 'true' or 'false

Any clue for this?

thanks

-- Alec Li
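Kafka's old property parser accepts only the exact lowercase strings "true" and "false", so a stray character (a trailing space, a smart quote from copy-paste, or "True") triggers this error even when the value looks right. A Python sketch of an equivalently strict check you could run over a properties file before starting the broker (`strict_bool` and `check_boolean_props` are hypothetical helpers mimicking that behavior):

```python
def strict_bool(value):
    """Accept only the exact strings 'true'/'false', like Kafka's parser."""
    if value == "true":
        return True
    if value == "false":
        return False
    raise ValueError(f"Unacceptable boolean value: {value!r}")

def check_boolean_props(lines, keys):
    """Yield (key, error) for boolean properties that would fail to parse."""
    for line in lines:
        if "=" not in line or line.lstrip().startswith("#"):
            continue
        key, _, raw = line.partition("=")
        if key.strip() in keys:
            try:
                strict_bool(raw.rstrip("\n"))  # no .strip(): spaces count
            except ValueError as e:
                yield key.strip(), str(e)

bad = list(check_boolean_props(
    ["auto.create.topics.enable=true "],   # trailing space -> rejected
    {"auto.create.topics.enable"},
))
```

In the example above the single trailing space after "true" is enough to make the property invalid, which is the most common cause of this exception.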
the impact of partition number
Hi, All

I've run bin/kafka-producer-perf-test.sh on our kafka production cluster, and I found the number of partitions really has a huge impact on producer performance, see:

start.time, end.time, compression, message.size, batch.size, total.data.sent.in.MB, MB.sec, total.data.sent.in.nMsg, nMsg.sec
2014-12-22 19:53:27:392, 2014-12-22 19:54:25:581, 1, 3000, 200, 2861.02, 49.1678, 100, 17185.3787
2014-12-22 19:55:27:048, 2014-12-22 19:56:23:318, 1, 3000, 200, 2861.02, 50.8446, 100, 17771.4590
2014-12-22 19:58:09:466, 2014-12-22 19:59:05:068, 1, 3000, 200, 2861.02, 51.4554, 100, 17984.9646
2014-12-22 19:59:40:389, 2014-12-22 20:00:28:646, 1, 3000, 200, 2861.02, 59.2872, 100, 20722.3822
2014-12-22 20:02:41:993, 2014-12-22 20:03:22:481, 1, 3000, 200, 2861.02, 70.6635, 100, 24698.6762
2014-12-22 20:03:47:594, 2014-12-22 20:04:26:238, 1, 3000, 200, 2861.02, 74.0354, 100, 25877.2384
2014-12-22 20:11:49:492, 2014-12-22 20:12:25:843, 1, 3000, 200, 2861.02, 78.7055, 100, 27509.5596
2014-12-22 20:12:53:290, 2014-12-22 20:13:29:746, 1, 3000, 200, 2861.02, 78.4788, 100, 27430.3270
2014-12-22 20:13:53:194, 2014-12-22 20:14:29:470, 1, 3000, 200, 2861.02, 78.8682, 100, 27566.4351
2014-12-22 20:14:51:491, 2014-12-22 20:15:25:451, 1, 3000, 200, 2861.02, 84.2468, 100, 29446.4075
2014-12-22 20:16:51:369, 2014-12-22 20:17:27:452, 1, 3000, 200, 2861.02, 79.2901, 100, 27713.8819
2014-12-22 20:17:57:882, 2014-12-22 20:18:33:957, 1, 3000, 200, 2861.02, 79.3076, 100, 27720.0277

The rows above use 1 to 12 partitions respectively; why is there such a big difference?

thanks

-- Alec Li
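The size of the effect can be read straight off the nMsg.sec column. A small Python sketch, using the rates from the perf-test output above and assuming the rows correspond to 1..12 partitions in order, that computes the speedup of the best run over the single-partition run:

```python
rows = [
    # (partitions, nMsg.sec) -- values taken from the perf-test output above,
    # assuming the rows were run with 1..12 partitions in order
    (1, 17185.3787), (2, 17771.4590), (3, 17984.9646), (4, 20722.3822),
    (5, 24698.6762), (6, 25877.2384), (7, 27509.5596), (8, 27430.3270),
    (9, 27566.4351), (10, 29446.4075), (11, 27713.8819), (12, 27720.0277),
]

base = rows[0][1]                                  # 1-partition throughput
best_partitions, best_rate = max(rows, key=lambda r: r[1])
speedup = best_rate / base
print(f"best: {best_partitions} partitions at {best_rate:.0f} msg/s "
      f"({speedup:.2f}x the single-partition rate)")
```

The gain flattens out past roughly the broker/disk parallelism of the cluster, which is consistent with the rates plateauing around 27-29k msg/s from 7 partitions onward.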
leader and isr were not set when create the topic
Hi, All

I created a topic with replication factor 3 and 6 partitions, but when I describe this topic, it seems no leader and no isr were set for it, see

bin/kafka-topics.sh --create --zookeeper 10.100.98.100:2181 --replication-factor 3 --partitions 6 --topic perf_producer_p6_test
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Created topic perf_producer_p6_test.
root@precise64:/etc/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.100:2181 --topic perf_producer_p6_test
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Topic:perf_producer_p6_test  PartitionCount:6  ReplicationFactor:3  Configs:
    Topic: perf_producer_p6_test  Partition: 0  Leader: none  Replicas: 100,101,102  Isr:
    Topic: perf_producer_p6_test  Partition: 1  Leader: none  Replicas: 101,102,100  Isr:
    Topic: perf_producer_p6_test  Partition: 2  Leader: none  Replicas: 102,100,101  Isr:
    Topic: perf_producer_p6_test  Partition: 3  Leader: none  Replicas: 100,102,101  Isr:
    Topic: perf_producer_p6_test  Partition: 4  Leader: none  Replicas: 101,100,102  Isr:
    Topic: perf_producer_p6_test  Partition: 5  Leader: none  Replicas: 102,101,100  Isr:

Is there a way to explicitly set the leader and isr from the command line? It is strange that when I create a topic with 5 partitions, it does have a leader and isr:

root@precise64:/etc/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.100:2181 --topic perf_producer_p5_test
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Topic:perf_producer_p5_test  PartitionCount:5  ReplicationFactor:3  Configs:
    Topic: perf_producer_p5_test  Partition: 0  Leader: 102  Replicas: 102,100,101  Isr: 102,100,101
    Topic: perf_producer_p5_test  Partition: 1  Leader: 102  Replicas: 100,101,102  Isr: 102,101
    Topic: perf_producer_p5_test  Partition: 2  Leader: 101  Replicas: 101,102,100  Isr: 101,102,100
    Topic: perf_producer_p5_test  Partition: 3  Leader: 102  Replicas: 102,101,100  Isr: 102,101,100
    Topic: perf_producer_p5_test  Partition: 4  Leader: 102  Replicas: 100,102,101  Isr: 102,101

Any ideas?

thanks

-- Alec Li
Re: leader and isr were not set when create the topic
I restarted the kafka server and it is the same thing; sometimes nothing is listed for ISR or leader. I checked the state-change log:

[2014-12-22 23:46:38,164] TRACE Broker 100 cached leader info (LeaderAndIsrInfo:(Leader:101,ISR:101,102,100,LeaderEpoch:0,ControllerEpoch:4),ReplicationFactor:3),AllReplicas:101,102,100) for partition [perf_producer_p8_test,1] in response to UpdateMetadata request sent by controller 101 epoch 4 with correlation id 138 (state.change.logger)

On Mon, Dec 22, 2014 at 2:46 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I created a topic with replication factor 3 and 6 partitions, but when I describe this topic, it seems no leader and no isr were set for it, see

bin/kafka-topics.sh --create --zookeeper 10.100.98.100:2181 --replication-factor 3 --partitions 6 --topic perf_producer_p6_test
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Created topic perf_producer_p6_test.
root@precise64:/etc/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.100:2181 --topic perf_producer_p6_test
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Topic:perf_producer_p6_test  PartitionCount:6  ReplicationFactor:3  Configs:
    Topic: perf_producer_p6_test  Partition: 0  Leader: none  Replicas: 100,101,102  Isr:
    Topic: perf_producer_p6_test  Partition: 1  Leader: none  Replicas: 101,102,100  Isr:
    Topic: perf_producer_p6_test  Partition: 2  Leader: none  Replicas: 102,100,101  Isr:
    Topic: perf_producer_p6_test  Partition: 3  Leader: none  Replicas: 100,102,101  Isr:
    Topic: perf_producer_p6_test  Partition: 4  Leader: none  Replicas: 101,100,102  Isr:
    Topic: perf_producer_p6_test  Partition: 5  Leader: none  Replicas: 102,101,100  Isr:

Is there a way to explicitly set the leader and isr from the command line? It is strange that when I create a topic with 5 partitions, it does have a leader and isr:

root@precise64:/etc/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.100:2181 --topic perf_producer_p5_test
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Topic:perf_producer_p5_test  PartitionCount:5  ReplicationFactor:3  Configs:
    Topic: perf_producer_p5_test  Partition: 0  Leader: 102  Replicas: 102,100,101  Isr: 102,100,101
    Topic: perf_producer_p5_test  Partition: 1  Leader: 102  Replicas: 100,101,102  Isr: 102,101
    Topic: perf_producer_p5_test  Partition: 2  Leader: 101  Replicas: 101,102,100  Isr: 101,102,100
    Topic: perf_producer_p5_test  Partition: 3  Leader: 102  Replicas: 102,101,100  Isr: 102,101,100
    Topic: perf_producer_p5_test  Partition: 4  Leader: 102  Replicas: 100,102,101  Isr: 102,101

Any ideas?

thanks

-- Alec Li

-- Alec Li
Re: leader and isr were not set when create the topic
Hello, Neha This is the error from server.log [2014-12-22 23:53:25,663] WARN [KafkaApi-100] Fetch request with correlation id 1227732 from client ReplicaFetcherThread-0-100 on partition [perf_producer_p8_test,1] failed due to Leader not local for partition [perf_producer_p8_test,1] on broker 100 (kafka.server.KafkaApis) On Mon, Dec 22, 2014 at 3:50 PM, Sa Li sal...@gmail.com wrote: I restart the kafka server, it is the same thing, sometime nothing listed on ISR, leader, I checked the state-change log [2014-12-22 23:46:38,164] TRACE Broker 100 cached leader info (LeaderAndIsrInfo:(Leader:101,ISR:101,102,100,LeaderEpoch:0,ControllerEpoch:4),ReplicationFactor:3),AllReplicas:101,102,100) for partition [perf_producer_p8_test,1] in response to UpdateMetadata request sent by controller 101 epoch 4 with correlation id 138 (state.change.logger) On Mon, Dec 22, 2014 at 2:46 PM, Sa Li sal...@gmail.com wrote: Hi, All I created a topic with 3 replications and 6 partitions, but when I check this topic, seems there is no leader and isr were set for this topic, see bin/kafka-topics.sh --create --zookeeper 10.100.98.100:2181 --replication-factor 3 --partitions 6 --topic perf_producer_p6_test SLF4J: Class path contains multiple SLF4J bindings. SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation. SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] Created topic perf_producer_p6_test. root@precise64:/etc/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.100:2181 --topic perf_producer_p6_test SLF4J: Class path contains multiple SLF4J bindings. 
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation. SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] Topic:perf_producer_p6_test PartitionCount:6 ReplicationFactor:3 Configs: Topic: perf_producer_p6_testPartition: 0Leader: none Replicas: 100,101,102 Isr: Topic: perf_producer_p6_testPartition: 1Leader: none Replicas: 101,102,100 Isr: Topic: perf_producer_p6_testPartition: 2Leader: none Replicas: 102,100,101 Isr: Topic: perf_producer_p6_testPartition: 3Leader: none Replicas: 100,102,101 Isr: Topic: perf_producer_p6_testPartition: 4Leader: none Replicas: 101,100,102 Isr: Topic: perf_producer_p6_testPartition: 5Leader: none Replicas: 102,101,100 Isr: Is there a way to specifically set leader and isr in command line, it is strange when I create the topic with 5 partitions, it has leader and isr: root@precise64:/etc/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.100:2181 --topic perf_producer_p5_test SLF4J: Class path contains multiple SLF4J bindings. SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation. 
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] Topic:perf_producer_p5_test PartitionCount:5 ReplicationFactor:3 Configs: Topic: perf_producer_p5_testPartition: 0Leader: 102 Replicas: 102,100,101 Isr: 102,100,101 Topic: perf_producer_p5_testPartition: 1Leader: 102 Replicas: 100,101,102 Isr: 102,101 Topic: perf_producer_p5_testPartition: 2Leader: 101 Replicas: 101,102,100 Isr: 101,102,100 Topic: perf_producer_p5_testPartition: 3Leader: 102 Replicas: 102,101,100 Isr: 102,101,100 Topic: perf_producer_p5_testPartition: 4Leader: 102 Replicas: 100,102,101 Isr: 102,101 Any ideas? thanks -- Alec Li -- Alec Li -- Alec Li
Re: leader and isr were not set when create the topic
I have three nodes: 100, 101, and 102. When I restarted all of them, everything seems OK now, but I would like to paste the error messages I got from server.log on each node, in case you can help me understand what the problem was.

On node 100:

[2014-12-23 00:04:39,401] ERROR [KafkaApi-100] Error when processing fetch request for partition [perf_producer_p8_test,7] offset 125000 from follower with correlation id 0 (kafka.server.KafkaApis)
kafka.common.OffsetOutOfRangeException: Request for offset 125000 but we only have log segments in the range 0 to 0.
        at kafka.log.Log.read(Log.scala:380)
        at kafka.server.KafkaApis.kafka$server$KafkaApis$$readMessageSet(KafkaApis.scala:530)
        at kafka.server.KafkaApis$$anonfun$kafka$server$KafkaApis$$readMessageSets$1.apply(KafkaApis.scala:476)
        at kafka.server.KafkaApis$$anonfun$kafka$server$KafkaApis$$readMessageSets$1.apply(KafkaApis.scala:471)
        at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:244)
        at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:244)
        at scala.collection.immutable.Map$Map3.foreach(Map.scala:154)
        at scala.collection.TraversableLike$class.map(TraversableLike.scala:244)
        at scala.collection.AbstractTraversable.map(Traversable.scala:105)
        ..
        ..

On nodes 101 and 102:

[2014-12-23 00:04:39,440] ERROR [ReplicaFetcherThread-0-100], Current offset 125000 for partition [perf_producer_p8_test,1] out of range; reset offset to 0 (kafka.server.ReplicaFetcherThread)
[2014-12-23 00:04:39,442] INFO Truncating log perf_producer_p8_test-7 to offset 0.
(kafka.log.Log) [2014-12-23 00:04:39,452] WARN [ReplicaFetcherThread-0-100], Replica 102 for partition [perf_producer_p8_test,7] reset its fetch offset to current leader 100's latest offset 0 (kafka.server.ReplicaFetcherThread) On Mon, Dec 22, 2014 at 3:55 PM, Sa Li sal...@gmail.com wrote: Hello, Neha This is the error from server.log [2014-12-22 23:53:25,663] WARN [KafkaApi-100] Fetch request with correlation id 1227732 from client ReplicaFetcherThread-0-100 on partition [perf_producer_p8_test,1] failed due to Leader not local for partition [perf_producer_p8_test,1] on broker 100 (kafka.server.KafkaApis) On Mon, Dec 22, 2014 at 3:50 PM, Sa Li sal...@gmail.com wrote: I restart the kafka server, it is the same thing, sometime nothing listed on ISR, leader, I checked the state-change log [2014-12-22 23:46:38,164] TRACE Broker 100 cached leader info (LeaderAndIsrInfo:(Leader:101,ISR:101,102,100,LeaderEpoch:0,ControllerEpoch:4),ReplicationFactor:3),AllReplicas:101,102,100) for partition [perf_producer_p8_test,1] in response to UpdateMetadata request sent by controller 101 epoch 4 with correlation id 138 (state.change.logger) On Mon, Dec 22, 2014 at 2:46 PM, Sa Li sal...@gmail.com wrote: Hi, All I created a topic with 3 replications and 6 partitions, but when I check this topic, seems there is no leader and isr were set for this topic, see bin/kafka-topics.sh --create --zookeeper 10.100.98.100:2181 --replication-factor 3 --partitions 6 --topic perf_producer_p6_test SLF4J: Class path contains multiple SLF4J bindings. SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation. 
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] Created topic perf_producer_p6_test. root@precise64:/etc/kafka# bin/kafka-topics.sh --describe --zookeeper 10.100.98.100:2181 --topic perf_producer_p6_test SLF4J: Class path contains multiple SLF4J bindings. SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation. SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] Topic:perf_producer_p6_test PartitionCount:6 ReplicationFactor:3 Configs: Topic: perf_producer_p6_testPartition: 0Leader: none Replicas: 100,101,102 Isr: Topic: perf_producer_p6_testPartition: 1Leader: none Replicas: 101,102,100 Isr: Topic: perf_producer_p6_testPartition: 2Leader: none Replicas: 102,100,101 Isr: Topic: perf_producer_p6_testPartition: 3Leader: none Replicas: 100,102,101 Isr: Topic: perf_producer_p6_test
kafka monitoring system
Hi, all

I am thinking of building a reliable monitoring system for our kafka production cluster. I read this in the documents:

"Kafka uses Yammer Metrics for metrics reporting in both the server and the client. This can be configured to report stats using pluggable stats reporters to hook up to your monitoring system. The easiest way to see the available metrics is to fire up jconsole and point it at a running kafka client or server; this will allow browsing all metrics with JMX. We do graphing and alerting on the following metrics: .."

I am wondering if anyone has used JConsole to monitor kafka, or can recommend a good monitoring tool for a kafka production cluster.

thanks

-- Alec Li
can't produce message in kafka production
Dear all

We just built a kafka production cluster. I can create topics in the kafka production cluster from another host, but when I send a very simple message as a producer, it generates these errors:

root@precise64:/etc/kafka# bin/kafka-console-producer.sh --broker-list 10.100.98.100:9092 --topic my-replicated-topic-production
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
my test message 1
[2014-12-18 21:44:25,830] WARN Failed to send producer request with correlation id 2 to broker 101 with data for partitions [my-replicated-topic-production,1] (kafka.producer.async.DefaultEventHandler)
java.nio.channels.ClosedChannelException
        at kafka.network.BlockingChannel.send(BlockingChannel.scala:100)
        at kafka.producer.SyncProducer.liftedTree1$1(SyncProducer.scala:73)
        at kafka.producer.SyncProducer.kafka$producer$SyncProducer$$doSend(SyncProducer.scala:72)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(SyncProducer.scala:103)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103)
        at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply$mcV$sp(SyncProducer.scala:102)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102)
        at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
        at kafka.producer.SyncProducer.send(SyncProducer.scala:101)
        at kafka.producer.async.DefaultEventHandler.kafka$producer$async$DefaultEventHandler$$send(DefaultEventHandler.scala:256)
        at kafka.producer.async.DefaultEventHandler$$anonfun$dispatchSerializedData$2.apply(DefaultEventHandler.scala:107)
        at kafka.producer.async.DefaultEventHandler$$anonfun$dispatchSerializedData$2.apply(DefaultEventHandler.scala:99)
        at scala.collection.TraversableLike$WithFilter$$anonfun$foreach$1.apply(TraversableLike.scala:772)
        at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98)
        at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98)
        at scala.collection.mutable.HashTable$class.foreachEntry(HashTable.scala:226)
        at scala.collection.mutable.HashMap.foreachEntry(HashMap.scala:39)
        at scala.collection.mutable.HashMap.foreach(HashMap.scala:98)
        at scala.collection.TraversableLike$WithFilter.foreach(TraversableLike.scala:771)
        at kafka.producer.async.DefaultEventHandler.dispatchSerializedData(DefaultEventHandler.scala:99)
        at kafka.producer.async.DefaultEventHandler.handle(DefaultEventHandler.scala:72)
        at kafka.producer.async.ProducerSendThread.tryToHandle(ProducerSendThread.scala:105)
        at kafka.producer.async.ProducerSendThread$$anonfun$processEvents$3.apply(ProducerSendThread.scala:88)
        at kafka.producer.async.ProducerSendThread$$anonfun$processEvents$3.apply(ProducerSendThread.scala:68)
        at scala.collection.immutable.Stream.foreach(Stream.scala:547)
        at kafka.producer.async.ProducerSendThread.processEvents(ProducerSendThread.scala:67)
        at kafka.producer.async.ProducerSendThread.run(ProducerSendThread.scala:45)
[2014-12-18 21:44:25,948] WARN Failed to send producer request with correlation id 5 to broker 101 with data for partitions [my-replicated-topic-production,1] (kafka.producer.async.DefaultEventHandler)
java.nio.channels.ClosedChannelException
        at kafka.network.BlockingChannel.send(BlockingChannel.scala:100)
        at kafka.producer.SyncProducer.liftedTree1$1(SyncProducer.scala:73)
        at kafka.producer.SyncProducer.kafka$producer$SyncProducer$$doSend(SyncProducer.scala:72)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(SyncProducer.scala:103)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103)
        at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103)
        at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply$mcV$sp(SyncProducer.scala:102)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102)
        at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102)
        at
Re: can't produce message in kafka production
Thanks, Gwen, I telnetted it: root@precise64:/etc/kafka# telnet 10.100.98.100 9092 Trying 10.100.98.100... Connected to 10.100.98.100. Escape character is '^]'. It seems it connected, and I checked with the system operations people; netstat shows 9092 is listening. I assume this is a connection issue, since I can run the same command against my dev cluster (10.100.70.128:9092) with no problem at all. Just in case, could it possibly be caused by another type of issue? thanks Alec On Thu, Dec 18, 2014 at 2:33 PM, Gwen Shapira gshap...@cloudera.com wrote: Looks like you can't connect to: 10.100.98.100:9092 I'd validate that this is the issue using telnet and then check the firewall / ipfilters settings. On Thu, Dec 18, 2014 at 2:21 PM, Sa Li sal...@gmail.com wrote: Dear all We just build a kafka production cluster, I can create topics in kafka production from another host. But when I am send very simple message as producer, it generate such errors: root@precise64:/etc/kafka# bin/kafka-console-producer.sh --broker-list 10.100.98.100:9092 --topic my-replicated-topic-production SLF4J: Class path contains multiple SLF4J bindings. SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: Found binding in [jar:file:/etc/kafka/core/build/dependant-libs-2.10.4/slf4j-log4j12-1.7.6.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] my test message 1 [2014-12-18 21:44:25,830] WARN Failed to send producer request with correlation id 2 to broker 101 with data for partitions [my-replicated-topic-production,1] (kafka.producer.async.DefaultEventHandler) java.nio.channels.ClosedChannelException at kafka.network.BlockingChannel.send(BlockingChannel.scala:100) at kafka.producer.SyncProducer.liftedTree1$1(SyncProducer.scala:73) at kafka.producer.SyncProducer.kafka$producer$SyncProducer$$doSend(SyncProducer.scala:72) at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(SyncProducer.scala:103) at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103) at kafka.producer.SyncProducer$$anonfun$send$1$$anonfun$apply$mcV$sp$1.apply(SyncProducer.scala:103) at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33) at kafka.producer.SyncProducer$$anonfun$send$1.apply$mcV$sp(SyncProducer.scala:102) at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102) at kafka.producer.SyncProducer$$anonfun$send$1.apply(SyncProducer.scala:102) at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33) at kafka.producer.SyncProducer.send(SyncProducer.scala:101) at kafka.producer.async.DefaultEventHandler.kafka$producer$async$DefaultEventHandler$$send(DefaultEventHandler.scala:256) at kafka.producer.async.DefaultEventHandler$$anonfun$dispatchSerializedData$2.apply(DefaultEventHandler.scala:107) at kafka.producer.async.DefaultEventHandler$$anonfun$dispatchSerializedData$2.apply(DefaultEventHandler.scala:99) at scala.collection.TraversableLike$WithFilter$$anonfun$foreach$1.apply(TraversableLike.scala:772) at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98) at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:98) at scala.collection.mutable.HashTable$class.foreachEntry(HashTable.scala:226) at 
scala.collection.mutable.HashMap.foreachEntry(HashMap.scala:39) at scala.collection.mutable.HashMap.foreach(HashMap.scala:98) at scala.collection.TraversableLike$WithFilter.foreach(TraversableLike.scala:771) at kafka.producer.async.DefaultEventHandler.dispatchSerializedData(DefaultEventHandler.scala:99) at kafka.producer.async.DefaultEventHandler.handle(DefaultEventHandler.scala:72) at kafka.producer.async.ProducerSendThread.tryToHandle(ProducerSendThread.scala:105) at kafka.producer.async.ProducerSendThread$$anonfun$processEvents$3.apply(ProducerSendThread.scala:88) at kafka.producer.async.ProducerSendThread$$anonfun$processEvents$3.apply(ProducerSendThread.scala:68) at scala.collection.immutable.Stream.foreach(Stream.scala:547) at kafka.producer.async.ProducerSendThread.processEvents(ProducerSendThread.scala:67) at kafka.producer.async.ProducerSendThread.run(ProducerSendThread.scala:45) [2014-12-18 21:44:25,948] WARN Failed to send producer request with correlation id 5 to broker 101 with data for partitions [my-replicated-topic-production,1] (kafka.producer.async.DefaultEventHandler
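Beyond the firewall check, a ClosedChannelException from the old producer often means the broker's advertised metadata points clients at an address they cannot reach, even though the initial bootstrap connection succeeds. A guess worth checking rather than a confirmed diagnosis for this cluster, using the 0.8.x-era setting names:

```
# config/server.properties on each broker (Kafka 0.8.x-era names).
# By default the broker advertises its own hostname, which may not
# resolve from the producer machine; advertise a reachable address.
advertised.host.name=10.100.98.100
advertised.port=9092
```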
Re: kafka consumer to write into DB
Thank you very much for the reply, Neha. I have a question about the consumer: I consume data from kafka and write it into a DB, and of course I have to create a hash map in memory, load data into memory, and bulk copy to the DB instead of inserting into the DB line by line. Does that mean I need to ack each message while loading it into memory? thanks On Thu, Dec 4, 2014 at 1:21 PM, Neha Narkhede n...@confluent.io wrote: This is specific for pentaho but may be useful - https://github.com/RuckusWirelessIL/pentaho-kafka-consumer On Thu, Dec 4, 2014 at 12:58 PM, Sa Li sal...@gmail.com wrote: Hello, all I never developed a kafka consumer, I want to be able to make an advanced kafka consumer in java to consume the data and continuously write the data into postgresql DB. I am thinking to create a map in memory and getting a predefined number of messages in memory then write into DB in batch, is there a API or sample code to allow me to do this? thanks -- Alec Li -- Thanks, Neha -- Alec Li
Re: kafka consumer to write into DB
Thanks, Neha, is there a java version batch consumer? thanks On Fri, Dec 5, 2014 at 9:41 AM, Scott Clasen sc...@heroku.com wrote: if you are using scala/akka this will handle the batching and acks for you. https://github.com/sclasen/akka-kafka#akkabatchconsumer On Fri, Dec 5, 2014 at 9:21 AM, Sa Li sal...@gmail.com wrote: Thank you very much for the reply, Neha, I have a question about consumer, I consume the data from kafka and write into DB, of course I have to create a hash map in memory, load data into memory and bulk copy to DB instead of insert into DB line by line. Does it mean I need to ack each message while load to memory? thanks On Thu, Dec 4, 2014 at 1:21 PM, Neha Narkhede n...@confluent.io wrote: This is specific for pentaho but may be useful - https://github.com/RuckusWirelessIL/pentaho-kafka-consumer On Thu, Dec 4, 2014 at 12:58 PM, Sa Li sal...@gmail.com wrote: Hello, all I never developed a kafka consumer, I want to be able to make an advanced kafka consumer in java to consume the data and continuously write the data into postgresql DB. I am thinking to create a map in memory and getting a predefined number of messages in memory then write into DB in batch, is there a API or sample code to allow me to do this? thanks -- Alec Li -- Thanks, Neha -- Alec Li -- Alec Li
kafka consumer to write into DB
Hello, all I have never developed a kafka consumer. I want to make an advanced kafka consumer in java that consumes the data and continuously writes it into a postgresql DB. I am thinking of creating a map in memory, collecting a predefined number of messages there, then writing them into the DB in batch. Is there an API or sample code that allows me to do this? thanks -- Alec Li
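As a rough way to prototype the batch-load idea before writing a custom Java consumer, the console consumer can be piped straight into Postgres bulk COPY. A sketch only; the topic, database, and table names are made up for illustration:

```
# Stream messages from a topic into Postgres via COPY, avoiding
# row-by-row INSERT overhead (topic/db/table names are hypothetical):
bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic events \
  | psql -d eventsdb -c "\copy events_raw(payload) FROM STDIN"
```

This sketch has no offset management or failure handling; a real consumer should commit offsets only after each batch has been flushed to the database.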
Re: how many brokers to set in kafka
thanks a lot On Nov 29, 2014, at 8:29 AM, Jun Rao jun...@gmail.com wrote: Typically, you will just have one broker per server. If you do want to set up multiple brokers on the same server, ideally you need to give each broker dedicated storage. Thanks, Jun On Thu, Nov 27, 2014 at 11:09 AM, Sa Li sal...@gmail.com wrote: Hi, all We are having 3 production server to setup for kafka cluster, I wonder how many brokers to configure for each server. thanks -- Alec Li
rule to set number of brokers for each server
Dear all I am provisioning a production kafka cluster, which has 3 servers, and I am wondering how many brokers I should set up on each server. I set 3 brokers per node in the dev cluster, but I really don't know the advantage of running more than 1 broker per server. What about 1 broker per server, 3 brokers in total, instead of 9? thanks -- Alec Li
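If you stay with one broker per server (the usual layout), the only per-host differences are the broker id and, ideally, dedicated storage. A sketch of the three server.properties files, with example ids and paths:

```
# host 1 - config/server.properties
broker.id=0
log.dirs=/data/kafka

# host 2: broker.id=1    host 3: broker.id=2
# (same log.dirs layout, each broker on its own dedicated disk)
```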
how many brokers to set in kafka
Hi, all We have 3 production servers to set up as a kafka cluster, and I wonder how many brokers to configure on each server. thanks -- Alec Li
Re: how many brokers to set in kafka
Are there any rules to determine or optimize the number of brokers? On Thu, Nov 27, 2014 at 11:09 AM, Sa Li sal...@gmail.com wrote: Hi, all We are having 3 production server to setup for kafka cluster, I wonder how many brokers to configure for each server. thanks -- Alec Li -- Alec Li
Re: kafka web console running error
I am using https://github.com/claudemamo/kafka-web-console version, and do you mind to tell where about to make such modification? thanks Alec On Mon, Nov 24, 2014 at 11:16 PM, Yang Fang franklin.f...@gmail.com wrote: do you see error msg Too many open files? it tips you should modify nofile On Tue, Nov 25, 2014 at 1:26 PM, Jun Rao jun...@gmail.com wrote: Which web console are you using? Thanks, Jun On Fri, Nov 21, 2014 at 8:34 AM, Sa Li sal...@gmail.com wrote: Hi, all I am trying to get kafka web console work, but seems it only works few hours and fails afterwards, below is the error messages on the screen. I am assuming something wrong with the DB, I used to swap H2 to mysql, but didn't help. Anyone has similar problem? - . . at sun.misc.Resource.getByteBuffer(Resource.java:160) ~[na:1.7.0_65] at java.net.URLClassLoader.defineClass(URLClassLoader.java:436) ~[na:1.7.0_65] at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.processBatch$1(BatchingExecutor.scala:67) at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:82) at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply(BatchingExecutor.scala:59) at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply(BatchingExecutor.scala:59) at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72) at akka.dispatch.BatchingExecutor$Batch.run(BatchingExecutor.scala:58) at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:42) at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:386) at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) [ERROR] Failed to construct terminal; falling back to unsupported java.io.IOException: Cannot run program sh: error=24, 
Too many open files at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047) at java.lang.Runtime.exec(Runtime.java:617) at java.lang.Runtime.exec(Runtime.java:485) at jline.internal.TerminalLineSettings.exec(TerminalLineSettings.java:183) at jline.internal.TerminalLineSettings.exec(TerminalLineSettings.java:173) at jline.internal.TerminalLineSettings.stty(TerminalLineSettings.java:168) at jline.internal.TerminalLineSettings.get(TerminalLineSettings.java:72) at jline.internal.TerminalLineSettings.init(TerminalLineSettings.java:52) at jline.UnixTerminal.init(UnixTerminal.java:31) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) at java.lang.reflect.Constructor.newInstance(Constructor.java:526) at java.lang.Class.newInstance(Class.java:379) [error] a.a.ActorSystemImpl - Uncaught error from thread [play-akka.actor.default-dispatcher-944] shutting down JVM since 'akka.jvm-exit-on-fatal-error' java.lang.NoClassDefFoundError: common/Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1$$anonfun$applyOrElse$1 at common.Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1.applyOrElse(Util.scala:75) ~[na:na] at common.Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1.applyOrElse(Util.scala:74) ~[na:na] at scala.runtime.AbstractPartialFunction$mcJL$sp.apply$mcJL$sp(AbstractPartialFunction.scala:33) ~[scala-library.jar:na] at scala.runtime.AbstractPartialFunction$mcJL$sp.apply(AbstractPartialFunction.scala:33) ~[scala-library.jar:na] at scala.runtime.AbstractPartialFunction$mcJL$sp.apply(AbstractPartialFunction.scala:25) ~[scala-library.jar:na] at scala.util.Failure$$anonfun$recover$1.apply(Try.scala:185) ~[scala-library.jar:na] at jline.TerminalFactory.getFlavor(TerminalFactory.java:168) at 
jline.TerminalFactory.create(TerminalFactory.java:81) at jline.TerminalFactory.get(TerminalFactory.java:159) at sbt.MainLoop$$anon$1.run(MainLoop.scala:19) at java.lang.Thread.run(Thread.java:745) Caused by: java.io.IOException: error=24, Too many open files
maximum number of open file handles
Hi, all I read the comments at http://www.michael-noll.com/tutorials/running-multi-node-storm-cluster/, where Michael mentions: to increase the maximum number of open file handles for the user kafka to 98,304 (change kafka to whatever user you are running the Kafka daemons with – this can be your own user account, of course) you must add the following line to /etc/security/limits.conf: kafka - nofile 98304 I am installing the latest version of kafka; do I need to do the same thing? thanks -- Alec Li
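The advice still applies to current Kafka versions, since brokers hold a file handle per log segment plus one per socket. A sketch of the limits.conf entry and how to verify it took effect (the 98304 value is the tutorial's suggestion, not a Kafka requirement):

```shell
# Entry for /etc/security/limits.conf (replace "kafka" with whatever
# account runs the broker):
#   kafka  -  nofile  98304
# After logging back in as that user, check the effective soft limit:
ulimit -n
```

For an already-running broker, `cat /proc/<pid>/limits` shows the limits the process actually started with, since limits.conf only affects new login sessions.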
kafka web console running error
Hi, all I am trying to get kafka web console work, but seems it only works few hours and fails afterwards, below is the error messages on the screen. I am assuming something wrong with the DB, I used to swap H2 to mysql, but didn't help. Anyone has similar problem? - . . at sun.misc.Resource.getByteBuffer(Resource.java:160) ~[na:1.7.0_65] at java.net.URLClassLoader.defineClass(URLClassLoader.java:436) ~[na:1.7.0_65] at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.processBatch$1(BatchingExecutor.scala:67) at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:82) at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply(BatchingExecutor.scala:59) at akka.dispatch.BatchingExecutor$Batch$$anonfun$run$1.apply(BatchingExecutor.scala:59) at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72) at akka.dispatch.BatchingExecutor$Batch.run(BatchingExecutor.scala:58) at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:42) at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:386) at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) [ERROR] Failed to construct terminal; falling back to unsupported java.io.IOException: Cannot run program sh: error=24, Too many open files at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047) at java.lang.Runtime.exec(Runtime.java:617) at java.lang.Runtime.exec(Runtime.java:485) at jline.internal.TerminalLineSettings.exec(TerminalLineSettings.java:183) at jline.internal.TerminalLineSettings.exec(TerminalLineSettings.java:173) at jline.internal.TerminalLineSettings.stty(TerminalLineSettings.java:168) at jline.internal.TerminalLineSettings.get(TerminalLineSettings.java:72) 
at jline.internal.TerminalLineSettings.init(TerminalLineSettings.java:52) at jline.UnixTerminal.init(UnixTerminal.java:31) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) at java.lang.reflect.Constructor.newInstance(Constructor.java:526) at java.lang.Class.newInstance(Class.java:379) [error] a.a.ActorSystemImpl - Uncaught error from thread [play-akka.actor.default-dispatcher-944] shutting down JVM since 'akka.jvm-exit-on-fatal-error' java.lang.NoClassDefFoundError: common/Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1$$anonfun$applyOrElse$1 at common.Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1.applyOrElse(Util.scala:75) ~[na:na] at common.Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1.applyOrElse(Util.scala:74) ~[na:na] at scala.runtime.AbstractPartialFunction$mcJL$sp.apply$mcJL$sp(AbstractPartialFunction.scala:33) ~[scala-library.jar:na] at scala.runtime.AbstractPartialFunction$mcJL$sp.apply(AbstractPartialFunction.scala:33) ~[scala-library.jar:na] at scala.runtime.AbstractPartialFunction$mcJL$sp.apply(AbstractPartialFunction.scala:25) ~[scala-library.jar:na] at scala.util.Failure$$anonfun$recover$1.apply(Try.scala:185) ~[scala-library.jar:na] at jline.TerminalFactory.getFlavor(TerminalFactory.java:168) at jline.TerminalFactory.create(TerminalFactory.java:81) at jline.TerminalFactory.get(TerminalFactory.java:159) at sbt.MainLoop$$anon$1.run(MainLoop.scala:19) at java.lang.Thread.run(Thread.java:745) Caused by: java.io.IOException: error=24, Too many open files at java.lang.UNIXProcess.forkAndExec(Native Method) at java.lang.UNIXProcess.init(UNIXProcess.java:186) at java.lang.ProcessImpl.start(ProcessImpl.java:130) at 
java.lang.ProcessBuilder.start(ProcessBuilder.java:1028) ... 18 more [error] a.a.ActorSystemImpl - Uncaught error from thread [play-akka.actor.default-dispatcher-943] shutting down JVM since 'akka.jvm-exit-on-fatal-error' java.lang.NoClassDefFoundError: common/Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1$$anonfun$applyOrElse$1 at common.Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1.applyOrElse(Util.scala:75) ~[na:na] at common.Util$$anonfun$getPartitionsLogSize$3$$anonfun$apply$19$$anonfun$apply$1.applyOrElse(Util.scala:74) ~[na:na] at
kafka producer example
Hi, All I am running the kafka producer code:

import java.util.*;
import kafka.javaapi.producer.Producer;
import kafka.producer.KeyedMessage;
import kafka.producer.ProducerConfig;

public class TestProducer {
    public static void main(String[] args) {
        long events = Long.parseLong(args[0]);
        Random rnd = new Random();
        Properties props = new Properties();
        props.put("metadata.broker.list", "10.100.70.128:9092,10.100.70.128:9093,10.100.70.128:9094");
        props.put("serializer.class", "kafka.serializer.StringEncoder");
        props.put("partitioner.class", "example.producer.SimplePartitioner");
        props.put("request.required.acks", "1");
        ProducerConfig config = new ProducerConfig(props);
        Producer<String, String> producer = new Producer<String, String>(config);
        for (long nEvents = 0; nEvents < events; nEvents++) {
            long runtime = new Date().getTime();
            String ip = "192.168.2." + rnd.nextInt(255);
            String msg = runtime + ",www.example.com," + ip;
            KeyedMessage<String, String> data = new KeyedMessage<String, String>("page_visits", ip, msg);
            producer.send(data);
        }
        producer.close();
    }
}

It should be straightforward, but when I compile it in IntelliJ IDEA, I get this error: Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:2.5.1:compile (default-compile) on project kafka-producer: Compilation failure [ERROR] C:\Users\sa\Desktop\Workload\kafka\kafkaprj\kafka-json-producer\src\main\java\kafka\example\TestProducer.java:[35,20] error: cannot access Serializable, which is the producer object. Any idea of this? thanks -- Alec Li
Re: postgresql consumer
Hi, all I've just made a 3-node kafka cluster (9 brokers, 3 per node), and the performance test is OK. Now I am using tridentKafkaSpout and am able to get data from the producer, see

BrokerHosts zk = new ZkHosts("10.100.70.128:2181");
TridentKafkaConfig spoutConf = new TridentKafkaConfig(zk, "topictest");
spoutConf.scheme = new SchemeAsMultiScheme(new StringScheme());
OpaqueTridentKafkaSpout kafkaSpout = new OpaqueTridentKafkaSpout(spoutConf);
// TransactionalTridentKafkaSpout kafkaSpout = new TransactionalTridentKafkaSpout(spoutConf);
TridentTopology topology = new TridentTopology();
Stream stream = topology.newStream("topictestspout", kafkaSpout).shuffle()
    .each(new Fields("str"), new PrintStream(), new Fields("event_object"))
    .parallelismHint(16);

With the above code, I can print out the json objects published to the brokers. Instead of printing the messages, I would like to simply populate them into a postgresql DB. I downloaded the code from https://github.com/geoforce/storm-postgresql Here are the problems I have: 1. When I run the storm-postgresql code with messages generated from a RandomTupleSpout(), I am only able to write 100 rows into the postgresql DB, regardless of how I change the PostgresqlStateConfig. 2. Now I want to write the json messages into the postgresql DB; things seem simple, just 2 columns in the DB table, id and events, which stores the json messages. Forgive my dullness, but I couldn't get it to work with storm-postgresql. I wonder if anyone has done a similar job: getting data from tridentKafkaSpout and writing it exactly into a postgresql DB. In addition, once the writer starts to work, if it stops and restarts for some reason, I would like the writer to resume consuming from the stop point instead of from the very beginning; how do I manage the offset and restart writing into the DB?
thanks Alec Hi, All I setup a kafka cluster, and plan to publish the messages from Web to kafka, the messages are in the form of json, I want to implement a consumer to write the message I consumer to postgresql DB, not aggregation at all. I was thinking to use KafkaSpout in storm to make it happen, now I want to simplify the step, just use kafka consumer to populate message into postgresql. This consumer should have the functions of consumer data, write into postgresql DB in batch, if servers down, consumer can retrieve the data it stored in hard drive with no redundancy and can consume the data from where it stopped once the server up. Is there any sample code for this? thanks a lot Alec
Re: kafka-web-console
All, Again, I am still unable to install; it seems to be stuck on ivy.lock. Any ideas on how to continue? thanks Alec On Oct 12, 2014, at 7:38 PM, Sa Li sal...@gmail.com wrote: Hi
Re: kafka-web-console
Hi, Palak really? I terminated it since I truly thought it was stuck there, I will run it again. thanks Alec On Oct 12, 2014, at 7:35 PM, Palak Shah spala...@gmail.com wrote: Hi, I am sure you must have got it running by now, but in case you gave up earlier, just have patience and it will start. Even I had faced this issue. The console takes a lot of time to start, but eventually it does. So this is not an error :) Hope this helped, -Palak On Sat, Oct 11, 2014 at 9:00 AM, Sa Li sal...@gmail.com wrote: Hi, all I am installing kafka-web-console on ubuntu server, when I sbt package it, it stuck on waiting for ivy.lock root@DO-mq-dev:/home/stuser/kafkaprj/kafka-web-console# sbt package Loading /usr/share/sbt/bin/sbt-launch-lib.bash [info] Loading project definition from /home/stuser/kafkaprj/kafka-web-console/project Waiting for lock on /root/.ivy2/.sbt.ivy.lock to be available... any idea? and any suggestion while further install. thanks Alec
kafka-web-console
Hi, all I am installing kafka-web-console on an ubuntu server; when I sbt package it, it gets stuck waiting for ivy.lock: root@DO-mq-dev:/home/stuser/kafkaprj/kafka-web-console# sbt package Loading /usr/share/sbt/bin/sbt-launch-lib.bash [info] Loading project definition from /home/stuser/kafkaprj/kafka-web-console/project Waiting for lock on /root/.ivy2/.sbt.ivy.lock to be available... Any idea? And any suggestions for the rest of the install. thanks Alec
create topic in multiple node kafka cluster
Hi, All I set up a 3-node kafka cluster on top of a 3-node zk ensemble. Now I launch 1 broker on each node, but the brokers are randomly distributed across the zk ensemble, see DO-mq-dev.1 [zk: localhost:2181(CONNECTED) 1] ls /brokers/ids [0, 1] pof-kstorm-dev1.2 [zk: localhost:2181(CONNECTED) 1] ls /brokers/ids [] pof-kstorm-dev2.3 [zk: localhost:2181(CONNECTED) 1] ls /brokers/ids [2] which means zk1 hosts 2 brokers and zk3 hosts 1 broker. That raises a problem: I am unable to create a topic with replication, say 3; it throws this exception: Error while executing topic command replication factor: 3 larger than available brokers: 0 Is there any way to create a topic that is replicated throughout the entire zk ensemble? As I understand it, we would have to run more than 1 broker on a single zk server if we want to be able to create replicated topics? thanks -- Alec Li
Re: create topic in multiple node kafka cluster
Hi, I kinda doubt whether I make it as an ensemble, since it shows root@DO-mq-dev:/etc/zookeeper/conf# zkServer.sh status JMX enabled by default Using config: /etc/zookeeper/conf/zoo.cfg Mode: standalone Mode is standalone instead of something else, here is my zoo.cfg, I did follow the instruction to config it # http://hadoop.apache.org/zookeeper/docs/current/zookeeperAdmin.html # The number of milliseconds of each tick tickTime=2000 # The number of ticks that the initial # synchronization phase can take initLimit=10 # The number of ticks that can pass between # sending a request and getting an acknowledgement syncLimit=5 # the directory where the snapshot is stored. dataDir=/var/lib/zookeeper # Place the dataLogDir to a separate physical disc for better performance # dataLogDir=/disk2/zookeeper # the port at which the clients will connect clientPort=2181 # specify all zookeeper servers # The fist port is used by followers to connect to the leader # The second one is used for leader election DO-mq-dev.1=10.100.70.128:2888:3888 pof-kstorm-dev1.2=10.100.70.28:2888:3888 pof-kstorm-dev2.3=10.100.70.29:2888:3888 # To avoid seeks ZooKeeper allocates space in the transaction log file in # blocks of preAllocSize kilobytes. The default block size is 64M. One reason # for changing the size of the blocks is to reduce the block size if snapshots # are taken more often. (Also, see snapCount). #preAllocSize=65536 # Clients can submit requests faster than ZooKeeper can process them, # especially if there are a lot of clients. To prevent ZooKeeper from running # out of memory due to queued requests, ZooKeeper will throttle clients so that # there is no more than globalOutstandingLimit outstanding requests in the # system. The default limit is 1,000.ZooKeeper logs transactions to a # transaction log. After snapCount transactions are written to a log file a # snapshot is started and a new transaction log file is started. The default # snapCount is 10,000. 
#snapCount=1000 # If this option is defined, requests will be will logged to a trace file named # traceFile.year.month.day. #traceFile= # Leader accepts client connections. Default value is yes. The leader machine # coordinates updates. For higher update throughput at thes slight expense of # read throughput the leader can be configured to not accept clients and focus # on coordination. leaderServes=yes # Enable regular purging of old data and transaction logs every 24 hours autopurge.purgeInterval=24 autopurge.snapRetainCount=5 And myid in dataDir is At 10.100.70.128 /var/lib/zookeeper contains1 At 10.100.70.28 /var/lib/zookeeper contains2 At 10.100.70.29 /var/lib/zookeeper contains3 I did make myid as 1, 2, 3 corresponding 3 nodes before, but seeing something make such myid, it might be more accurate. Is there anything wrong or missing to not able to make it an ensemble? thanks Alec On Thu, Oct 9, 2014 at 12:06 PM, Guozhang Wang wangg...@gmail.com wrote: Sa, Usually you would not want to set up kafka brokers at the same machines with zk nodes, as that will add depending failures to the server cluster. Back to your original question, it seems your zk nodes do not form an ensemble, since otherwise their zk data should be the same. Guozhang On Thu, Oct 9, 2014 at 11:37 AM, Sa Li sal...@gmail.com wrote: Hi, All I setup a 3-node kafka cluster on top of 3-node zk ensemble. 
Now I launch 1 broker on each node, the brokers will be randomly distributed to zk ensemble, see DO-mq-dev.1 [zk: localhost:2181(CONNECTED) 1] ls /brokers/ids [0, 1] pof-kstorm-dev1.2 [zk: localhost:2181(CONNECTED) 1] ls /brokers/ids [] pof-kstorm-dev2.3 [zk: localhost:2181(CONNECTED) 1] ls /brokers/ids [2] which means zk1 hosts 2 brokers, zk3 hosts 1 brokers, that will raise a problem, that I am unable to create a topic with replications, say 3, it will throw such exceptions Error while executing topic command replication factor: 3 larger than available brokers: 0 Is there any ways that I can create a topic which can be replicated throughout entire zk ensemble as I know we will have to introduce more than 1 broker in single zk Server if we want to be able to create replicated topics/ thanks -- Alec Li -- -- Guozhang -- Alec Li
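One likely reason zkServer.sh reports Mode: standalone is the member lines in the quoted zoo.cfg: ZooKeeper only recognizes keys of the literal form server.<myid>, so hostname-style keys like DO-mq-dev.1=... are silently ignored and each node starts on its own. A corrected fragment for this ensemble, assuming the myid files 1, 2, 3 match these entries:

```
# zoo.cfg, identical on all three nodes; server.<N> must match the
# number in each node's /var/lib/zookeeper/myid file
server.1=10.100.70.128:2888:3888
server.2=10.100.70.28:2888:3888
server.3=10.100.70.29:2888:3888
```

Once all three nodes form a quorum, zkServer.sh status should report leader or follower instead of standalone, and the brokers registered under /brokers/ids will be visible from any node.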
postgresql consumer
Hi, All I set up a kafka cluster and plan to publish messages from the Web to kafka; the messages are in the form of json, and I want to implement a consumer that writes the messages to a postgresql DB, no aggregation at all. I was thinking of using KafkaSpout in storm to make it happen, but now I want to simplify the step: just use a kafka consumer to populate the messages into postgresql. This consumer should consume data and write it into postgresql in batches; if a server goes down, the consumer should be able to retrieve the data it stored on the hard drive without redundancy and resume consuming from where it stopped once the server is back up. Is there any sample code for this? thanks a lot Alec
Re: kafka producer performance test
Thanks, Jay,

Here is what I did this morning: I git cloned the latest version of kafka (I am currently using kafka 0.8.0; trunk is now 0.8.1.1), and it uses gradle to build the project. I am having trouble building it. I installed gradle and ran ./gradlew jar in the kafka root directory, and it comes out:

Error: Could not find or load main class org.gradle.wrapper.GradleWrapperMain

Any idea about this?

Thanks

Alec

On Wed, Oct 1, 2014 at 9:21 PM, Jay Kreps <jay.kr...@gmail.com> wrote:

Hi Sa,

That script was developed with the new producer that is included on trunk. Check out trunk and build, and it should be there.

-Jay

On Wed, Oct 1, 2014 at 7:55 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I built a 3-node kafka cluster and want to run performance tests. I found someone posted the following thread, which is exactly the problem I have:

While testing kafka producer performance, I found 2 testing scripts.

1) performance testing script in the kafka distribution

bin/kafka-producer-perf-test.sh --broker-list localhost:9092 --messages 1000 --topic test --threads 10 --message-size 100 --batch-size 1 --compression-codec 1

2) performance testing script mentioned in https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test6 5000 100 -1 acks=1 bootstrap.servers=esv4-hcl198.grid.linkedin.com:9092 buffer.memory=67108864 batch.size=8196

based on org.apache.kafka.clients.producer.Producer.

I was unable to run either of the above; I figure the commands are outdated. Can anyone point me to how to do such a test with the new commands?

thanks

Alec

-- 
Alec Li
can't run kafka example code
Hi, all

I want to run the example code shipped with the kafka package. I ran it as the README says:

To run the demo using scripts:

1. Start Zookeeper and the Kafka server
2. For the simple consumer demo, run bin/java-simple-consumer-demo.sh
3. For the unlimited producer-consumer run, run bin/java-producer-consumer-demo.sh

but I got this error:

:bin/../../project/boot/scala-2.8.0/lib/*.jar:bin/../../core/lib_managed/scala_2.8.0/compile/*.jar:bin/../../core/lib/*.jar:bin/../../core/target/scala_2.8.0/*.jar:bin/../../examples/target/scala_2.8.0/*.jar
Error: Could not find or load main class kafka.examples.SimpleConsumerDemo

But I already built the package under the kafka directory, and I can see the class in examples/target/classes/kafka/examples. Any idea about this issue?

thanks

-- 
Alec Li
Re: kafka producer performance test
Thanks Guozhang

I tried this as in KAFKA-1490:

git clone https://git-wip-us.apache.org/repos/asf/kafka.git
cd kafka
gradle

but it fails to build:

FAILURE: Build failed with an exception.

* Where:
Script '/home/stuser/trunk/gradle/license.gradle' line: 2

* What went wrong:
A problem occurred evaluating script.
> Could not find method create() for arguments [downloadLicenses, class nl.javadude.gradle.plugins.license.DownloadLicenses] on task set.

* Try:
Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output.

BUILD FAILED

Seems it is really not that straightforward to build.

thanks

On Thu, Oct 2, 2014 at 12:56 PM, Guozhang Wang <wangg...@gmail.com> wrote:

Hello Sa,

KAFKA-1490 introduces a new step of downloading the wrapper; details are included in the latest README file.

Guozhang

On Thu, Oct 2, 2014 at 11:00 AM, Sa Li <sal...@gmail.com> wrote:

Thanks, Jay,

Here is what I did this morning: I git cloned the latest version of kafka (I am currently using kafka 0.8.0; trunk is now 0.8.1.1), and it uses gradle to build the project. I am having trouble building it. I installed gradle and ran ./gradlew jar in the kafka root directory, and it comes out:

Error: Could not find or load main class org.gradle.wrapper.GradleWrapperMain

Any idea about this?

Thanks

Alec

On Wed, Oct 1, 2014 at 9:21 PM, Jay Kreps <jay.kr...@gmail.com> wrote:

Hi Sa,

That script was developed with the new producer that is included on trunk. Check out trunk and build, and it should be there.

-Jay

On Wed, Oct 1, 2014 at 7:55 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I built a 3-node kafka cluster and want to run performance tests. I found someone posted the following thread, which is exactly the problem I have:

While testing kafka producer performance, I found 2 testing scripts.

1) performance testing script in the kafka distribution

bin/kafka-producer-perf-test.sh --broker-list localhost:9092 --messages 1000 --topic test --threads 10 --message-size 100 --batch-size 1 --compression-codec 1

2) performance testing script mentioned in https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test6 5000 100 -1 acks=1 bootstrap.servers=esv4-hcl198.grid.linkedin.com:9092 buffer.memory=67108864 batch.size=8196

based on org.apache.kafka.clients.producer.Producer.

I was unable to run either of the above; I figure the commands are outdated. Can anyone point me to how to do such a test with the new commands?

thanks

Alec

-- 
Alec Li

-- 
-- Guozhang

-- 
Alec Li
can't gradle
I git cloned the latest kafka package; why can't I build it?

gradle

FAILURE: Build failed with an exception.

* Where:
Script '/home/ubuntu/kafka/gradle/license.gradle' line: 2

* What went wrong:
A problem occurred evaluating script.
> Could not find method create() for arguments [downloadLicenses, class nl.javadude.gradle.plugins.license.DownloadLicenses] on task set.

* Try:
Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output.

BUILD FAILED

Thanks

-- 
Alec Li
Re: kafka producer performance test
I still can't get gradle to build it, even after cloning the latest trunk; is anyone else having the same issue?

On Thu, Oct 2, 2014 at 1:55 PM, Sa Li <sal...@gmail.com> wrote:

Thanks Guozhang

I tried this as in KAFKA-1490:

git clone https://git-wip-us.apache.org/repos/asf/kafka.git
cd kafka
gradle

but it fails to build:

FAILURE: Build failed with an exception.

* Where:
Script '/home/stuser/trunk/gradle/license.gradle' line: 2

* What went wrong:
A problem occurred evaluating script.
> Could not find method create() for arguments [downloadLicenses, class nl.javadude.gradle.plugins.license.DownloadLicenses] on task set.

* Try:
Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output.

BUILD FAILED

Seems it is really not that straightforward to build.

thanks

On Thu, Oct 2, 2014 at 12:56 PM, Guozhang Wang <wangg...@gmail.com> wrote:

Hello Sa,

KAFKA-1490 introduces a new step of downloading the wrapper; details are included in the latest README file.

Guozhang

On Thu, Oct 2, 2014 at 11:00 AM, Sa Li <sal...@gmail.com> wrote:

Thanks, Jay,

Here is what I did this morning: I git cloned the latest version of kafka (I am currently using kafka 0.8.0; trunk is now 0.8.1.1), and it uses gradle to build the project. I am having trouble building it. I installed gradle and ran ./gradlew jar in the kafka root directory, and it comes out:

Error: Could not find or load main class org.gradle.wrapper.GradleWrapperMain

Any idea about this?

Thanks

Alec

On Wed, Oct 1, 2014 at 9:21 PM, Jay Kreps <jay.kr...@gmail.com> wrote:

Hi Sa,

That script was developed with the new producer that is included on trunk. Check out trunk and build, and it should be there.

-Jay

On Wed, Oct 1, 2014 at 7:55 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I built a 3-node kafka cluster and want to run performance tests. I found someone posted the following thread, which is exactly the problem I have:

While testing kafka producer performance, I found 2 testing scripts.

1) performance testing script in the kafka distribution

bin/kafka-producer-perf-test.sh --broker-list localhost:9092 --messages 1000 --topic test --threads 10 --message-size 100 --batch-size 1 --compression-codec 1

2) performance testing script mentioned in https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test6 5000 100 -1 acks=1 bootstrap.servers=esv4-hcl198.grid.linkedin.com:9092 buffer.memory=67108864 batch.size=8196

based on org.apache.kafka.clients.producer.Producer.

I was unable to run either of the above; I figure the commands are outdated. Can anyone point me to how to do such a test with the new commands?

thanks

Alec

-- 
Alec Li

-- 
-- Guozhang

-- 
Alec Li

-- 
Alec Li
Re: multi-node and multi-broker kafka cluster setup
Daniel, thanks for the reply.

Setting up the cluster is still a learning curve for me; ultimately we want to connect the kafka cluster to a storm cluster. As you mentioned, a single broker per node seems more efficient, but is that also good for handling multiple topics? In my case, if I build a 3-node kafka cluster with three brokers, that will certainly limit the replication factor, since as far as I understand the number of brokers must be greater than or equal to the replication factor.

For the zk servers, my understanding after playing around is: I should run a zk server on each kafka node. I could point zk.connect in the kafka server.properties at a single zk server, and all the broker info would be stored in that zk server. But I thought it might be better to store each broker's info in its local zk server, so that with zkCli.sh we can see things under /brokers/ids. Is that a good solution? That is the architecture I am using now.

thanks

On Tue, Sep 30, 2014 at 1:02 PM, Daniel Compton <d...@danielcompton.net> wrote:

Hi Sa

While it's possible to run multiple brokers on a single machine, I would be interested to hear why you would want to. Kafka is very efficient and can use all of the system resources under load. Running multiple brokers would increase zookeeper load, force resource sharing between the Kafka processes, and require more admin overhead.

Additionally, you almost certainly want to run three Zookeepers. Two Zookeepers give you no more reliability than one, because ZK voting is based on a majority vote; if neither ZK can reach a majority on its own, it will fail. More info at http://wiki.apache.org/hadoop/ZooKeeper/FAQ#A7

Daniel.

On 1/10/2014, at 4:35 am, Guozhang Wang <wangg...@gmail.com> wrote:

Hello,

In general it is not required to have the kafka brokers installed on the same nodes as the zk servers, and each node can host multiple kafka brokers: you just need to make sure they do not share the same port or the same data dir.

Guozhang

On Mon, Sep 29, 2014 at 8:31 PM, Sa Li <sal...@gmail.com> wrote:

Hi,

I am kind of a newbie to kafka. I plan to build a cluster with multiple nodes and multiple brokers on each node. I can find tutorials for setting up a multi-broker cluster on a single node, e.g. http://www.michael-noll.com/blog/2013/03/13/running-a-multi-broker-apache-kafka-cluster-on-a-single-node/ and some instructions for multi-node setups, but with a single broker on each node. I have not seen any documents that teach how to set up a multi-node cluster with multiple brokers on each node.

Some documents point out that we should install kafka on each node, which makes sense, and that all the brokers on each node should connect to the same zookeeper. I am confused, since I thought I could set up a zookeeper ensemble separately and have all the brokers connect to that cluster; the zk cluster doesn't have to be on the servers hosting kafka, but some tutorials say I should install zookeeper on each kafka node. Here is my plan:

- I have three nodes: kfserver1, kfserver2, kfserver3.
- kfserver1 and kfserver2 are configured as the zookeeper ensemble, which I have done: zk.connect=kfserver1:2181,kfserver2:2181
- broker1, broker2, broker3 are on kfserver1; broker4, broker5, broker6 are on kfserver2; broker7, broker8, broker9 are on kfserver3.

When configuring, the zk dataDir is a local directory on each node rather than a directory on the zk ensemble; is that correct? So far I could not make the above scheme work. Has anyone ever made a multi-node, multi-broker kafka cluster setup?

thanks

Alec

-- 
-- Guozhang

-- 
Alec Li
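(Editor's note: Guozhang's "same port and same data dir" advice comes down to a few per-broker properties. A sketch of one broker's server.properties under the naming from this thread — broker ids, ports, and paths here are made up for illustration; the emails write the zookeeper setting as zk.connect, while 0.8-era server.properties calls it zookeeper.connect:)

```properties
# server-1.properties -- a second broker on the same node would differ
# only in broker.id, port, and log.dirs
broker.id=1
port=9092
log.dirs=/tmp/kafka-logs-1

# Every broker on every node points at the same ensemble connect string,
# rather than each broker at its local zk server.
zookeeper.connect=kfserver1:2181,kfserver2:2181,kfserver3:2181
```

With all brokers sharing one connect string, `ls /brokers/ids` in zkCli.sh lists every broker regardless of which ensemble member you query.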
Re: multi-node and multi-broker kafka cluster setup
Just to clarify: I am using a 3-node zkServer ensemble, myid: 1, 2, 3. But in the server.properties of each broker, I point zk.connect at localhost, which means each broker's info is stored in its local zkServer. I know it is a bit weird, rather than letting the zkServer leader manage the broker info automatically.

On Thu, Oct 2, 2014 at 2:25 PM, Sa Li <sal...@gmail.com> wrote:

Daniel, thanks for the reply.

Setting up the cluster is still a learning curve for me; ultimately we want to connect the kafka cluster to a storm cluster. As you mentioned, a single broker per node seems more efficient, but is that also good for handling multiple topics? In my case, if I build a 3-node kafka cluster with three brokers, that will certainly limit the replication factor, since as far as I understand the number of brokers must be greater than or equal to the replication factor.

For the zk servers, my understanding after playing around is: I should run a zk server on each kafka node. I could point zk.connect in the kafka server.properties at a single zk server, and all the broker info would be stored in that zk server. But I thought it might be better to store each broker's info in its local zk server, so that with zkCli.sh we can see things under /brokers/ids. Is that a good solution? That is the architecture I am using now.

thanks

On Tue, Sep 30, 2014 at 1:02 PM, Daniel Compton <d...@danielcompton.net> wrote:

Hi Sa

While it's possible to run multiple brokers on a single machine, I would be interested to hear why you would want to. Kafka is very efficient and can use all of the system resources under load. Running multiple brokers would increase zookeeper load, force resource sharing between the Kafka processes, and require more admin overhead.

Additionally, you almost certainly want to run three Zookeepers. Two Zookeepers give you no more reliability than one, because ZK voting is based on a majority vote; if neither ZK can reach a majority on its own, it will fail. More info at http://wiki.apache.org/hadoop/ZooKeeper/FAQ#A7

Daniel.

On 1/10/2014, at 4:35 am, Guozhang Wang <wangg...@gmail.com> wrote:

Hello,

In general it is not required to have the kafka brokers installed on the same nodes as the zk servers, and each node can host multiple kafka brokers: you just need to make sure they do not share the same port or the same data dir.

Guozhang

On Mon, Sep 29, 2014 at 8:31 PM, Sa Li <sal...@gmail.com> wrote:

Hi,

I am kind of a newbie to kafka. I plan to build a cluster with multiple nodes and multiple brokers on each node. I can find tutorials for setting up a multi-broker cluster on a single node, e.g. http://www.michael-noll.com/blog/2013/03/13/running-a-multi-broker-apache-kafka-cluster-on-a-single-node/ and some instructions for multi-node setups, but with a single broker on each node. I have not seen any documents that teach how to set up a multi-node cluster with multiple brokers on each node.

Some documents point out that we should install kafka on each node, which makes sense, and that all the brokers on each node should connect to the same zookeeper. I am confused, since I thought I could set up a zookeeper ensemble separately and have all the brokers connect to that cluster; the zk cluster doesn't have to be on the servers hosting kafka, but some tutorials say I should install zookeeper on each kafka node. Here is my plan:

- I have three nodes: kfserver1, kfserver2, kfserver3.
- kfserver1 and kfserver2 are configured as the zookeeper ensemble, which I have done: zk.connect=kfserver1:2181,kfserver2:2181
- broker1, broker2, broker3 are on kfserver1; broker4, broker5, broker6 are on kfserver2; broker7, broker8, broker9 are on kfserver3.

When configuring, the zk dataDir is a local directory on each node rather than a directory on the zk ensemble; is that correct? So far I could not make the above scheme work. Has anyone ever made a multi-node, multi-broker kafka cluster setup?

thanks

Alec

-- 
-- Guozhang

-- 
Alec Li

-- 
Alec Li
Re: can't gradle
Thank you all, I am able to build with gradle now. Here was my mistake: I had installed gradle both via apt-get and from the gradle website, but the system automatically picked the apt-get gradle, and that version is quite outdated. What I did was apt-get remove gradle and add the newer gradle to the PATH in /etc/environment; now it works. Hope this helps anyone who has a similar problem: always download and install the latest version.

On Thu, Oct 2, 2014 at 3:02 PM, Jun Rao <jun...@gmail.com> wrote:

Hmm, not sure what the issue is. You can also just copy the following files from the 0.8.1 branch:

gradle/wrapper/
  gradle-wrapper.jar
  gradle-wrapper.properties

Thanks,
Jun

On Thu, Oct 2, 2014 at 2:05 PM, Sa Li <sal...@gmail.com> wrote:

I git cloned the latest kafka package; why can't I build it?

gradle

FAILURE: Build failed with an exception.

* Where:
Script '/home/ubuntu/kafka/gradle/license.gradle' line: 2

* What went wrong:
A problem occurred evaluating script.
> Could not find method create() for arguments [downloadLicenses, class nl.javadude.gradle.plugins.license.DownloadLicenses] on task set.

* Try:
Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output.

BUILD FAILED

Thanks

-- 
Alec Li

-- 
Alec Li
kafka producer performance test
Hi, All

I built a 3-node kafka cluster and want to run performance tests. I found someone posted the following thread, which is exactly the problem I have:

While testing kafka producer performance, I found 2 testing scripts.

1) performance testing script in the kafka distribution

bin/kafka-producer-perf-test.sh --broker-list localhost:9092 --messages 1000 --topic test --threads 10 --message-size 100 --batch-size 1 --compression-codec 1

2) performance testing script mentioned in https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test6 5000 100 -1 acks=1 bootstrap.servers=esv4-hcl198.grid.linkedin.com:9092 buffer.memory=67108864 batch.size=8196

based on org.apache.kafka.clients.producer.Producer.

I was unable to run either of the above; I figure the commands are outdated. Can anyone point me to how to do such a test with the new commands?

thanks

Alec
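(Editor's note: whichever script ends up working, the measurement itself is just "send N messages of a fixed size and divide by elapsed time", which is what both tools report. A stdlib-only sketch of that harness; the no-op `send_fn` below stands in for a real producer's send call — with a Python client you would pass the client's send function instead, and all names here are illustrative assumptions:)

```python
import time

def measure_throughput(send_fn, num_messages=1000, message_size=100):
    """Send num_messages payloads of message_size bytes; report rates."""
    payload = b"x" * message_size
    start = time.time()
    for _ in range(num_messages):
        send_fn(payload)
    elapsed = max(time.time() - start, 1e-9)  # guard against zero division
    return {
        "records_per_sec": num_messages / elapsed,
        "mb_per_sec": num_messages * message_size / elapsed / (1024 * 1024),
    }

if __name__ == "__main__":
    stats = measure_throughput(lambda msg: None)  # no-op send: harness only
    print("%.0f records/sec, %.2f MB/sec"
          % (stats["records_per_sec"], stats["mb_per_sec"]))
```

Note that a loop like this measures the client-side send path only; acks and batching settings (as in the LinkedIn command above) decide what "sent" actually means.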
Re: kafka producer performance test
Hi, Ravi

Thanks for the reply. This is how I built the kafka 0.8 package:

$ git clone https://git-wip-us.apache.org/repos/asf/kafka.git
$ cd /etc/kafka
$ git checkout -b 0.8 remotes/origin/0.8
$ ./sbt update
$ ./sbt package
$ ./sbt assembly-package-dependency

So I believe I have already built it, but I'm still not able to run it; any clues?

thanks

Alec

On Oct 1, 2014, at 9:13 PM, ravi singh <rrs120...@gmail.com> wrote:

It is available with the Kafka package containing the source code. Download the package, build it, and run the above command.

Regards,
Ravi

On Wed, Oct 1, 2014 at 7:55 PM, Sa Li <sal...@gmail.com> wrote:

Hi, All

I built a 3-node kafka cluster and want to run performance tests. I found someone posted the following thread, which is exactly the problem I have:

While testing kafka producer performance, I found 2 testing scripts.

1) performance testing script in the kafka distribution

bin/kafka-producer-perf-test.sh --broker-list localhost:9092 --messages 1000 --topic test --threads 10 --message-size 100 --batch-size 1 --compression-codec 1

2) performance testing script mentioned in https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test6 5000 100 -1 acks=1 bootstrap.servers=esv4-hcl198.grid.linkedin.com:9092 buffer.memory=67108864 batch.size=8196

based on org.apache.kafka.clients.producer.Producer.

I was unable to run either of the above; I figure the commands are outdated. Can anyone point me to how to do such a test with the new commands?

thanks

Alec

-- 
*Regards,*
*Ravi*
multi-node and multi-broker kafka cluster setup
Hi,

I am kind of a newbie to kafka. I plan to build a cluster with multiple nodes and multiple brokers on each node. I can find tutorials for setting up a multi-broker cluster on a single node, e.g. http://www.michael-noll.com/blog/2013/03/13/running-a-multi-broker-apache-kafka-cluster-on-a-single-node/ and some instructions for multi-node setups, but with a single broker on each node. I have not seen any documents that teach how to set up a multi-node cluster with multiple brokers on each node.

Some documents point out that we should install kafka on each node, which makes sense, and that all the brokers on each node should connect to the same zookeeper. I am confused, since I thought I could set up a zookeeper ensemble separately and have all the brokers connect to that cluster; the zk cluster doesn't have to be on the servers hosting kafka, but some tutorials say I should install zookeeper on each kafka node. Here is my plan:

- I have three nodes: kfserver1, kfserver2, kfserver3.
- kfserver1 and kfserver2 are configured as the zookeeper ensemble, which I have done: zk.connect=kfserver1:2181,kfserver2:2181
- broker1, broker2, broker3 are on kfserver1; broker4, broker5, broker6 are on kfserver2; broker7, broker8, broker9 are on kfserver3.

When configuring, the zk dataDir is a local directory on each node rather than a directory on the zk ensemble; is that correct? So far I could not make the above scheme work. Has anyone ever made a multi-node, multi-broker kafka cluster setup?

thanks

Alec