New ConcurrentModificationExceptions after Storm 0.10.0

2016-03-02 Thread Stephen Powis
Hey!

Did anything change between Storm 0.9.5 and 0.10.0 regarding
ConcurrentModificationExceptions and how they are detected? We've had a
topology running for the last 6 months or so and never saw this exception.

After upgrading to Storm 0.10.x, which didn't require any changes to our
topology/bolt/business logic, we're now seeing these intermittently and
have been struggling to see where we've gone wrong. We don't seem to be
modifying values in the emitted tuples anywhere after emitting.

Thanks!
Stephen
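
For readers following this thread, here is a minimal sketch of the general
failure mode the question is about. It is plain Java with no Storm API, and
it does not claim to be the cause of the exceptions reported above. The class
EmitCopyDemo and the method failsAfterMutation are hypothetical names made up
for illustration; the iterator only stands in for whatever reads a collection
after it has been handed off.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.ConcurrentModificationException;
import java.util.Iterator;
import java.util.List;

public class EmitCopyDemo {
    // Hypothetical helper: returns true if reading `emitted` fails once
    // `source` has been mutated after the hand-off.
    static boolean failsAfterMutation(List<String> source, List<String> emitted) {
        // Stands in for a downstream reader that starts walking the emitted list.
        Iterator<String> reader = emitted.iterator();
        // The sender keeps modifying its own list after the hand-off.
        source.add("late-write");
        try {
            reader.next();
            return false;
        } catch (ConcurrentModificationException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // Case 1: the very same list instance is handed off and then mutated.
        List<String> shared = new ArrayList<>(Arrays.asList("a", "b"));
        System.out.println(failsAfterMutation(shared, shared));

        // Case 2: a defensive copy is handed off; the original can still change.
        List<String> state = new ArrayList<>(Arrays.asList("a", "b"));
        System.out.println(failsAfterMutation(state, new ArrayList<>(state)));
    }
}
```

If some piece of bolt state (a list or map held in a field) ends up inside an
emitted value, handing off a copy as in the second case keeps later writes to
that state away from whatever is still reading the emitted value.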


Re: Best practices for running Storm, HBase, Kafka (regarding Zookeeper cluster)

2016-03-02 Thread Peyman Mohajerian
Option 1 is clearly more reliable, but some choose option 2 to reduce the
overhead of running multiple ZK clusters, from both a resource and a
management point of view. I've seen both, and more than anything else the
choice depends on your workload on each of the components. For example,
Kafka and Storm may share one ZK cluster, with HBase split out onto its own
if the load from HBase is much higher. Another practice I have seen is
increasing the ZK ensemble from the default of 3 nodes to 5 to handle the
additional load; I think that's more common.

On Wed, Mar 2, 2016 at 2:37 AM, Spico Florin  wrote:

> Hello!
>  I would like to know how best to run the three systems with regard to
> Zookeeper cluster usage:
>  1. a separate cluster per system (ZK cluster/Storm, ZK cluster/HBase,
> ZK cluster/Kafka)
>  2. a single cluster for all of them (Storm, HBase, Kafka -> single ZK cluster)
>
> In my opinion the first is the best option (since each system uses ZK
> differently, and with a shared cluster we could have performance and
> availability issues).
>
> I look forward to your advice.
>   Regards,
>  Florin


Best practices for running Storm, HBase, Kafka (regarding Zookeeper cluster)

2016-03-02 Thread Spico Florin
Hello!
 I would like to know how best to run the three systems with regard to
Zookeeper cluster usage:
 1. a separate cluster per system (ZK cluster/Storm, ZK cluster/HBase,
ZK cluster/Kafka)
 2. a single cluster for all of them (Storm, HBase, Kafka -> single ZK cluster)

In my opinion the first is the best option (since each system uses ZK
differently, and with a shared cluster we could have performance and
availability issues).

I look forward to your advice.
  Regards,
 Florin