Re: Problem to submit a topology to my storm cluster

2015-07-15 Thread cutoutsy Feng
Exception: Found multiple defaults.yaml resources. You're probably bundling the Storm jars with your topology jar. You can read the source code in storm-core/jvm/utils/Utils.java getConfigFileInputStream() 2015-07-15 22:12 GMT+08:00 charlie quillard charlie.quill...@epitech.eu: Yes thank you, i put

Re: storm bolt stops processing after about 2500 tuples

2015-07-15 Thread Soumi
Hi Harsha, It's not that the last bolt is slow. After it processes around 2500 tuples, it does not receive any more. The second-to-last bolt is still emitting, but the last bolt is not getting any more tuples. The question is: what happens to the tuples emitted by the second-to-last bolt? Why do those not reach

Re: storm bolt stops processing after about 2500 tuples

2015-07-15 Thread Harsha
Soumi, if your downstream bolt doesn't ack before the tuple timeout (by default it's 30 secs), Storm will consider it a failed tuple and the Kafka spout will replay it. Since your last bolt is slower in acking, maybe you shouldn't anchor the tuple to the last bolt. -harsha On Wed, Jul 15,

Re: Re-emitting failed tuples

2015-07-15 Thread Matthias J. Sax
What Spout do you use? Failing tuples result in callbacks to Spout.fail(). If you use your own Spout implementation, you need to override this method. The default implementation does nothing. Or do you already use a (so-called) reliable Spout? -Matthias On 07/15/2015 07:37 AM, Rahul wrote:

Trident and bolts in same topology (Hybrid)?

2015-07-15 Thread Tousif
Hello, I have a topology with a Kafka spout and multiple bolts. Now I want to do batch processing on the same data that the bolts have processed. Is it possible to have both? Can anyone point me to documentation or an example? -- Regards Tousif Khazi

RE: DELIVERY FAILURE: Error transferring to GAPAR017/SRV/SOCGEN mail.box; Maximum hop count exceeded. Message probably in a routing loop.

2015-07-15 Thread charlie quillard
Hi, Yes, I have the same problem. If this problem recurs, we need to ask an administrator to delete this user. Best regards, Charlie From: Matthias J. Sax mj...@informatik.hu-berlin.de Sent: Wednesday, 15 July 2015 11:02 To: user@storm.apache.org

Using Storm on Windows OS cluster

2015-07-15 Thread Spico Florin
Hello! I would like to ask you the following: 1. Is anyone using Storm deployed on a Windows OS cluster (multi-node, Windows-based machines)? 2. If yes, is it only for testing purposes or also in production? I found a discussion about using Storm on a Windows cluster here:

How to Re-balance Trident Topology

2015-07-15 Thread Aride Chettali
Hi, From this https://storm.apache.org/documentation/Understanding-the-parallelism-of-a-Storm-topology.html documentation link I learned that re-balancing a topology is as easy as the following: storm rebalance mytopology -n 5 -e blue-spout=3 -e yellow-bolt=10 But how do I re-balance a trident

RE: Problem to submit a topology to my storm cluster

2015-07-15 Thread charlie quillard
Hi, Just following up on my previous email, I found a similar problem on Stack Overflow (http://stackoverflow.com/questions/29917158/apache-storm-java-lang-noclassdeffounderror-com-google-gson-gson); this one gives a solution with the maven-shade-plugin. I tried this solution and that

storm bolt stops processing after about 2500 tuples

2015-07-15 Thread Soumi
Hello storm users, We have a simple topology with one kafka-spout and a series of 5 bolts. The first bolt receives ~3K tuples/second from the kafka-spout and emits only ~500/second. The last bolt processes very few tuples (~100 an hour) and sends its output to Kafka. The execute latencies in all bolts

Re: Realtime computations using storm - questions on performance

2015-07-15 Thread Nathan Leung
Two things. Your math may be off depending on parallelism. One emit from A becomes 100 emitted from C, and you are joining all of them. Second, try the default number of ackers (one per worker). All your ack traffic is going to a single task. Also you can try local or shuffle grouping if

presistentAggregate problem

2015-07-15 Thread Hermann Gábor
Hi all, I've been trying to set up a really simple Trident topology to try *persistentAggregate* with a ReducerAggregator based on the examples found in the docs and I might have got something wrong because my topology fails with ArrayIndexOutOfBoundsException when trying to retrieve a value from

Re: Re-emitting failed tuples

2015-07-15 Thread Rahul
Thanks Matthias. I am using my own Spout implementation. I was under the assumption that Storm, by default, re-emits the tuples when Spout.fail() is called. Now I realize I have to implement the changes myself. Could you let me know about any Spout which has done the re-emit implementation?
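
The re-emit bookkeeping discussed in this thread can be sketched roughly as below. This is only an illustration, not code from any existing Spout: ReplayBuffer, Emission, offer(), next() and pendingCount() are hypothetical names, and the class deliberately does not depend on Storm. The assumption is that a custom spout would call next() from nextTuple() and emit the value together with the returned message ID, then forward Storm's ack(msgId) and fail(msgId) callbacks to the methods of the same name.

```java
import java.util.ArrayDeque;
import java.util.HashMap;
import java.util.Map;
import java.util.Queue;

// Dependency-free sketch: NOT a Storm class, only the bookkeeping a custom spout might keep.
final class ReplayBuffer<T> {

    /** A value paired with the message ID it was emitted under. */
    static final class Emission<T> {
        final long msgId;
        final T value;

        Emission(long msgId, T value) {
            this.msgId = msgId;
            this.value = value;
        }
    }

    private final Queue<T> incoming = new ArrayDeque<>();    // values not yet emitted
    private final Queue<Long> retries = new ArrayDeque<>();  // failed IDs waiting for replay
    private final Map<Long, T> pending = new HashMap<>();    // emitted but not yet acked
    private long nextId = 0;

    /** Queue a new value for emission. */
    void offer(T value) {
        incoming.add(value);
    }

    /** What nextTuple() would emit: failed tuples first, then new ones; null when idle. */
    Emission<T> next() {
        Long retryId = retries.poll();
        if (retryId != null) {
            // Replay under the same message ID so a later ack/fail still matches.
            return new Emission<>(retryId, pending.get(retryId));
        }
        T value = incoming.poll();
        if (value == null) {
            return null;
        }
        long id = nextId++;
        pending.put(id, value);
        return new Emission<>(id, value);
    }

    /** Fully processed: forget the tuple. */
    void ack(long msgId) {
        Long id = Long.valueOf(msgId);
        pending.remove(id);
        retries.remove(id);
    }

    /** Failed or timed out: schedule a replay if the tuple is still pending. */
    void fail(long msgId) {
        Long id = Long.valueOf(msgId);
        if (pending.containsKey(id) && !retries.contains(id)) {
            retries.add(id);
        }
    }

    /** Number of emitted tuples that have not been acked yet. */
    int pendingCount() {
        return pending.size();
    }
}
```

Keeping the value in the pending map until it is acked is what makes a replay possible at all; the cost is memory that grows with the number of in-flight tuples, so a real spout would probably also want a cap on pending tuples or on retries per tuple.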

RE: Problem to submit a topology to my storm cluster

2015-07-15 Thread charlie quillard
Yes, thank you. I set my storm-core dependency to provided and now I am running my topology on my cluster. Normally, adding the maven-shade-plugin should have resolved the NoClassDefFoundError, but I still get them at runtime. I would like to know if someone has solved this problem with a JNI library in the past.

Re: Using Storm on Windows OS cluster

2015-07-15 Thread Harsha
Storm does support a multi-node setup on Windows. Our customers are using it in a multi-node setup. We haven't tested the security features that were recently released in 0.10, but a non-secure setup will work. -Harsha On Wed, Jul 15, 2015, at 06:09 AM, Bobby Evans wrote: Storm does support multi-node on windows,

Re: Problem to submit a topology to my storm cluster

2015-07-15 Thread Matthias J. Sax
Your jar file contains two copies of defaults.yaml. You need to make sure that there is at most one. Do you include storm-core.jar in your own jar? In that case, exclude the defaults.yaml that is contained in storm-core.jar. -Matthias On 07/15/2015 02:08 PM, charlie quillard wrote: Hi, Just

Re: How fast can bolt access emitted data

2015-07-15 Thread Nathan Leung
I don't think you adequately addressed Seungtack's concern. I would run a simple topology as a test, with something like 2 workers, spout - empty bolt - empty bolt - empty bolt all with parallelism 2, and all on shuffle grouping. If storm is behaving as poorly as you think it is, then you will