Re: how to freeup workers in storm

2014-09-15 Thread Spico Florin
Hello! What is your conf/storm.yaml configuration? If you want to rellocate some of your components to different workers you need to do a special scheduler as explained here: http://xumingming.sinaapp.com/885/twitter-storm-how-to-develop-a-pluggable-scheduler/ In this example you are rellocatio

Re: Can storm cluster be separated with zookeeper cluster?

2014-09-15 Thread Spico Florin
Hello! There is no such a restriction. In storm.yaml you have the storm.zookeeper.servers section: storm.zookeeper.servers: - "server1" - "server2" Here you can configure your zookeper cluster. If you are using Amazon EC2, a good start for testing Storm in cloud, is Kasper' Madsen project

Re: TOPOLOGY_MAX_SPOUT_PENDING working only when the spout emits anchored tuples?

2014-09-10 Thread Spico Florin
Hello, Nathan! Thank you for confirm me this. Regards, Florin On Wed, Sep 10, 2014 at 2:55 PM, Nathan Leung wrote: > Yes, it works only with anchored tuples. If the tuple is un anchored there > is no way for the spout to know when it's been fully processed. > On Sep 10, 2014 4

Re: Limited number of arguments for the submitted topology (more than 9)

2014-09-10 Thread Spico Florin
that there is a simple solution. Best regards, Florin On Wed, Sep 10, 2014 at 1:54 PM, 潘臻轩 wrote: > why don't you use config.properties ? > > 2014-09-10 18:40 GMT+08:00 Spico Florin : > >> Hello! >> I have a topology that can take as input many arguments (more than

Limited number of arguments for the submitted topology (more than 9)

2014-09-10 Thread Spico Florin
Hello! I have a topology that can take as input many arguments (more than 9) set up in the storm.cmd line. How can I overpass this limitation without the brute force on the storm.cmd command (by adding there more parameters as I needed)? I look forward for your answers. Regards, Florin

Re: batch job processing

2014-09-10 Thread Spico Florin
Hello! I'll consider to slow down the spout. Set up a value for Config.TOPOLOGY_MAX_SPOUT_PENDING. It can happen that Cassanda, Solr and CouchDatabase do not cope with the requency that you emit your messages and thus you have backpressure. Also, the spout should emit the messages anchored (( n

TOPOLOGY_MAX_SPOUT_PENDING working only when the spout emits anchored tuples?

2014-09-10 Thread Spico Florin
Hello! I would like to know if the set up for TOPOLOGY_MAX_SPOUT_PENDING will be applied only when the spout was set up for emitting anchored tuples? I've made some testes and I could see that the spout was slow down only when the tuples were anchored. When the tuples were not anchored then, wit

Re: How big a cluster Nimbus to handle

2014-09-09 Thread Spico Florin
Hello! How many ZK nodes you are using? If are adding more zk nodes will be then well load balanced for the storm-cluster? What is the information that is exchanged via zk and how can I see it? I had a look on the Exhibitor but without help to get this information. I have a topology with aprox 900

Re: Running Topology on Remote Machine

2014-09-09 Thread Spico Florin
Hello, Vikas! I'm interested in this problem too. How can you run the worker java command on console? Can you please give an example? Thanks. Regards, Florin On Tue, Sep 9, 2014 at 10:27 AM, Vikas Agarwal wrote: > I stuck there for 2 weeks. :) > > Basically, I tried to debug every command th

Re: Error while loading a big topology (900 bolts) on a single node: java.lang.StackOverflowError

2014-09-06 Thread Spico Florin
p size above 2g? Thanks. Florin On Sat, Sep 6, 2014 at 4:49 PM, Spico Florin wrote: > Hello! > I have a topology that has 900 bolts. While deploying it, I'm getting > the following error in the supervisor.log. > Do you know what is the problem? > 2014-09-06 15:38:56 b.s.

Error while loading a big topology (900 bolts) on a single node: java.lang.StackOverflowError

2014-09-06 Thread Spico Florin
Hello! I have a topology that has 900 bolts. While deploying it, I'm getting the following error in the supervisor.log. Do you know what is the problem? 2014-09-06 15:38:56 b.s.event [ERROR] Error when processing event java.lang.StackOverflowError: null at java.lang.ref.ReferenceQueue.po

Zookeeper version

2014-09-06 Thread Spico Florin
Hello! I have installed the zookeeper 3.4.6 and storm 0.9.2-incubating. I've observed that storms uses zookeeper-3.4.5 library. Should this version be in sync? Meaning, if I'm changing the zookeeper library with 3.4.6 will storm compatible with it? I look forward for your answers. Regards, F

Re: Processing large serial data flow strategy

2014-09-05 Thread Spico Florin
er for network issues >> could be when a node with a faster connection than the rest contacts a 3rd >> party API and therefore makes sense to assign all tasks that make these >> calls to that node. >> >> Hope that helps. >> Regards, >> Michael >> >

Topology dies in LocalCluster mode with zookeperKeeper ConnectionLossException ErrorCode = ConnectionLoss

2014-09-05 Thread Spico Florin
Hello! I'm running (in LocalCluster) a topology containing one spout that generates data and 600 dummy bolts that are passing the data from one to the other like a chain, meaning: Spout->B1->B2->...->B600. After a while the abolve exception occurs. Can you please advice me how to avoid this error

Processing large serial data flow strategy

2014-09-05 Thread Spico Florin
Hello! I have a test case that involves a dataflow containing a spout and N serial bolts. For better understanding the dataflow looks like this: Spout->B_1->B_2->...->B_N where the "->" repersents the transition of the data from one bolt to another. Given the fact that by default Storm is using

Re: Topology Restart due to Executor Not Alive

2014-09-02 Thread Spico Florin
Hello! The nimbus.childopts is to set up the heap for the nimbus master node. You can set up this either. For the worker use worker.childopts. Please respond to this mail if it help myou or how did you solve it. Regards, Florin On Tue, Sep 2, 2014 at 11:54 AM, Spico Florin wrote: > He

Re: java.io.FileNotFoundException: File '.../stormconf.ser' does not exist

2014-09-02 Thread Spico Florin
Hello! I also encountered the same issue. Perhaps someone will explain the reason for this error (maybe an inconsistent state of the supervisor, a killed topology that didn't succeed ???) Stop your supervisor. Delete the the storm-local folder. And restart the supervisor daemon. Check if is runn

Re: Topology Restart due to Executor Not Alive

2014-09-02 Thread Spico Florin
Hello! I have encountered the same issue in a case of out of memory in worker process. Try increase the memory of the wokers by setting nimbus.childopts property. Also, if you are creating short living object at higher rate use +UseG1GC . Since you are saying that you hold data in your memory, I'

Re: issue in storm submit on multiple machine

2014-08-29 Thread Spico Florin
Hello! Perhaps you need a combination of scheduling of topology isolation https://storm.incubator.apache.org/2013/01/11/storm082-released.html and customized scheduling http://xumingming.sinaapp.com/885/twitter-storm-how-to-develop-a-pluggable-scheduler/ . Hope that these help. Regards, Florin

Re: [DISCUSS] Apache Storm Release 0.9.3/0.10.0

2014-08-29 Thread Spico Florin
Hello! Any improvements regarding the performance that could handle the load related on https://www.loggly.com/blog/what-we-learned-about-scaling-with-apache-storm/ and also on https://developer.ibm.com/streamsdev/wp-content/uploads/sites/15/2014/04/Streams-and-Storm-April-2014-Final.pdf ? I look

Re: How to stress test storm?

2014-08-18 Thread Spico Florin
Hello! The spout or bolt are running as instances (tasks) inside in an executor threa. You can run more tasks per executor thread by they will tun sequential. You can read more about the storm parallelism here: http://storm.incubator.apache.org/documentation/Understanding-the-parallelism-of-a-

Re: Cleanup method not called for the BaseBasicBolt when the topology is killed

2014-08-18 Thread Spico Florin
public void cleanup() { csvWriter.close() } } public class CleanupShutdownHook { public void attachShutDownHook() { Runtime.getRuntime().addShutdownHook(new Thread() { @Override public void run() { cleanup(); } }); } } On Mon, Aug 18, 2014 at 3:12 PM, Spico Florin wrote: > Hello! &

Cleanup method not called for the BaseBasicBolt when the topology is killed

2014-08-18 Thread Spico Florin
Hello! I'm using the storm version 0.9.2 incubating. I have a CSV file is initialized in the prepare method of a BaseBasicBolt prepare() { csvWriter = new CSVWriter(new BufferedWriter(new FileWriter("myFile"))); } and I would like to close it when I kill the topology public void cleanup()

Re: Unable to deploy jar onto nimbus

2014-08-16 Thread Spico Florin
e-of-frame-size (look at the bottom of the page the Adam's response) set up a larger size. I hope that these help. Good luck. Florin On Fri, Aug 15, 2014 at 6:52 PM, Spico Florin wrote: > Hello! > How many bolts do you have in the topology? I'm getting the same error > when

Re: Unable to deploy jar onto nimbus

2014-08-15 Thread Spico Florin
Hello! How many bolts do you have in the topology? I'm getting the same error when I have more than 20 bolts in the topology. My cluster configuration is: 1 ZK, 4 Woker Machines. Is there any limitation regarding the number of bolts in the topology or is due the fact that I have 1 ZK in the topol

Re: Storm submit problem

2014-08-11 Thread Spico Florin
Hello! By default when you are sending your jar file to the nimbus, with the "storm jar" command the default host for nimbus is localhost. If you'd like to specify a different one you can say something like: storm jar-c nimbus.host= A different solution is to create a folder named .storm in

Re: Jar not working

2014-08-11 Thread Spico Florin
Hello! Check your topology if it runs locally. Log your bolts, spouts to check that are initialized correctly. Check that your jat file contains the required libraries (except the storm jar files). It is possible to have exception in the initialization phase in one of them. I hope that these he

Re: Cannot create run the topology due to java.io.NotSerializableException: java.util.concurrent.CountDownLatch

2014-08-08 Thread Spico Florin
ing using a network > connection, open file, etc, should be created in the prepare() method, and > not on construction. > > > On Thu, Aug 7, 2014 at 10:43 AM, Spico Florin > wrote: > >> Hello! >>I have a bolt that is using a third pa

Cannot create run the topology due to java.io.NotSerializableException: java.util.concurrent.CountDownLatch

2014-08-07 Thread Spico Florin
Hello! I have a bolt that is using a third party class that has a reference to java.util.concurrent.CountDownLatch. Due to this fact. I'm getting the following exception: Exception in thread "main" java.lang.RuntimeException: java.io.NotSerializableException: java.util.concurrent.CountDownLatch

Re: Task colocation in the same JVM or same node

2014-08-06 Thread Spico Florin
app.com/885/twitter-storm-how-to-develop-a-pluggable-scheduler/ > > > On Wed, Aug 6, 2014 at 5:08 AM, Spico Florin > wrote: > >> Hello! >>I have a use case where I need that two bolts should be colocated >> either on the same worker JVM or in the same node. &

Task colocation in the same JVM or same node

2014-08-06 Thread Spico Florin
Hello! I have a use case where I need that two bolts should be colocated either on the same worker JVM or in the same node. We would like to know about this feature for the following reasons: 1. Computing the time that took for a tuple to be processed by the whole topology Suppose that you h

Storm UI 0.9.2 bug (num workers displaying the num tasks and vice-versa)

2014-08-04 Thread Spico Florin
Hello! In the Storm UI 0.9.2 I have observed that the num executors field is displaying the number of workers and the num workers is displaying the num executors. Is this a reported bug? Best regards, Florin

Re: Implementing a barrier mechanism in storm

2014-07-31 Thread Spico Florin
olt is stream grouped by "msgId" - you could keep the partially >> processed results in memory (or in a persisted state somewhere) - till you >> see the processed result for all the bolts. >> >> I would also expire msgIds which have not seen further results for beyo

Implementing a barrier mechanism in storm

2014-07-31 Thread Spico Florin
Hello! I have a case study where the same message (identified by an id) is spread over a couple of processing bolts and a final bolt should act as a barrier. This final bolt should do its job on the message ID only when all the upfront bolts have finished their process on the message. As sketch

Re: Can i get the metrics(data) from storm cluster regarding traffic load

2014-07-25 Thread Spico Florin
Hello! I'm interesting in this subject too. Can you please point out where in the StormUI you'll find this feature? Thanks. Best regards, Florin On Fri, Jul 25, 2014 at 4:36 AM, Srinath C wrote: > I think the latest storm 0.9.2-incubating has a graphical representation > of your topology wit

Re: Configuring a storm cluster with wirbelsturm

2014-07-21 Thread Spico Florin
l > servers. Take a look here: > > https://github.com/miguno/wirbelsturm/blob/master/docs/Wirbelsturm-less_deployment.md > > Hope that gives you a start! > Andrew > > > On Mon, Jul 21, 2014 at 7:55 AM, Spico Florin > wrote: > >> Hello! >> I have 4 GNU Linux mach

Configuring a storm cluster with wirbelsturm

2014-07-21 Thread Spico Florin
Hello! I have 4 GNU Linux machines (not in a cloud but in our own intranet) and I would lie to use wirbelsturm to install a Storm cluster. From the documentation I could not understand if is possible to use wirbelsturm for configuring Storm when you have machines at your own disposition. Is it po

Re: Storm hostnames and AWS

2014-07-15 Thread Spico Florin
Hello! I'm interesting in this subject too. Can you please tell me if you solved or not? If yes, can you share the solution? Thanks. Florin On Tue, Jul 8, 2014 at 2:56 PM, Kristian Øllegaard < krist...@livesystems.info> wrote: > Hi everyone, > > I’m currently setting up a storm cluster on AW

Re: Curiosity

2014-04-10 Thread Spico Florin
Hello! If you'd like to deploy your storm topology on Amazon cloud you can use the https://github.com/KasperMadsen/storm-deploy-alternative. Kasper has helped me a lot with the installation. I try it and it works fine with this configuration: cluster: - t1.micro {MASTER,UI} - t1.micro {WO

Which AMQP Spout are you using?

2014-04-07 Thread Spico Florin
Hello! I would like to consume messages from the RabbitMQ using a an AMQP Spout. Can you recommend me some good one (meaning tested also on the production)? I look forward for your answers. Best regards, Florin

Re: Can be storm-deploy script used for deploying storm 0.9.x version on Amazon?

2014-03-28 Thread Spico Florin
blog post about it: >> >> http://www.michael-noll.com/blog/2014/03/17/wirbelsturm-one-click-deploy-storm-kafka-clusters-with-vagrant-puppet >> >> Best, >> Marc >> >> On Thu, Mar 27, 2014 at 10:19:24AM +0200, Spico Florin wrote: >> > Hello, Sasi! &g

Re: Can be storm-deploy script used for deploying storm 0.9.x version on Amazon?

2014-03-27 Thread Spico Florin
> > :private-key-path > "/home/ubuntu/.ssh/id_rsa" > > :public-key-path > "/home/ubuntu/.ssh/id_rsa.pub"} > > :aws-user-id "} > >

Can be storm-deploy script used for deploying storm 0.9.x version on Amazon?

2014-03-26 Thread Spico Florin
Hello! I would like to know what changes should be applied to the storm-deploy script (https://github.com/nathanmarz/storm-deploy) in order to install it on Amazon? Thank you in advance. Regards, Florin

Re: Amazon Services that should be used for deploying storm Amazon Cloud

2014-03-03 Thread Spico Florin
.safaribooksonline.com/2013/12/27/storm-deploy-amazon-ec2/ > > What specific services from amazon you use will depend on your > requirements for your topologies. So far all we have used is EC2 and Cloud > Watch. > > S > > > > > > > On Mon, Mar 3, 2014 at 2:33

Using CEP Esper in Storm Bolt and scaling out/rebalance topology

2014-03-03 Thread Spico Florin
Hello! I would like to use Esper in a Storm Bolt. In my use case I have to scale out the number of machines that are processing the incoming data (sensors data). The basic data model sent by sensor is : {sensorId, value, geoLocation} In order to keep data consistent in the Esper Bolt, the Esp

Amazon Services that should be used for deploying storm Amazon Cloud

2014-03-03 Thread Spico Florin
Hello! I would like to deploy a storm application on Amazon Cloud. Therefore, I would like to know what Amazon services should I use in order to deploy? For example: Amazon Elastic Cloud, Amazon S3, Amazon CloudWatch? If somebody has already deployed a Storm on Amazon, can you please tell me wha

Re: [RELEASE] Apache Storm 0.9.1-incubating released

2014-02-26 Thread Spico Florin
Hello, Padma! You can create a storm cluster on Windows with one node as described here: http://ptgoetz.github.io/blog/2013/12/18/running-apache-storm-on-windows/ I could set up following the instructions from this article. I hope that will help you also. Regards,\ Florin On Wed, Feb 26, 2014

Is ZeroMQ dependency needed in the storm-deploy cloud script when using Netty (storm 0.9.1)?

2014-02-20 Thread Spico Florin
Hello! In the script used for deploying storm to cloud( https://github.com/nathanmarz/storm-deploy ) there is a dependency on ZeroMQ. If I would like to use the latest version of Storm ( https://github.com/apache/incubator-storm) with Netty support, I think this dependency should be removed. Am

Re: Number of executors < number of tasks?

2014-02-18 Thread Spico Florin
Hello! Unfortunately you cannot change the number of tasks when doing rebalancing. Considering the following scenario: 1. A storm topology with maximum parallelism of 10 tasks. 2. Your storm topology is deployed on cloud 3. Your application should scale horizontally depending on the incoming vol

Faulty example for rebalancing in the wiki documentation about understanding storm paralellism

2014-02-05 Thread Spico Florin
Hello! In the wiki documentation regarding the storm parallelism https://github.com/nathanmarz/storm/wiki/Understanding-the-parallelism-of-a-Storm-topology the given example command storm rebalance mytopology -n 5 -e blue-spout=3 -e yellow-bolt=10 will not have any effect in terms of changing

Fwd: Rebalance topology throws java.lang.StringIndexOutOfBoundsException: String index out of range: -1

2014-02-05 Thread Spico Florin
to support rebalancing for multiple components. I look forward for your opinions. Regards, Florin ------ Forwarded message -- From: Spico Florin Date: Tue, Feb 4, 2014 at 11:10 AM Subject: Rebalance topology throws java.lang.StringIndexOutOfBoundsException: String index out

Rebalance topology throws java.lang.StringIndexOutOfBoundsException: String index out of range: -1

2014-02-04 Thread Spico Florin
Hello! I'm using the Taylor Goetz's storm version pointed out by article http://ptgoetz.github.io/blog/2013/12/18/running-apache-storm-on-windows/ and located at: https://github.com/ptgoetz/incubator-storm/tree/windows-test I have succeeded to install everything on my computer (running windows 7

Proportional number of spouts/bolts with the number of workers

2014-01-23 Thread Spico Florin
Hello! Supposing that you have the following topology spout spout (sends tuple to )bolt1 bolt1 (sends tuple to) bolt2 and the following settings: 3 process workers spout,parallel hint=1 bolt1, parallel hint=2 bolt2,parallel hint=3 my questions are: -Am I right with the following (possible) distribu

Re: Question about the tuple definition

2014-01-21 Thread Spico Florin
Hello, Richard! Thank you very much for your response. I was thinking to the same explication but I tried to double check. Regards, Florin On Tue, Jan 21, 2014 at 4:20 PM, Richards Peter wrote: > Hi, > > To keep it simple I would say that a tuple is a kind of key-value pair. In > storm the l

Re: Rebalance and Distributed load on the workers

2014-01-21 Thread Spico Florin
ing lost. > > Part of rebalancing the topology involves killing existing workers. > > So the short answer is that Storm doesn't do anything for you to try to > keep your bolts consistent. > > -- Kyle > > From: Spico Florin > Reply-To: "user@storm.incubato

Question about the tuple definition

2014-01-21 Thread Spico Florin
Hello! According to the definition of the tuple as stated in http://nathanmarz.github.io/storm/doc/backtype/storm/tuple/Tuple.html " tuple is a named list of values, where each value can be any type". As I'm newbie in Storm my question is simple: What is giving the name of the tuple? The stream

Rebalance and Distributed load on the workers

2014-01-14 Thread Spico Florin
Hello! > I'm a newbie in Storm and I have some questions regarding the scaling > the number of workers/executors among clusters and how data is correctly > handled between them. >In the case of the WordCountTopology the WordCount Bolt is used to > count the words from a text. From my observat

[no subject]

2014-01-14 Thread Spico Florin
Hello! I'm a newbie in Storm and I have some questions regarding the scaling the number of workers/executors among clusters and how data is correctly handled between them. In the case of the WordCountTopology the WordCount Bolt is used to count the words from a text. From my observations and u

Re: Workers elasticity

2014-01-06 Thread Spico Florin
ee > https://github.com/nathanmarz/storm/blob/master/conf/defaults.yaml#L77 > > Michael Rose (@Xorlev <https://twitter.com/xorlev>) > Senior Platform Engineer, FullContact <http://www.fullcontact.com/> > mich...@fullcontact.com > > > On Mon, Jan 6, 2014 at 11

Workers elasticity

2014-01-06 Thread Spico Florin
Hello! I'm newbie to storm and also to Amazon Cloud. I have the following scenario: 1. I have topology that runs on 3 workers on EC2. 2. Due to the increasing load, EC2 intantiates 2 new instances and I have to rebalance to 5 workers. 3. After the resource demand, EC2 released 2 instances a

Re: Deploying storm on cloud IAAS support

2013-12-26 Thread Spico Florin
gt; >> Florin, >> >> Have you looked at Pallet? http://palletops.com/ >> >> -Roger >> >> >> >> >> >> *From:* Spico Florin [mailto:spicoflo...@gmail.com] >> *Sent:* Tuesday, December 24, 2013 12:51 AM >> *To:* user@sto

Deploying storm on cloud IAAS support

2013-12-23 Thread Spico Florin
Hello! I've seen on the storm tutorials that there is only one example of deployment of Storm in the Amazon Cloud. My questions are: 1.Is there any support for other cloud IAAS providers? If yes, which are these providers? 2. It is planned to support any other IAAS? I'll look forward for you

Fwd: Deploying storm on cloud IAAS support

2013-12-17 Thread Spico Florin
Hello! I've seen on the storm tutorials that there is only one example of deployment of Storm in the Amazon Cloud. My questions are: 1.Is there any support for other cloud IAAS providers? If yes, who are they? 2. It is planned to support any other IAAS? I'll look forward for your answers.