Building large-scale analytics platform with Storm, Kafka and Cassandra - Meetup in NYC

2013-11-08 Thread Eugene
Please join the NYC Storm user group for a technical talk on how one NYC company, Integral Ad Science, built a large-scale platform using Storm. Details are here: http://www.meetup.com/New-York-City-Storm-User-Group/events/149806962/ Thanks -- Eugene Dvorkin Software Engineer WebMD email: edvor

Re: Spring bolts

2013-12-25 Thread Eugene
config and static initializers happen at deploy time client side. > > On Dec 25, 2013 7:51 AM, "Michal Singer" wrote: > > Hi, I am trying to understand how to use beans in spring as bolts/spouts. > > If I have the definition in spring which is initialized once the bolt

Re: [Blog Post] Storm Metrics Howto

2014-01-06 Thread Eugene
Excellent article. Thanks. On Jan 5, 2014 10:58 AM, "Jason Trost" wrote: > You might be interested in this: > > Storm Metrics Howto > http://www.bigdata-cookbook.com/post/72320512609/storm-metrics-how-to > > --Jason > >

Re: Is Storm a right tool for processing of thousands of small tasks?

2014-03-19 Thread Eugene
n Wed, Mar 19, 2014 at 5:38 PM, Eugene Dzhurinsky wrote: > On Wed, Mar 19, 2014 at 05:14:28PM -0400, P. Taylor Goetz wrote: > > Have you considered using DRPC [1]? > > > > [1] http://storm.incubator.apache.org/documentation/Distributed-RPC.html > > From what I've read s

Re: Storm with Drools Complex Event Processing rules

2014-04-24 Thread Eugene
events) and how to call CEP rules on those event to raise an alert? > > I checked following thread also : > https://groups.google.com/forum/#!topic/storm-user/VqBPnheDtQ8 > > -- > Thanks, > Nishu Tayal > > -- Eugene Dvorkin Software Engineer New York City Storm User Group -

Re: Storm with Drools Complex Event Processing rules

2014-04-25 Thread Eugene
hould pay - eBook versions are reasonably priced for any country and industry. Thanks. On Apr 24, 2014 2:33 PM, "Nishu" wrote: > Hi Eugene, > Thanks for the links. These links are quite useful to go forward in this > direction. > As these books are paid if you have copy of

Re: Is there a good reference book for storm users?

2014-05-03 Thread Eugene
irage > > PhD Student | Research Assistant > School of Informatics and Computing | Data to Insight Center > Indiana University > > twitter: milindalakmal > skype: milinda.pathirage > blog: http://milinda.pathirage.org > -- Eugene Dvorkin Software Engineer New Y

Re: [VOTE] Storm Logo Contest - Final Round

2014-06-11 Thread Eugene
ingle vote. A vote consists of 5 points that can > be divided among multiple entries. To vote, list the entry number, followed > by the number of points assigned. For example: > > #1 - 2 pts. > #2 - 1 pt. > #3 - 2 pts. > > Votes cast by PPMC members are considered binding, but voting is open to > anyone. In the event of a tie vote from the PPMC, votes from the community > will be used to break the tie. > > This vote will be open until Monday, June 16 11:59 PM UTC. > > - Taylor > > -- > With Regards, > Vinoth Kumar K > -- Eugene Dvorkin Software Engineer New York City Storm User Group - organizer WebMD email: edvor...@gmail.com eugenedvorkin.com

Re: Unit testing topologies

2014-07-04 Thread Eugene
t what your bolt is emitting. To test a topology end-to-end and the interaction between components, you have to write an integration test. Trigger an event and check what you get at the end. Storm Real-time Processing Cookbook has a couple of chapters about testing. Hope that helps. Eugene Dvorkin. NYC Storm UG Organize

Re: storm performance tunning

2014-07-05 Thread Eugene
Thanks Eugene Dvorkin On Sat, Jul 5, 2014 at 7:19 AM, wrote: > I've found some good advice reading these articles > > > http://www.michael-noll.com/blog/2013/06/21/understanding-storm-internal-message-buffers/ > > > http://www.michael-noll.com/blog/2012/10/16/

Re: The role of supervisor in Storm

2014-07-06 Thread Eugene
ops the > worker processes. But does it monitor them? how? Who monitors the > supervisor itself if it goes down? Launching something with "%java storm > something" is referred to as launching the item (nimbus) under supervision. > Can anything be launched like that? Is Sup

Re: The role of supervisor in Storm

2014-07-06 Thread Eugene
: > Thanks Eugene. > Building on this, it looks like one can monitor the DRPC server the same > way. Which is going to be very helpful to us. > ... checking out monit... > > -- > Date: Sun, 6 Jul 2014 22:12:32 -0400 > > Subject: Re: The role of

Trident topology UI show many processing tuples when in reality It shouldn't be any

2014-07-31 Thread Eugene
.put(Config.TOPOLOGY_TRIDENT_BATCH_EMIT_INTERVAL_MILLIS, 20); config.put(RichSpoutBatchExecutor.MAX_BATCH_SIZE_CONF, 500); StormSubmitter.submitTopology("OfflineRuleTopology", config, topology.build()); Does it have something to do with Kafka/Zookeeper? What am I missing here? If it is endlessly

location of OpaqueTridentKafkaSpout offset in Zookeeper

2014-08-07 Thread Eugene
r, kafkatest, testgroup, deltaindexer, search-content-loader, mytest] [zk: localhost:2181(CONNECTED) 2] ls /transactional [offlineEvents] Thanks Eugene.

Understanding Trident parallelism

2014-08-07 Thread Eugene
quite clear with Storm topology, but not so with Trident. So the question is how parallelism settings work in a cluster: should I use a partition operation to break the stream into bolts and across machines, and how do I increase the parallelism of a specific function? Thanks Eugene

Re: Storm processing in bursts

2014-09-07 Thread Eugene
going to be a pretty > inefficient way of using server resources. > > -TPP > -- Eugene Dvorkin Software Engineer New York City Storm User Group - organizer WebMD email: edvor...@gmail.com eugenedvorkin.com

Is Storm a right tool for processing of thousands of small tasks?

2014-03-19 Thread Eugene Dzhurinsky
nning forever, so I don't really see an easy way to "distinguish" the records of one task from the records of another. Should a new topology be started for each new task of a user? Thank you in advance! Links to any appropriate articles are very welcome :) --

Re: Is Storm a right tool for processing of thousands of small tasks?

2014-03-19 Thread Eugene Dzhurinsky
from spout. And it seems to be non-trivial if the topology is truly distributed (I could expect some messages after this "end" marker has been sent). Also, it seems that I cannot "attach" a new spout to an existing topology, so it looks like I have to start a new topology

Re: Is Storm a right tool for processing of thousands of small tasks?

2014-03-19 Thread Eugene Dzhurinsky
mark some task as "complete", so the sink could send the results back to the user. For example, the spout produced 5 messages and the 6th is "EOT", and that EOT message arrived at the sink while message 3 was still in progress. -- Eugene N Dzhurinsky

Re: Is Storm a right tool for processing of thousands of small tasks?

2014-03-19 Thread Eugene Dzhurinsky
ut was received in sink. Perhaps I could add some sort of "record number" (which I may know in advance) and process the completion event once the sink has received the expected number of records. -- Eugene N Dzhurinsky
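
The "record number" idea from the last message could be sketched roughly as below. This is only an illustration, not code from the thread and not part of the Storm API: the class name CompletionTracker, its methods, and the string task ids are all hypothetical. It assumes every record carries its task id and a record number, and that the total per task is known in advance. Record numbers are kept in a set, so a tuple that Storm replays is not counted twice and arrival order does not matter.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical helper for a sink bolt; not a Storm class.
class CompletionTracker {
    // Expected number of records per task, registered up front.
    private final Map<String, Integer> expected = new HashMap<>();
    // Distinct record numbers seen so far per task.
    private final Map<String, Set<Integer>> seen = new HashMap<>();

    // Declare how many records a task will produce.
    void expect(String taskId, int recordCount) {
        expected.put(taskId, recordCount);
    }

    // Register one received record. Returns true once all expected
    // records of the task have arrived, in any order. Duplicates
    // (for example, replayed tuples) are ignored by the set.
    boolean record(String taskId, int recordNumber) {
        Set<Integer> numbers = seen.computeIfAbsent(taskId, k -> new HashSet<>());
        numbers.add(recordNumber);
        Integer want = expected.get(taskId);
        return want != null && numbers.size() >= want;
    }
}
```

A sink could call expect once per task and then record for every incoming tuple, sending the results back to the user when record returns true. Note that this state lives in one JVM, so all records of a task would have to reach the same sink instance (for example, via a fields grouping on the task id).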