[PROPOSAL] Samza Proposal

2013-07-23 Thread Chris Ricco
Hey All, Sending along an incubator proposal for Samza. Thanks! Chris https://wiki.apache.org/incubator/SamzaProposal == Abstract == Samza is a stream processing system for running continuous computation on infinite streams of data. == Proposal ==

Re: [PROPOSAL] Samza Proposal

2013-07-23 Thread Henry Saputra
Looks like this is similar to S4 (http://incubator.apache.org/s4/) which allow stream and real time data processing via DAG? - Henry On Tue, Jul 23, 2013 at 10:47 AM, Chris Ricco wrote: > Hey All, > > Sending along an incubator proposal for Samza. > > Thanks! > Chris > > https://wiki.apache.or

Re: [PROPOSAL] Samza Proposal

2013-07-23 Thread Debo Dutta (dedutta)
Also add storm to the mix. Storm also allows you to do back edges. debo On 7/23/13 6:48 PM, "Henry Saputra" wrote: >Looks like this is similar to S4 (http://incubator.apache.org/s4/) which >allow stream and real time data processing via DAG? > > >- Henry > > >On Tue, Jul 23, 2013 at 10:47 AM, C

Re: [PROPOSAL] Samza Proposal

2013-07-23 Thread Chris Riccomini
Hey Henry and Debo, Thanks for calling this out. Samza's feature set includes: - *Simpe API:* Unlike most low-level messaging system APIs, Samza provides a very simple call-back based "process message" API that should be familiar to anyone that's used Map/Reduce. - *Managed state:* Sa