Apache Samza Meetup Announced (March 4 @6PM hosted at LinkedInĀ¹s campus in Mountain View CA)

2015-02-23 Thread Ed Yakabosky
Hi all - I would like to announce the first Bay Area Apache Samza Meetuphttp://www.meetup.com/Bay-Area-Samza-Meetup/events/220354853/ hosted at LinkedIn in Mountain View, CA on March 4, 2015 @6PM. We plan to host the event every 2-months to encourage knowledge sharing collaboration in

Re: Re-processing a la Kappa/Liquid

2015-02-23 Thread Jay Kreps
I find it useful to delineate two kinds of things 1. Mutations such as database table updates. These always have a key 2. Immutable events such as clicks, sales, orders, etc. The whole premise of compaction is that you have some redundant updates as in case (1). In order to have updates you have

Reprocessing and windowing

2015-02-23 Thread Geoffry Sumter
Hey everyone, I've been thinking about reprocessing http://samza.apache.org/learn/documentation/0.7.0/jobs/reprocessing.html when my job has windowed state http://samza.apache.org/learn/documentation/0.7.0/container/state-management.html#windowed-aggregation and I have a few questions. Context:

Re: Reprocessing and windowing

2015-02-23 Thread Roger Hoover
Hi Geoffry, You might find the Google Millwheel paper and recent talk relevant. That system supports windows based on event time as well as reprocessing. Sent from my iPhone On Feb 23, 2015, at 4:49 PM, Geoffry Sumter vit...@gmail.com wrote: Hey everyone, I've been thinking about

Re: Reprocessing and windowing

2015-02-23 Thread Yi Pan
Hey, Geoffry, We have started some work in SAMZA-552 to create a window operator API in samza, as part of effort to implement support for a high-level language. I will probably be able to have something to share in a few days and would love to get feedbacks regarding to the window operator.