[ 
https://issues.apache.org/jira/browse/STORM-1187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15011136#comment-15011136
 ] 

Robert Joseph Evans commented on STORM-1187:
--------------------------------------------

Yes, event time is just another dimension that we can put into buckets and 
aggregate on, but the hard part is knowing when you can consider a bucket 
complete, and when you can consider a bucket completely dead so that no more 
late data will be accepted.  A lot of this depends on your use case and where 
the aggregated data is stored.  I really think the Cloud Dataflow API captures 
most of what an API like this should support.  If we could make it cleaner that 
would be great, but what we are doing is complex enough that I am not sure we 
really can make it that much cleaner.
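
To make the "complete" versus "dead" distinction concrete, here is a rough 
sketch in Python.  It is not Storm or Cloud Dataflow code.  The class name, the 
parameters (window_ms, lag_ms, allowed_lateness_ms) and the rule "watermark = 
largest event time seen minus a fixed lag" are all assumptions made up for 
illustration.  A bucket is emitted once the watermark passes its end, it still 
accepts late data (and re-emits an updated count) until the allowed lateness 
has also passed, and after that it is dropped for good.

```python
from collections import defaultdict


class EventTimeWindows:
    """Hypothetical tumbling event-time windows that count events."""

    def __init__(self, window_ms, lag_ms, allowed_lateness_ms):
        self.window_ms = window_ms
        self.lag_ms = lag_ms                      # assumed max out-of-orderness
        self.allowed_lateness_ms = allowed_lateness_ms
        self.counts = defaultdict(int)            # window start -> event count
        self.complete = set()                     # windows emitted at least once
        self.max_event_ts = 0
        self.emitted = []                         # (window start, count) results
        self.dropped = []                         # events that hit a dead bucket

    def watermark(self):
        # Assumption: nothing older than (max seen - lag) is expected any more.
        return self.max_event_ts - self.lag_ms

    def on_event(self, event_ts):
        start = event_ts - event_ts % self.window_ms
        end = start + self.window_ms
        if end + self.allowed_lateness_ms <= self.watermark():
            # The bucket is dead: too late to be accepted.
            self.dropped.append(event_ts)
            return
        self.counts[start] += 1
        if start in self.complete:
            # Late but still accepted: re-emit the updated aggregate.
            self.emitted.append((start, self.counts[start]))
        self.max_event_ts = max(self.max_event_ts, event_ts)
        self._advance()

    def _advance(self):
        wm = self.watermark()
        for start in sorted(self.counts):         # sorted() copies the keys
            end = start + self.window_ms
            if end <= wm and start not in self.complete:
                # The bucket is complete: emit its first result.
                self.complete.add(start)
                self.emitted.append((start, self.counts[start]))
            if end + self.allowed_lateness_ms <= wm:
                # The bucket is dead: free its state.
                del self.counts[start]
                self.complete.discard(start)


# 10 ms windows, 5 ms lag, 10 ms of allowed lateness.
w = EventTimeWindows(window_ms=10, lag_ms=5, allowed_lateness_ms=10)
for ts in [1, 3, 12, 17, 4, 40, 2]:
    w.on_event(ts)
print(w.emitted)
print(w.dropped)
```

In this run the event at 4 arrives after window [0, 10) was first emitted but 
is still accepted, while the event at 2 arrives after the window is dead and is 
dropped.  Where the aggregates are stored decides whether a re-emitted count 
can overwrite the earlier one, which is part of why the right settings depend 
on the use case.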

> Support for late and out of order events in time based windows
> --------------------------------------------------------------
>
>                 Key: STORM-1187
>                 URL: https://issues.apache.org/jira/browse/STORM-1187
>             Project: Apache Storm
>          Issue Type: Sub-task
>            Reporter: Arun Mahadevan
>            Assignee: Arun Mahadevan
>
> Right now the time based windows use the timestamp when the tuple is 
> received by the bolt. 
> However, there are use cases where the tuples can be processed based on the 
> time when they are actually generated rather than the time when they are 
> received. So we need to add support for processing events with a time lag and 
> also have some way to specify and read tuple timestamps.


