Burak Yavuz created SPARK-21153:
-----------------------------------

             Summary: Time windowing for tumbling windows can use a project instead of expand + filter
                 Key: SPARK-21153
                 URL: https://issues.apache.org/jira/browse/SPARK-21153
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.1.1
            Reporter: Burak Yavuz
            Assignee: Burak Yavuz


Time windowing in Spark currently performs an Expand + Filter because, in the 
general case, there is no way to guarantee how many windows a timestamp will 
fall into. For tumbling windows, however, a record is guaranteed to fall into 
exactly one bucket. In that case, doubling the number of records with Expand 
is wasteful; a simple Projection can compute the single window instead.
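
As a rough illustration of the difference, here is a minimal plain-Python 
sketch of the two strategies. It is not Spark's actual code: the function 
names, the integer timestamps, and the exact candidate-window arithmetic are 
assumptions made for the example. The first function generates candidate 
windows and filters them (the Expand + Filter shape); the second computes the 
one tumbling window directly (the Projection shape).

```python
def ceil_div(a, b):
    """Integer ceiling division (b > 0), avoiding float rounding."""
    return -(-a // b)


def windows_expand_filter(ts, window, slide, start=0):
    """Hypothetical Expand + Filter: emit candidate windows, keep matches.

    All arguments are integers in the same time unit (e.g. seconds).
    """
    max_overlap = ceil_div(window, slide)
    window_id = ceil_div(ts - start, slide)
    # "Expand": one candidate row per possible overlapping window.
    candidates = []
    for i in range(max_overlap + 1):
        w_start = (window_id + i - max_overlap) * slide + start
        candidates.append((w_start, w_start + window))
    # "Filter": keep only windows that actually contain the timestamp.
    return [(s, e) for (s, e) in candidates if s <= ts < e]


def tumbling_window_project(ts, window, start=0):
    """Hypothetical Projection: a tumbling window has exactly one match."""
    w_start = ts - (ts - start) % window
    return (w_start, w_start + window)


if __name__ == "__main__":
    # Tumbling window (slide == window): two candidates, one survivor.
    print(windows_expand_filter(12, window=10, slide=10))
    print(tumbling_window_project(12, window=10))
    # Sliding window: a timestamp can land in more than one window.
    print(windows_expand_filter(12, window=10, slide=5))
```

In this sketch the tumbling case builds two candidate windows per record and 
then discards one of them, which is the doubling described above. The 
projection variant reaches the same window with one arithmetic expression per 
record.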


