Re: How to execute non-timestamp-based aggregations in spark structured streaming?

2019-04-22 Thread Tathagata Das
SQL windows with the 'over' syntax does not work in Structured Streaming. It is very hard to incrementalize that in the general case. Hence non-time windows are not supported. On Sat, Apr 20, 2019, 2:16 PM Stephen Boesch wrote: > Consider the following *intended* sql: > > select row_number() >

How to execute non-timestamp-based aggregations in spark structured streaming?

2019-04-20 Thread Stephen Boesch
Consider the following *intended* sql: select row_number() over (partition by Origin order by OnTimeDepPct desc) OnTimeDepRank,* from flights This will *not* work in *structured streaming* : The culprit is: partition by Origin The requirement is to use a timestamp-typed field such as