I think Kant meant time-windowing functions rather than analytic window functions such as lead. In raw SQL you can use

`window(TIMESTAMP, '24 hours', '24 hours')`
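
For the query at the bottom of this thread, a rough sketch of the full statement might look like the one below. It assumes the streaming Dataset has been registered as a temporary view (for example with createOrReplaceTempView); the view name `events` and the alias `time_window` are made up for illustration, so treat this as an untested sketch rather than a drop-in answer.

```sql
-- Assumption: `events` is a hypothetical temp view over the streaming Dataset.
-- The column name is backtick-quoted because TIMESTAMP is also a type keyword.
SELECT
  window(`TIMESTAMP`, '24 hours', '24 hours') AS time_window,  -- windowDuration, slideDuration
  hourlyPay,
  sum(hourlyPay) AS total
FROM events
WHERE name = 'john'
GROUP BY
  window(`TIMESTAMP`, '24 hours', '24 hours'),
  hourlyPay
```

Because the window and slide durations are equal here, this is a tumbling window, so the third argument can be dropped and the call written with just the column and '24 hours'.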

On Tue, Jul 25, 2017 at 9:26 AM, Keith Chapman <keithgchap...@gmail.com>
wrote:

> Here is an example of a window lead function,
>
> select *, lead(someColumn1) over ( partition by someColumn2 order by
> someColumn13 asc nulls first) as someName  from someTable
>
> Regards,
> Keith.
>
> http://keith-chapman.com
>
> On Tue, Jul 25, 2017 at 9:15 AM, kant kodali <kanth...@gmail.com> wrote:
>
>> How do I specify windowInterval and slideInterval using a raw SQL string?
>>
>> On Tue, Jul 25, 2017 at 8:52 AM, Keith Chapman <keithgchap...@gmail.com>
>> wrote:
>>
>>> You could issue a raw SQL query to Spark; there is no particular
>>> advantage or disadvantage in doing so. Spark builds a logical plan
>>> from the raw SQL (or DSL) and optimizes that. Ideally you would end up
>>> with the same physical plan, irrespective of whether it was written in
>>> raw SQL or the DSL.
>>>
>>> Regards,
>>> Keith.
>>>
>>> http://keith-chapman.com
>>>
>>> On Tue, Jul 25, 2017 at 12:50 AM, kant kodali <kanth...@gmail.com>
>>> wrote:
>>>
>>>> Hi all,
>>>>
>>>> I just want to run a Spark Structured Streaming job similar to this:
>>>>
>>>> DS.filter(col("name").equalTo("john"))
>>>>         .groupBy(functions.window(df1.col("TIMESTAMP"), "24 hours", "24 hours"),
>>>>                  df1.col("hourlyPay"))
>>>>         .agg(sum("hourlyPay").as("total"));
>>>>
>>>>
>>>> I am wondering if I can express the above query as a raw SQL string.
>>>>
>>>> If so, what would that look like, and what are some of the disadvantages of
>>>> using a raw SQL query vs. the Spark DSL?
>>>>
>>>>
>>>> Thanks!
>>>>
>>>>
>>>
>>
>
