Spark SQL running totals

Stefan Panayotov Thu, 15 Oct 2015 10:49:58 -0700

Hi,
 
I need help with Spark SQL. I need to achieve something like the following.
If I have data like:
 
col_1  col_2
1         10
2         30
3         15
4         20
5         25
 
I need to get col_3 to be the running total of the sum of the previous rows of 
col_2, e.g.
 
col_1  col_2  col_3
1         10        10
2         30        40
3         15        55
4         20        75
5         25        100
 
Is there a way to achieve this in Spark SQL or maybe with Data frame 
transformations?
 
Thanks in advance,



Stefan Panayotov, PhD 
Home: 610-355-0919 
Cell: 610-517-5586 
email: spanayo...@msn.com 
spanayo...@outlook.com 
spanayo...@comcast.net

Spark SQL running totals

Reply via email to