Re: Sliding Window Memory use

2016-09-26 Thread Reynold Xin
I ran it on Databricks community edition which was a local[8] cluster with 6GB of RAM. It ran fine. That said, looking at the plan, we can definitely simplify this quite a bit. We had a new Window physical execution node for each window expression, when we could have collapsed all of them into a s

Sliding Window Memory use

2016-09-26 Thread Jeremy Davis
Hi, I posted this to users, but didn’t get any responses. I just wanted to highlight what seems like excessive memory use when using sliding windows. I have attached a test case where starting with certainly less than 1MB of data I can OOM a 10G heap. Regards, -JD -- import java