Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage

Artemis User Mon, 12 Sep 2022 09:46:10 -0700

The off-heap memory isn't subjected to GC. So the obvious reason isthat your have too many states to maintain in your streaming app, andthe GC couldn't keep up, and end up with resources but to die. Are youusing continues processing or microbatch in structured streaming? Youmay want to lower your incoming data rate and/or increase yourmicrobatch size so to lower the number of states to bepersisted/maintained...


On 9/11/22 10:59 AM, akshit marwah wrote:

Hi Team,
We are trying to shift from HDFS State Manager to Rocks DB StateManager, but while doing POC we realised it is using much moreoff-heap space than expected. Because of this, the executors getkilled with : *out of**physical memory exception.*
*
*
Could you please help in understanding, why is there a massiveincrease in off-heap space, and what can we do about it?
We are using, SPARK 3.2.1 with 1 executor and 1 executor core, tounderstand the memory requirements -
1. Rocks DB Run - took 3.5 GB heap and 11.5 GB Res Memory
2. Hdfs State Manager - took 5 GB heap and 10 GB Res Memory.

Thanks,
Akshit


Thanks and regards
- Akshit Marwah

Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage

Reply via email to