xiarixiaoyao commented on pull request #2673: URL: https://github.com/apache/hudi/pull/2673#issuecomment-801556436
@rubenssoto spark will uncache rdd automaticly in a LRU fashion, so if your program has enough memory , there will be no problem. In my program hudi cached too large data in memory, program occur oom before spark automatic clean cached rdd. @vinothchandar no leak of sort occur, simply the spark automatic cleaning not keeping up . i test on my env even if we donnot use blocking=true , spark can uncache rdd in a very short time ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org