Re: [cache eviction] partition recomputation in big lineage RDDs

2015-10-01 Thread Hemant Bhanawat
As I understand it, you don't need to merge your historical data RDD with your RDD_inc; what you need is to merge the computation results of your historical RDD with RDD_inc, and so on. IMO, you should consider having an external row store to hold your computations. I say this because you need
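
A minimal sketch of the idea in the reply above, in plain Python rather than Spark: the results computed from the historical data are kept in a keyed store, and each increment is computed on its own and folded in. `ResultStore`, `compute`, and the per-key sum are hypothetical stand-ins chosen for illustration; the thread does not say which row store or which computation is involved.

```python
from collections import defaultdict


class ResultStore:
    """Hypothetical stand-in for an external row store keyed by row id."""

    def __init__(self):
        self._rows = defaultdict(int)

    def merge(self, increment_results):
        # Fold each incremental result into the value already stored for its key.
        for key, value in increment_results.items():
            self._rows[key] += value

    def get(self, key):
        # Unknown keys read as 0 and are not created by the lookup.
        return self._rows.get(key, 0)


def compute(records):
    """Assumed example computation: sum of amounts per key."""
    totals = defaultdict(int)
    for key, amount in records:
        totals[key] += amount
    return dict(totals)


store = ResultStore()
# Historical data is computed once and its results are stored.
store.merge(compute([("a", 1), ("b", 2), ("a", 3)]))
# Later, only the increment (the role of RDD_inc) is computed and merged.
store.merge(compute([("a", 10), ("c", 5)]))
```

This only works as written when the computation can be merged key by key (as a sum can); other computations would need a different merge step.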

Re: [cache eviction] partition recomputation in big lineage RDDs

2015-09-30 Thread Nicolae Marasoiu
Hi, an equivalent question would be: can the memory cache be selectively evicted from within a component running in the driver? I know this breaks some abstraction/encapsulation, but clearly I need to evict part of the cache so that it is reloaded with newer values from the DB. Because what I
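
The behaviour being asked for can be sketched outside Spark as a cache keyed by partition id, where evicted partitions are reloaded from the source on their next access. This is plain Python and not a Spark API: `PartitionCache`, `loader`, and the `db` dictionary are hypothetical names used only to illustrate the idea. In Spark itself, `unpersist()` on an RDD drops that RDD's cached data as a whole; the sketch below does not use it.

```python
class PartitionCache:
    """Hypothetical cache keyed by partition id (illustration only, not Spark)."""

    def __init__(self, loader):
        self._loader = loader   # function: partition id -> list of rows
        self._cached = {}
        self.loads = 0          # counts how often the source was actually read

    def get(self, pid):
        # Load a partition only if it is not cached (or was evicted).
        if pid not in self._cached:
            self._cached[pid] = self._loader(pid)
            self.loads += 1
        return self._cached[pid]

    def evict(self, pids):
        # Drop only the named partitions; all others stay cached.
        for pid in pids:
            self._cached.pop(pid, None)


db = {0: ["a1"], 1: ["b1"]}                 # stand-in for the database
cache = PartitionCache(lambda pid: list(db[pid]))

cache.get(0)
cache.get(1)
db[1] = ["b2"]                              # partition 1 changes in the DB
stale = cache.get(1)                        # still served from the cache
cache.evict([1])                            # selective eviction
fresh = cache.get(1)                        # reloaded with the newer values
```

Here partition 0 is never reloaded, while partition 1 is read from the source a second time only after it has been evicted.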