Tathagata Das created SPARK-1592:
------------------------------------

             Summary: Streaming input blocks were never removed from the 
BlockManagers
                 Key: SPARK-1592
                 URL: https://issues.apache.org/jira/browse/SPARK-1592
             Project: Spark
          Issue Type: Bug
            Reporter: Tathagata Das
            Assignee: Tathagata Das
            Priority: Blocker


The raw input data is stored as blocks in BlockManagers. Earlier they were 
cleared by cleaner ttl. Now since streaming does not require cleaner TTL to be 
set, the block would not get cleared. This increases up the Spark's memory 
usage, which is not even accounted and shown in the Spark storage UI. It may 
cause the data blocks to spill over to disk, which eventually slows down the 
receiving of data (persisting to memory become bottlenecked by writing to disk).





--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to