We do not measure how much data we are spilling to disk.

On 09.12.2016 14:43, Fabian Hueske wrote:
Hi,

the heap memory usage should be available via Flink's metrics system.
I'm not sure whether that also captures spilled data. Chesnay (in CC) should know that.

If the spilled data is not available as a metric, you can try to write a small script that monitors the directories to which Flink spills (config parameter: taskmanager.tmp.dirs [1]). The script would repeatedly list all files and keep the maximum size seen for each file (files are deleted once they are no longer used). This is not super precise but might be good enough.
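
A rough sketch of such a script in Python, in case it is useful. The function names (scan_once, monitor, total_bytes) and the polling interval are made up for illustration and are not part of Flink; the directories you pass in are assumed to be the ones configured under taskmanager.tmp.dirs on the machine where the TaskManager runs.

```python
import os
import time


def scan_once(dirs, max_sizes):
    """Walk each directory once and keep the largest size seen per file."""
    for root_dir in dirs:
        for dirpath, _dirnames, filenames in os.walk(root_dir):
            for name in filenames:
                path = os.path.join(dirpath, name)
                try:
                    size = os.path.getsize(path)
                except OSError:
                    # File was deleted between listing and stat; skip it.
                    continue
                # Remember the maximum, so deleted or truncated files
                # still count with their largest observed size.
                max_sizes[path] = max(size, max_sizes.get(path, 0))
    return max_sizes


def monitor(dirs, interval_s=0.5, rounds=None):
    """Poll `dirs` every `interval_s` seconds.

    Stops after `rounds` scans, or on Ctrl-C if `rounds` is None.
    Returns a dict mapping file path to max observed size in bytes.
    """
    max_sizes = {}
    done = 0
    try:
        while rounds is None or done < rounds:
            scan_once(dirs, max_sizes)
            done += 1
            time.sleep(interval_s)
    except KeyboardInterrupt:
        pass
    return max_sizes


def total_bytes(max_sizes):
    """Sum of the per-file maxima (an estimate of data written, in bytes)."""
    return sum(max_sizes.values())
```

You would start monitor(...) with your spill directories before submitting the job, stop it with Ctrl-C once the job has finished, and pass the result to total_bytes(...). Note that the sum of per-file maxima estimates the total amount spilled, not the peak disk usage at any one moment, and files that are created and deleted between two polls are missed entirely.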

Hope this helps,
Fabian

[1] https://ci.apache.org/projects/flink/flink-docs-release-1.1/setup/config.html#jobmanager-amp-taskmanager

2016-12-09 14:12 GMT+01:00 otherwise777 <wou...@onzichtbaar.net>:

    Currently I'm doing some analysis of some algorithms that I use in Flink.
    I'm interested in the space and time it takes to execute them. For the
    time I used getNetRuntime() in the execution environment, but I have no
    idea how to analyse the amount of space an algorithm uses.
    Space can mean different things here, like heap space, disk space,
    overall memory or allocated memory. I would like to analyze some of these.



    --
    View this message in context:
    http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/How-to-analyze-space-usage-of-Flink-algorithms-tp10555.html
    Sent from the Apache Flink User Mailing List archive at Nabble.com.


