Re: Flink Session TM Logs

2020-07-26 Thread Robert Metzger
Hi Richard, thanks for forwarding my answer to the list! I fear that Flink does not have a built-in solution for serving the logs of a finished TaskManager while a YARN session is still running. I agree with Yangze that you probably have to rely on an external logging service, such as ElasticSear

Re: Flink Session TM Logs

2020-07-26 Thread Yangze Guo
Hi Richard, Until the session has terminated, you cannot fetch the logs of terminated TMs. One possible solution is to use the log4j2 appenders[1]. Flink uses log4j2 by default as of the latest release, 1.11. [1] https://logging.apache.org/log4j/2.x/manual/appenders.html Best, Yangze

Unable to deduce RocksDB api calls in streaming.

2020-07-26 Thread Aviral Srivastava
Hi all! I want to profile the time taken by snapshot calls to RocksDB when using Flink in streaming mode. I have forked the Flink core repo, added the fraud detection example, and configured the state backend and checkpointing. The program is running successfully. I have also been able to im

Re: Is outputting from components other than sinks or side outputs a no-no ?

2020-07-26 Thread David Anderson
Every job is required to have a sink, but there's no requirement that all output be done via sinks. It's not uncommon, and doesn't have to cause problems, to have other operators that do I/O. What can be problematic, however, is doing blocking I/O. While your user function is blocked, the function
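
To illustrate the point about blocking I/O, here is a minimal sketch in plain Java; it is not Flink code. It assumes a hypothetical `slowLookup` helper that stands in for a database call, and a made-up rule that records starting with "bad" are rejected. All lookups are started on a thread pool before any result is awaited, so their latencies overlap instead of adding up. Inside a real Flink job, the Async I/O operator is the mechanism intended for this kind of external call.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class NonBlockingLookupSketch {

    // Hypothetical stand-in for a database call: sleeps to simulate latency.
    // The "bad" prefix rule is an assumption made only for this example.
    static boolean slowLookup(String record) {
        try {
            Thread.sleep(50);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return !record.startsWith("bad");
    }

    // Starts every lookup first, then collects the results in input order.
    static List<String> filterWithAsyncLookups(List<String> records, int poolSize) {
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        try {
            List<CompletableFuture<Boolean>> pending = new ArrayList<>();
            for (String record : records) {
                pending.add(CompletableFuture.supplyAsync(() -> slowLookup(record), pool));
            }
            List<String> kept = new ArrayList<>();
            for (int i = 0; i < records.size(); i++) {
                // join() waits only for this lookup; the others keep running.
                if (pending.get(i).join()) {
                    kept.add(records.get(i));
                }
            }
            return kept;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        // prints [a, c]
        System.out.println(filterWithAsyncLookups(Arrays.asList("a", "bad-1", "c"), 4));
    }
}
```

With three records and a pool of four threads, the three simulated 50 ms lookups run concurrently, whereas calling `slowLookup` once per record in sequence would take roughly three times as long.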

Is outputting from components other than sinks or side outputs a no-no ?

2020-07-26 Thread Tom Fennelly
Hi. What are the negative side effects of (for example) a filter function occasionally making a call out to a DB? Is this a big no-no, and should all output go through sinks and side outputs, no exceptions? Regards, Tom.