Re: Set TimeOut and continue with other tasks

2019-07-10 Thread Wei Chen
I am currently trying to use Future Await to set a timeout inside the map-reduce. However, the tasks now fail instead of stuck, even if I have a Try Match to catch it. Doesn't anyone have an idea why? The code is like ```Scala files.map { file => Try { def tmpFunc(): Boolean = { FILE

Re: Set TimeOut and continue with other tasks

2019-07-10 Thread Gourav Sengupta
Is there a way you can identify those patterns in a file or in its name and then just tackle them in separate jobs? I use the function input_file_name() to find the name of input file of each record and then filter out certain files. Regards, Gourav On Wed, Jul 10, 2019 at 6:47 AM Wei Chen

Set TimeOut and continue with other tasks

2019-07-09 Thread Wei Chen
Hello All, I am using spark to process some files parallelly. While most files are able to be processed within 3 seconds, it is possible that we stuck on 1 or 2 files as they will never finish (or will take more than 48 hours). Since it is a 3rd party file conversion tool, we are not able to