Folks,

I am developing a tool which works on individual entities (glaciers) and does a lot of operations on them. There are many tasks to run, one after the other, and each task follows the same interface:

def task_1(path_to_glacier_dir):
    # open file1 in path_to_glacier_dir
    # do stuff
    if dont_work:
        raise RuntimeError("didn't work")
    # write file2 in path_to_glacier_dir

This way, the tasks can be run in parallel very easily:

import multiprocessing as mp

pool = mp.Pool(4)

dirs = list_of_dirs  # one directory per glacier
pool.map(task_1, dirs, chunksize=1)
pool.map(task_2, dirs, chunksize=1)
pool.map(task_3, dirs, chunksize=1)

... and so forth. I tested the tool on about a hundred glaciers, but now it has to run for thousands of them. There are going to be errors, and some of them are even expected for particular outliers. What I would like the tool to do is, in case of an error, write the identifier of the problematic glacier somewhere, together with the error encountered and more info if possible. Because of multiprocessing I can't write to a shared file, so I thought that the individual processes should each write a unique "error file" in a dedicated directory.
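
To give an idea, each worker could do something like this when a task fails (just a sketch; the error directory and the id-from-path logic are made up):

import os
import traceback

ERRORS_DIR = '/path/to/errors'  # dedicated error directory, made-up path

def write_error_file(path_to_glacier_dir):
    # meant to be called from inside an 'except' block
    glacier_id = os.path.basename(path_to_glacier_dir)  # assuming the dir name is the glacier id
    err_file = os.path.join(ERRORS_DIR, glacier_id + '.ERROR')
    with open(err_file, 'w') as f:
        f.write('glacier: %s\n' % glacier_id)
        f.write(traceback.format_exc())

Since every glacier gets its own file, there is no concurrent write to worry about.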

What I don't know, however, is how to do this at minimal cost and in a generic way for all tasks. Also, task_2 should not be run if task_1 raised an error. Sometimes (for debugging), I'd rather keep the normal behavior of raising the error and stopping the program.

Do I have to wrap all tasks in a "try: except:" block? How do I switch between the two behaviors? All the solutions I could think of look quite ugly to me, and it seems that this is a general problem that someone cleverer than me has solved before ;-)
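
The least ugly thing I could come up with so far is a decorator along these lines (a rough sketch only; the RAISE_ON_ERROR flag and the file layout are invented, and here the error file simply lives in the glacier's own directory so that later tasks can check for it):

import functools
import os
import traceback

RAISE_ON_ERROR = False  # set to True while debugging to get the usual traceback

def entity_task(task_func):
    # generic wrapper applied to every task
    @functools.wraps(task_func)
    def wrapper(path_to_glacier_dir):
        err_file = os.path.join(path_to_glacier_dir, 'log.ERROR')
        if os.path.exists(err_file):
            # an earlier task already failed for this glacier: skip
            return
        try:
            return task_func(path_to_glacier_dir)
        except Exception:
            if RAISE_ON_ERROR:
                raise
            with open(err_file, 'a') as f:
                f.write('%s failed:\n%s\n'
                        % (task_func.__name__, traceback.format_exc()))
    return wrapper

@entity_task
def task_1(path_to_glacier_dir):
    pass  # open file1, do stuff, write file2

It works, but decorating every task and toggling a module-level flag feels clumsy.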

Thanks,

Fabien
