On Sun, Oct 09, 2011 at 01:33:58PM -0400, Satrajit Ghosh wrote:
>    has anybody run the permutation_test_score on a distributed engine like
>    SGE or PBS?

No.

>    does joblib support such distribution?

No. One the one hand I'd like to say: 'patch welcomed', on the other
hand, I know of no library implementing the distribution on such clusters
and stable, so we would have to integrate the low level SGE logic to
joblib. Looking at Nipype [1] it does not seems too bad. The hardest work
would be probably to create executable jobs that do the work.

I would be interested by such a feature in joblib, but only if it comes
with a small mocked implementation of the submission API to be able to
have a full test coverage. Indeed, I suspect that the code that will
create Python jobs on the fly will not be tiny.

In the same alley, I think that it might be interesting if the 'n_jobs'
can actually accept not only a number, but also optionaly a dispatcher
object implementing pretty much a parallel mapper interface (to be
defined). This would enable people to write their own dispatcher without
having to patch joblib. Indeed, joblib cannot grow to account for all the
possible dispatching mechanism, as I am worried that it will be code that
will be little tested and eventually bitrot.

Cheers,

Gaƫl

[1]
https://github.com/nipy/nipype/blob/master/nipype/pipeline/plugins/pbs.py
https://github.com/nipy/nipype/blob/master/nipype/pipeline/plugins/sge.py


------------------------------------------------------------------------------
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security
threats, fraudulent activity, and more. Splunk takes this data and makes
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2dcopy2
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to