Hi Fred.
The link is dead for me.
Do you link against Accelerate (not sure if this is relevant)?
Cheers,
Andy
On 11/15/2012 08:45 PM, Fred Mailhot wrote:
Dear list,
I'm using GridSearchCV to do some simple model selection for a text
classification task. I've got it working (see below for caveat), but
I'm not convinced that I'm making the best use of this tool. If
someone has the time/inclination, I'd love a set of eyes to check the
following gist to see if I'm doing this correctly:
https://gist.github.com/e2ca1910450819a8a28
Also, for some reason this is throwing errors when I set n_jobs to
anything other than 1. I'm on OS X 10.7.4, using sklearn 0.13. The
traceback looks like:
Process PoolWorker-1:
Traceback (most recent call last):
File
"/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/multiprocessing/process.py",
line 232, in _bootstrap
self.run()
File
"/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/multiprocessing/process.py",
line 88, in run
self._target(*self._args, **self._kwargs)
File
"/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/multiprocessing/pool.py",
line 59, in worker
task = get()
File
"/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/multiprocessing/queues.py",
line 352, in get
return recv()
TypeError: ('data type not understood', <type 'numpy.dtype'>, ('S0',
0, 1))
Process PoolWorker-2:
[...etc etc ad infinitum]
Has anyone else come across this, or perhaps have any insight into
what's going on? Needless to say, this grid search is taking FOREVER
(ca. 10hrs thus far, and only about halfway through), and I'd love to
be able to parallelize it.
Many thanks,
Fred.
------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general