Hi All,
I want to run a large sklearn.ensemble.RandomForestClassifier (with maybe a
dozens or maybe hundreds of trees and 100,000 samples). My desktop won’t handle
this so I want to try using StarCluster. RandomForestClassifier seems to
parallelize easily, but I don’t know how I would split it across many
IPython.parallel engines (if that’s even possible). (Or maybe I should be
foregoing IPython.parallel and using MPI?)
Any help would be greatly appreciated.
Thanks,
Alessandro Gagliardi| Glassdoor|
[email protected]<mailto:[email protected]>
We’re hiring! Check out our open
jobs<http://www.glassdoor.com/about/careers.htm>.
Twitter<https://twitter.com/Glassdoor> |
Facebook<https://www.facebook.com/Glassdoor> | Glassdoor
Blog<http://www.glassdoor.com/blog/>
2012 Webby Award Winner: Best Employment Site
2013 Webby Award Winner: Best Guides/Ratings/Review Site
------------------------------------------------------------------------------
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general