Hi all, My tutorial on scikit-learn at PyCon has been accepted. Would anybody be interested in sprinting there? The sprint days are Mar. 12-15.
http://us.pycon.org/2012/ I think Wes has submitted a talk on Pandas too. I would be very interested in sprinting on machine learning & data analytics in the cloud using partitioned memory mapped arrays to prototype a low overhead alternative to the Hadoop MapReduce runtime optimized for numerical data and in-memory iterative processing, probably leveraging IPython.parallel and POSIX sendfile [1]. Some Pandas idioms like groupBy and alignment would be interesting to investigate in a distributed setting IMHO. [1] http://linux.die.net/man/2/sendfile -- Olivier http://twitter.com/ogrisel - http://github.com/ogrisel ------------------------------------------------------------------------------ Cloud Services Checklist: Pricing and Packaging Optimization This white paper is intended to serve as a reference, checklist and point of discussion for anyone considering optimizing the pricing and packaging model of a cloud services business. Read Now! http://www.accelacomm.com/jaw/sfnl/114/51491232/ _______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general