Re: [Scikit-learn-general] Best way to integrate probability density function from GMM? -- Fixed

2014-10-10 Thread Robert Kern
r region. The function you want is scipy.stats.mvn.mvndst(). See the FORTRAN source for the arguments: https://github.com/scipy/scipy/blob/master/scipy/stats/mvndst.f#L78-L118 You will have to call this once for each Gaussian component in the mixture and then do the appropriately-weighted sum. -

Re: [Scikit-learn-general] Discrepancy in LogisticRegression on Windows vs. Linux with fixed random_state

2014-04-24 Thread Robert Kern
cibility problem ("this"). > by encoding thread identifiers into > the charcteristic polynomials in such a way that they are "relatively prime > to each other". That means that each thread gets an independent stream of > random numbers. Since there is one Mersenne Twi

Re: [Scikit-learn-general] Discrepancy in LogisticRegression on Windows vs. Linux with fixed random_state

2014-04-24 Thread Robert Kern
ns as > the state is smaller than MT (a key per thread + a counter) and with > the inclusion of such functions in current processors, you get values > faster than MT. -- Robert Kern -- Start Your Social Networ

Re: [Scikit-learn-general] Discrepancy in LogisticRegression on Windows vs. Linux with fixed random_state

2014-04-24 Thread Robert Kern
pplication. It looks like liblinear just uses rand() to do Fisher-Yates shuffles in some of its coordinate descent solvers. I *suspect* that independence is not a strictly necessary property here, but reproducibility is. -- Robert Kern

Re: [Scikit-learn-general] Request for CC0 licensing on examples

2014-01-28 Thread Robert Kern
ur slide (anyone getting your slide PDFs will thank you for it anyways), and I think you will have satisfied the desires of the authors when they contributed them under the BSD license (n.b. I am not one of them, so take that for what it&#x

Re: [Scikit-learn-general] from sklearn.all import *

2013-12-02 Thread Robert Kern
On Mon, Dec 2, 2013 at 3:21 PM, Nelle Varoquaux wrote: > > On 2 December 2013 16:11, Gael Varoquaux wrote: > > On Mon, Dec 02, 2013 at 02:50:45PM +0000, Robert Kern wrote: > >> > +1. "Import *" is a really really bad habit. And hacked up interactive > &

Re: [Scikit-learn-general] from sklearn.all import *

2013-12-02 Thread Robert Kern
> proper and find that Python is not very systematic in what it does. I know you are addressing a comment from the original proposal, but I think these arguments are in *favor* of the proposal, in general. "api" modules are very useful without "import *". Proper use of

Re: [Scikit-learn-general] Array memory layout and slicing

2013-11-26 Thread Robert Kern
ould you handle this case? Check .f_contiguous and .c_contiguous. If neither is True, use np.ascontiguousarray() to get a C-contiguous array. Or np.asfortranarray() to get a Fortran-contiguous array if that's more convenient. np.asarray() will not ensure either contiguity, just ndarray-nes

Re: [Scikit-learn-general] Nonn-ASCII in source files

2013-08-28 Thread Robert Kern
onsider any program broken that doesn't grok it. Setting aside the editor issue, the actual technical problem is that you are using UTF-8 characters in byte string literals rather than unicode literals for the docstrings. If you used unicode literals (u""" """),

Re: [Scikit-learn-general] Fwd: , Write a book on "Learning Scikit " for Packt Publishing

2013-04-22 Thread Robert Kern
MATLAB(TM) Signal Processing Toolbox(TM) _Learning Toolbox_. Good luck in your search. -- Robert Kern -- Precog is a next-generation analytics platform capable of advanced analytics on semi-structured data. The pl

Re: [Scikit-learn-general] Using sklearn in Hadoop

2013-02-04 Thread Robert Kern
t (where the local > gradients are aggregated, a gradient step taken on the master and the weight > vector rebroadcast) - see > http://faculty.utpa.edu/reillycf/courses/CSCI6175-F11/papers/nips2010mannetal.pdf > for som

Re: [Scikit-learn-general] Using sklearn in Hadoop

2013-01-27 Thread Robert Kern
ested eye on the Spark project for a while now. Can you share any sklearn+Spark examples that you've worked up so far? -- Robert Kern -- Master Visual Studio, SharePoint, SQL, ASP.NET, C# 2012, HTML5, CSS, MVC, Windows

Re: [Scikit-learn-general] Data Set on Tutorial: Machine Learning for Astronomy with Scikit-learn

2012-11-16 Thread Robert Kern
to store data, thought I never had problems on my personal computers. > Saving as txt is more reliable. The NPY file format only uses pickle for object arrays, which this isn't. Otherwise, it records floats in standard IEEE-754 binary format with enough header information to reconstruct i

Re: [Scikit-learn-general] Interest in advanced linear regression topics?

2012-11-14 Thread Robert Kern
ely accepted as a legal practice, at least in the US, is to split up into two teams: one that reads the original software and writes documentation and test cases and one that takes the documentation and tests to write the new implementation. -- Robert Kern --

Re: [Scikit-learn-general] preprocessing.scaler uses population standard deviation

2012-11-06 Thread Robert Kern
variance itself, in order to get unit variance. Remember that variance has units of [data]**2 not [data]. Whether you treat that square root as a separate parameter with an estimator that has properties worth caring about (like biasedn

Re: [Scikit-learn-general] preprocessing.scaler uses population standard deviation

2012-11-06 Thread Robert Kern
umpy, "ddof", is somewhat cryptic, > though. Sorry about that. All of the less cryptic choices were lies. -- Robert Kern -- LogMeIn Central: Instant, anywhere, Remote PC access and management. Stay in control, up

Re: [Scikit-learn-general] Jython and Scikit-Learn

2012-10-27 Thread Robert Kern
On Sat, Oct 27, 2012 at 10:39 PM, Joseph Turian wrote: > How does jnius compare with jpype? It isn't dead, mostly. More seriously, with active developers and Cython underpinnings, they might accept some PRs to add efficient numpy support. -- Rob

Re: [Scikit-learn-general] Jython and Scikit-Learn

2012-10-26 Thread Robert Kern
You may also want to consider jnius: http://pypi.python.org/pypi/jnius/ -- Robert Kern -- WINDOWS 8 is here. Millions of people. Your app in 30 days. Visit The Windows 8 Center at Sourceforge for all your

Re: [Scikit-learn-general] Scikit Learn Web Site doesn t work

2012-10-23 Thread Robert Kern
On Tue, Oct 23, 2012 at 10:18 PM, didier vila wrote: > All > > the website of Scikit Learn doesn t work. Is it normal ? Is it hosted on Sourceforge? All Sourceforge-hosted project websites are down for me right now. -- Ro

Re: [Scikit-learn-general] GaussianProcess 'nugget'

2012-01-31 Thread Robert Kern
asymptotically closer to the "sill" value (the prior variance over the whole domain, or the uncertainty of a point infinitely far away from point X given the value at point X). The variogram can be estimated from the data, and one can use empirical variograms more or less directly. It's a bit

Re: [Scikit-learn-general] [pystatsmodels] Re: Sprinting at PyCon US 2012 in Santa Clara in March.

2012-01-26 Thread Robert Kern
++ project) to use Jython. This should be fixable by writing a C++->Python embedding/wrapping. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."

Re: [Scikit-learn-general] scikit test failure on osx

2011-11-27 Thread Robert Kern
ccelerate framework is just an old version ATLAS with the serial numbers filed off (literally; they removed some of the APIs that identify the specific version of ATLAS used). I don't think they've updated it particularly vigorously in recent years, but I could be wrong. -- Robert Kern "I

Re: [Scikit-learn-general] A possible solution to templated types in cython

2011-10-19 Thread Robert Kern
On Wed, Oct 19, 2011 at 22:48, Gael Varoquaux wrote: > On Wed, Oct 19, 2011 at 10:43:52PM +0100, Robert Kern wrote: >> I recommend Tempita, a tiny but well-featured templating engine that >> can be dropped into your codebase: > >>   http://pythonpaste.org/tempita/ >

Re: [Scikit-learn-general] A possible solution to templated types in cython

2011-10-19 Thread Robert Kern
umpy has a similar mechanism, though probably not as pretty. > It can be found in numpy.distutils.conv_template. It's very rough, but it > may suit our needs without adding a dependency. I recommend Tempita, a tiny but well-featured templating engine that can be dropped into your codebase: http://pythonpaste

Re: [Scikit-learn-general] Python.exe keep spawning when njobs inGridSearchCV is not 1

2011-09-20 Thread Robert Kern
as talking about fixing the example to run correctly under Windows. That's what I meant about it being a teaching opportunity; the example should show how to correctly structure a script to use a parallel grid search on all platforms. I would additionally suggest that the test-running script

Re: [Scikit-learn-general] Python.exe keep spawning when njobs in GridSearchCV is not 1

2011-09-20 Thread Robert Kern
On Tue, Sep 20, 2011 at 02:49, Olivier Grisel wrote: > 2011/9/20 Robert Kern : >> The fix is straightforward on sklearn's part: move the code into a >> function and call that function under an "if __name__ == '__main__':" >> test. Or just move eve

Re: [Scikit-learn-general] Python.exe keep spawning when njobs in GridSearchCV is not 1

2011-09-20 Thread Robert Kern
ing at the top level, all of the child processes will execute the same code as if they were the main process and creating an explosion of processes. The fix is straightforward on sklearn's part: move the code into a function and call that function under an "if __name__ == '__main__':&