[Scikit-learn-general] using IOpro TextAdapter for skl

2014-08-22 Thread László Sándor
Hi, This is not exactly on-topic, but I hope it is useful for more people than just me. I read in data with these two lines (for speed w/ IOPro) but StandardScaler (which first encounters X) complains about not finding floating point values (but tuples, I think?). Also, the size of the ndarray

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread László Sándor
Thanks, John. I use 0.15.0b1, but I don't set alpha, now that I think of it (the default is not 0 though). Did you see the gist on the issue? On Fri, Aug 22, 2014 at 10:49 AM, John Mark Agosta < johnmark.ago...@gmail.com> wrote: > Laszlo - > > What Sklearn version are you using? Elastic net i

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread John Mark Agosta
Laszlo - What Sklearn version are you using? Elastic net in v 0.14.0 was significantly improved in v0.15 What alpha are you using? Sklearn is known to fail for alpha == 0. -jm __ John Mark Agosta 650 465-4707 johnmark.ago...@gmail.com *"Unpredictable consequences are the most expected t

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread László Sándor
See https://github.com/scikit-learn/scikit-learn/issues/3584 FWIW: Now even the version that first seemed to complete (but I thought I lost output for an unrelated reason) crashes again with the same error. The only thing I changed was to call the script with stdbuf -oL python ….py > ….log O

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread László Sándor
Sorry for the update: but now one version of the job (same size, but different data) still failed with the OS failing to provide memory, even with n_jobs=8. The maximum RAM the job took was 119705224K, while it was allocated 240 GB. I am not sure how to make this reproducible for a bug report

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread László Sándor
Oh, Mailbox failed with the attachment, sorry, here it is. On Fri, Aug 22, 2014 at 8:43 AM, László Sándor wrote: > Sorry, Olivier, I am not sure I follow. In any case, I am happy to update > skl, I was only using the latest 64-bit Linux from the conda distro (though > using MKL wanted to downgr

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread László Sándor
Sorry, Olivier, I am not sure I follow. In any case, I am happy to update skl, I was only using the latest 64-bit Linux from the conda distro (though using MKL wanted to downgrade me to 0.14, I am pretty sure I precluded that). That said, maybe it’s good I did not file a proper bug report: Th

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread Olivier Grisel
Thread based enet CV has been merged after the 0.15 release. -- Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/___ Scikit-learn-general mailing list S

Re: [Scikit-learn-general] classification algorithms that return probabilities?

2014-08-22 Thread Adamantios Corais
Ok, so let me rephrase the problem a little bit. Given a machine learning model built upon scikit-learn, how can I classify new instances but then choose only those with the highest confidence? How do we define "confidence" and how to generate it (if it is not generated automatically by scikit-lear

Re: [Scikit-learn-general] Pipeline - Convert to dense

2014-08-22 Thread Sebastian Okser
I apologize, but I don't follow how that will help much. The features are obtained from the original dataset that is fed to scikit's vectorizers. However, to have the vectorizers work properly they need to be stripped out. I simply want to add them back in prior to any cv splitting. However, as

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread László Sándor
Thanks, I did not know if it was a proper bug, I did not dig very deep. This happened on 0.15.0b2. So what would be meaningful to try, and then report? On Fri, Aug 22, 2014 at 5:41 AM, Alexandre Gramfort wrote: > hi Laszlo, > thanks for using the bug tracker and not the ML to report issues. >

Re: [Scikit-learn-general] Optimal Subset Selection Code Contribution

2014-08-22 Thread Giuseppe Marco Randazzo
Hi Mathieu, i did not understand why u have decided to do not include these sampling techniques. These sampling technique are used to select small representative population of objects from a sample. They work on a multidimensional space of norm , so a simple euclidean space. What i think abo

Re: [Scikit-learn-general] ElasticnetCV crash on 64-bit Linux

2014-08-22 Thread Alexandre Gramfort
hi Laszlo, thanks for using the bug tracker and not the ML to report issues. Also avoid pasting raw text/code in emails but use services like gist.github.com what version of sklearn are you using? ElasticNetCV uses threads in v0.15 so the memory problem should not happen anymore. Alex On Fri,

Re: [Scikit-learn-general] PR about topic models

2014-08-22 Thread Gael Varoquaux
On Tue, Aug 19, 2014 at 11:30:51AM -0400, chyi-kwei yau wrote: > So, are we still considering include this model in the future? I still think that it would be great! > If yes, I can create an issue and start working on this. Thanks a lot, Gaël --

[Scikit-learn-general] sklearn import error. No module named winreg on ubuntu 14.04 up vote

2014-08-22 Thread Luca Puggini
Hi, I have installed sklearn on a fresh ubuntu 14.04 installation. I encounter some import errors. Here there is the description of my problem http://stackoverflow.com/questions/25443503/sklearn-import-error-no-module-named-winreg-on-ubuntu-14-04 How can I solve ? Thanks! ---

Re: [Scikit-learn-general] [GSoC] Wrap up post

2014-08-22 Thread Bertrand Thirion
Impressive. These are long-standing issues. Thanks for the patches. Bertrand - Mail original - > De: "Manoj Kumar" > À: scikit-learn-general@lists.sourceforge.net > Envoyé: Vendredi 22 Août 2014 01:35:13 > Objet: [Scikit-learn-general] [GSoC] Wrap up post > Hi, > A quick wrap up pos

Re: [Scikit-learn-general] [GSoC] Wrap up post

2014-08-22 Thread Manoj Kumar
Thanks Robert :) On Fri, Aug 22, 2014 at 4:48 AM, Robert Layton wrote: > Really interesting work, well done in GSoC! > > > > > -- Godspeed, Manoj Kumar, GSoC 2014, Scikit-learn Mech Undergrad http://manojbits.wordpress.com -