Re: [Scikit-learn-general] onehotencoder and data load

2014-12-16 Thread Daniel Sullivan
Also, I meant read_csv: http://pandas.pydata.org/pandas-docs/stable/generated/pandas.io.parsers.read_csv.html not load_csv On Tue, Dec 16, 2014 at 9:55 AM, Daniel Sullivan wrote: > > Hi Roberto, > > One thing you might try to get an integer instead of one-hot encoded > values is a LabelEncoder:

Re: [Scikit-learn-general] onehotencoder and data load

2014-12-16 Thread Daniel Sullivan
Hi Roberto, One thing you might try to get an integer instead of one-hot encoded values is a LabelEncoder: http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.LabelEncoder.html It's really useful if you process the complete dataset in memory. If you can't hold your complete da

[Scikit-learn-general] onehotencoder and data load

2014-12-15 Thread Pagliari, Roberto
When using OneHotEncoder, is it possible to have one integer per feature as the output, as opposed to binary representation? Also, when using OneHotEncoder, what would be the method to load data (.csv) with mixed type (number and categorical)? Thanks, --