Re: [scikit-learn] Imputers and DataFrame objects

2020-08-18 Thread Kevin Markham
nd it's totally fine if that is your preferred option! Hope that helps, Kevin On Tue, Aug 18, 2020 at 7:56 AM Ram Rachum wrote: > > > On Mon, Aug 17, 2020 at 8:55 PM Kevin Markham wrote: > >> Hi Ram, >> >> These are great questions! >> > > Thank y

Re: [scikit-learn] Imputers and DataFrame objects

2020-08-17 Thread Kevin Markham
<https://www.dropbox.com/preview/Desktop/Shani/floof.py>, but it > definitely felt like I was doing something wrong, like this is supposed to > be simpler. > > Also, when trying to use BinaryEncoder for "one" "two" "three", it raised > an exceptio

[scikit-learn] Best way to include SimpleImputer before CountVectorizer in a Pipeline?

2020-07-01 Thread Kevin Markham
strategy='constant') one_dim = FunctionTransformer(np.reshape, kw_args={'newshape':-1}) vect = CountVectorizer() pipe = make_pipeline(imp, one_dim, vect) pipe.fit_transform(df[['text']]).toarray() == END OF CODE EXAMPLE == -- Kevin Markham F

[scikit-learn] Micro average in classification report

2018-10-05 Thread Kevin Markham
ply appreciate your work! Rather, I'm planning to create a video series explaining some of the new features in 0.20, and I want to make sure that I'm accurately explaining these new features. Thanks very much! Kevin -- Kevin Markham Founder, Data School https://www.dataschool.io https