Re: [scikit-learn] scikit-learn twitter account

2019-11-16 Thread Tom Augspurger
If you want multiple people to have access, you can use tweetdeck.twitter.com to delegate access without sharing credentials. Just need their regular twitter handle. > On Nov 16, 2019, at 01:50, Alexandre Gramfort > wrote: > >  > hi, > > it's me. I own the scikit-learn twitter account.

Re: [scikit-learn] Long term roadmap and moonshot goals

2019-07-23 Thread Tom Augspurger
Pandas will be running one soon too: https://github.com/pandas-dev/pandas/issues/27477 It may be worth coordinating on questions so that we can compare communities (or combining surveys to reduce "survey-fatigue" somehow? Haven't thought through this). Tom On Tue, Jul 23, 2019 at 6:54 AM Adrin

Re: [scikit-learn] Continues monitoring of benchmark performances

2019-07-22 Thread Tom Augspurger
Thanks Adrin, A month or so ago I started running scikit-learn benchmarks, but I had to disable them since they were taking too long (longer than a day). I haven't had time to investigate why, but I assume it was an issue with how I set them up. Just FYI, I'm planning to include "maintain and

Re: [scikit-learn] Can I evaluate clustering efficiency incrementally?

2019-05-14 Thread Tom Augspurger
If anyone is interested in implementing these, dask-ml would welcome additional metrics that work well with Dask arrays: https://github.com/dask/dask-ml/issues/213. On Tue, May 14, 2019 at 2:09 AM Uri Goren wrote: > Sounds like you need to use spark, > this project looks promising: >

Re: [scikit-learn] Announcing sklearn-xarray

2017-12-04 Thread Tom Augspurger
I haven't looked at the implementation of `sklearn_xarray.dataarray.wrap` yet, but a simple test on `dask_ml.preprocessing.StandardScaler` failed with the (probably expected) `TypeError: 'int' object is not iterable` when dask-ml attempts an `X.mean(0)`. I'd be interested to hear what changes

Re: [scikit-learn] merging the predicted labels with original dataframe

2017-07-20 Thread Tom Augspurger
Something like your_df['prediction'] = pd.Series(clf.predict(X_test), index=X_test.index) should handle all the alignment. On Thu, Jul 20, 2017 at 11:04 AM, Ruchika Nayyar wrote: > The original dataset contains both trainng/testing, I have predictions > only on