On Do, 2014-03-06 at 16:30 -0500, josef.p...@gmail.com wrote:
> On Thu, Mar 6, 2014 at 3:49 PM, Ralf Gommers <ralf.gomm...@gmail.com> wrote:
> >
> >
> >
> > On Thu, Mar 6, 2014 at 1:40 PM, Sebastian Berg <sebast...@sipsolutions.net>
> > wrote:
> >>
> >> On Mi, 2014-03-05 at 10:21 -0800, David Goldsmith wrote:
> >> >
> >> >
> >> >
> >> > Date: Wed, 05 Mar 2014 17:45:47 +0100
> >> >         From: Sebastian Berg <sebast...@sipsolutions.net>
> >> >         Subject: [Numpy-discussion] Adding weights to cov and corrcoef
> >> >         To: numpy-discussion@scipy.org
> >> >         Message-ID: <1394037947.21356.20.camel@sebastian-t440>
> >> >         Content-Type: text/plain; charset="UTF-8"
> >> >
> >> >         Hi all,
> >> >
> >> >         in Pull Request https://github.com/numpy/numpy/pull/3864 Neol
> >> >         Dawe
> >> >         suggested adding new parameters to our `cov` and `corrcoef`
> >> >         functions to
> >> >         implement weights, which already exists for `average` (the PR
> >> >         still
> >> >         needs to be adapted).
> >> >
> >> >
> >> > Do you mean adopted?
> >> >
> >>
> >> What I meant was that the suggestion isn't actually implemented in the
> >> PR at this time. So you can't pull it in to try things out.
> >>
> >> >
> >> >         However, we may have missed something obvious, or maybe it is
> >> >         already
> >> >         getting too statistical for NumPy, or the keyword argument
> >> >         might be
> >> >         better `uncertainties` and `frequencies`. So comments and
> >> >         insights are
> >> >         very welcome :).
> >> >
> >> >
> >> > +1 for it being "too baroque" for NumPy--should go in SciPy (if it
> >> > isn't already there): IMHO, NumPy should be kept as "lean and mean" as
> >> > possible, embellishments are what SciPy is for.  (Again, IMO.)
> >> >
> >>
> >> Well, on the other hand, scipy does not actually have a `std` function
> >> of its own, I think. So if it is quite useful I think this may be an
> >> option (I don't think I ever used weights with std, so I can't argue
> >> strongly for inclusion myself). Unless adding new functions to
> >> `scipy.stats` (or just statsmodels) which implement different types of
> >> weights is the longer term plan, then things might bite...
> >
> >
> > AFAIK there's currently no such plan.
> 
> since numpy has taken over all the basic statistics, var, std, cov,
> corrcoef, and scipy.stats dropped those, I don't see any reason to
> resurrect them.
> 
> The only question IMO is which ddof for weighted std, ...
> 

I am right now a bit unsure about whether or not the "weights" would be
"aweights" or different... R seems to not care about the scale of the
weights which seems a bit odd to me for an unbiased estimator? I always
assumed that we can do the statistics behind using the ddof... But even
if we can figure out the right way, what I am doubting a bit is that if
we add weights, their names should be clear enough to not clash with
possibly different kind of (interesting) weights in other functions.


> statsmodels has the basic statistics with frequency weights, but they
> are largely in support of t-test and similar hypothesis tests.
> 
> Josef
> 
> 
> >
> > Ralf
> >
> >
> >
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion@scipy.org
> > http://mail.scipy.org/mailman/listinfo/numpy-discussion
> >
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion@scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
> 


_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion@scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

Reply via email to