On Fri, 2 Feb 2018 12:44:43 -0500, David Golden <[email protected]> wrote:

> It's possible that an *alternate* simplest thing might be more meaningful:
> count the number of distinct *authors* depended on by any distribution
> (including, for the sake of example, the same author, but only once).
>
> In the Foo case:
>
> - Foo has 3 authors depending on it
> - Foo-Bar has 3 authors depending on it
> - Foo-Bar-Noggin and Foo-Bar-Baz have 0 authors depending on it
> - Foo-Bar-A has 1 author depending on it
>
> In the Neil's Thing case:
>
> - Thing has 2
> - Plant has 1
> - Fruit and Banana each have 1
> - Silver-Banana has 0
>
> In Tux's Thing case, all the counts just increase by one and Distasteful
> has 0.
>
> Consider this case:
> Zot (Larry) -> Pow (Moe) -> Splat (Curly) -> Whiff (Moe) -> Oof (Larry)
>
> - Zot has 3
> - Pow has 3
> - Splat has 2
> - Whiff has 1
> - Oof has 0
>
> The interesting thing about this metric to me is that it focuses on this
> question: "If a module breaks, how many *people* are affected" which sounds
> a lot more like what Jim's asking.

No, it tells you how many *authors* are affected (or author groups).
Breaking something up-river of, say, DBI will affect just 3 authors (the
(co)maints), whereas it affects millions of people (the users).
If some brave author maintains two or more up-river modules, it is
still just one author, but uncountable users. (don't count core modules
here, that would make it too hard).
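
For reference, the distinct-author count David describes can be sketched
in a few lines. This is only an illustration, not the code in
zzz-index-cpan-meta: the AUTHOR and DEPENDENTS tables are hand-typed from
the Zot example quoted above, and the helper names are made up here.

```python
# Hypothetical data typed in from the Zot example: each distribution maps
# to its author, and each edge points from an upstream distribution to the
# distributions that depend on it directly.
AUTHOR = {
    "Zot": "Larry", "Pow": "Moe", "Splat": "Curly",
    "Whiff": "Moe", "Oof": "Larry",
}
DEPENDENTS = {
    "Zot": ["Pow"], "Pow": ["Splat"], "Splat": ["Whiff"],
    "Whiff": ["Oof"], "Oof": [],
}

def downstream(dist, dependents):
    """Return every distribution depending on dist, directly or indirectly."""
    seen = set()
    stack = list(dependents.get(dist, []))
    while stack:
        d = stack.pop()
        if d not in seen:
            seen.add(d)
            stack.extend(dependents.get(d, []))
    return seen

def author_count(dist, dependents, author):
    """Count the distinct authors of all downstream distributions."""
    return len({author[d] for d in downstream(dist, dependents)})

counts = {d: author_count(d, DEPENDENTS, AUTHOR) for d in AUTHOR}
print(counts)
```

On this data the sketch gives Zot 3, Pow 3, Splat 2, Whiff 1 and Oof 0,
the same numbers as in the quoted list. Note that it counts authors of
distributions, which says nothing about how many users sit behind them.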
Say we have

 Broum + Brumble - Droki - Blimco - Turf
 ALEX  | BEN       JOKI    FLON     DIY
       |
       + Fruig - DBI - DBD::XY
         BEN     HIW   JOCKX

IMHO BEN should be counted twice for Broum, not once
my € 0.02
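
To make the difference concrete, here is a rough sketch of both ways of
counting for the Broum example. It assumes the diagram reads as Broum
having two direct dependents (Brumble and Fruig), each heading its own
chain, and it assumes one possible reading of "counted twice": authors
are counted once per direct-dependent branch and the branches are summed.
The table contents and function names are made up for illustration.

```python
# Hypothetical data typed in from the diagram: distribution -> author,
# and upstream distribution -> distributions that depend on it directly.
AUTHOR = {
    "Broum": "ALEX", "Brumble": "BEN", "Droki": "JOKI", "Blimco": "FLON",
    "Turf": "DIY", "Fruig": "BEN", "DBI": "HIW", "DBD::XY": "JOCKX",
}
DEPENDENTS = {
    "Broum": ["Brumble", "Fruig"],
    "Brumble": ["Droki"], "Droki": ["Blimco"], "Blimco": ["Turf"],
    "Fruig": ["DBI"], "DBI": ["DBD::XY"],
}

def downstream(dist):
    """Return every distribution depending on dist, directly or indirectly."""
    seen = set()
    stack = list(DEPENDENTS.get(dist, []))
    while stack:
        d = stack.pop()
        if d not in seen:
            seen.add(d)
            stack.extend(DEPENDENTS.get(d, []))
    return seen

def distinct_authors(dist):
    """Each downstream author counts once, however many dists they own."""
    return len({AUTHOR[d] for d in downstream(dist)})

def per_branch_authors(dist):
    """Assumed reading: count distinct authors per direct-dependent branch,
    then sum, so an author present in two branches counts twice."""
    total = 0
    for child in DEPENDENTS.get(dist, []):
        branch = {child} | downstream(child)
        total += len({AUTHOR[d] for d in branch})
    return total

print(distinct_authors("Broum"), per_branch_authors("Broum"))
```

With these assumptions Broum scores 6 when BEN is counted once and 7 when
BEN is counted once per branch. Other readings of "twice" are possible,
e.g. counting every (author, distribution) pair, which would simply give
the number of downstream distributions.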
> Counting an author as 1 for any downstream by the same author is arbitrary
> -- I think it simplifies the analysis and gives more or less the same
> answer, but it could be done the other way, too, if people preferred.
>
> David
>
> On Fri, Feb 2, 2018 at 9:48 AM, James E Keenan <[email protected]> wrote:
>
> > Overall Question: How can we implement different ways of constructing the
> > CPAN river?
> >
> > Background:
> >
> > Since about this time last year I've had occasion to use the concept of
> > CPAN-river to derive lists of distributions to be tested against whatever
> > Perl 5 blead is of the moment. In particular, for the last three months
> > I've been creating assessments of the impact of monthly Perl 5 development
> > releases on the "top 1000" of the CPAN river. (See, e.g.,
> > http://thenceforward.net/perl/misc/cpan-river-1000-perl-5.27-master.psv.gz
> > )
> >
> > To calculate the CPAN river, I've been using the programs developed by
> > David Golden found here:
> >
> > https://github.com/dagolden/zzz-index-cpan-meta
> >
> > ... with one modification: a local branch for the second of the three
> > programs cited there. I use a local branch because I'm using Linux and
> > cannot install Ramdisk.
> >
> > Problem:
> >
> > As I've stared at this data over the past year I've become aware that the
> > order in which distros appear in the river is not necessarily the most
> > useful for assessing the real-world impact of changes in blead. Put less
> > charitably, the CPAN river can be "gamed." It is possible for a person to
> > release a large number of distributions which have dependencies on other
> > distributions by the same author. That can boost some of those
> > distributions high up into the CPAN river -- into, say, the "top 1000" that
> > I use in my monthly program.
> >
> > But if that author's distributions are not depended upon by *other*
> > authors' distributions then they are arguably less important than those
> > such as Module-Build and DateTime which are depended upon by vast numbers
> > of distros written by people other than those distros' maintainers.
> >
> > Since "testing against blead" programs take hours to run, I would like to
> > have that time spent focusing on what I consider to be more relevant
> > distros.
> >
> > For the 5.29.* development cycle starting in May of this year, I would
> > like to be able to use a ranking of CPAN distros which goes beyond asking:
> >
> > * "How many other distributions depend on this one?"
> >
> > ... to asking:
> >
> > * "How many distributions by other authors/maintainers depend on this one?"
> >
> > Would that be feasible? Has anyone attempted this already?
> >
> > Thank you very much.
> > Jim Keenan
> >
--
H.Merijn Brand http://tux.nl Perl Monger http://amsterdam.pm.org/
using perl5.00307 .. 5.27 porting perl5 on HP-UX, AIX, and openSUSE
http://mirrors.develooper.com/hpux/ http://www.test-smoke.org/
http://qa.perl.org http://www.goldmark.org/jeff/stupid-disclaimers/

