On Sep 12, 2006, at 9:24 AM, Salve J Nilsen wrote:
>> Any metric that catches bad things, particularly bad technical
>> things, is going to be just fine.
>> Metrics that try to push "good" behavior are fraught with trouble,
>> because they start pushing people in odd directions.
>
> Do you have an example of this? (Any pointer would be wonderful.)
I have two: pod.t and pod_coverage.t. These are pointless to run on
an end-user's machine. At best they are redundant to immutable tests
already run on the author's machine and just waste processor cycles.
At worst they fail and cause false negative test reports. The
prevalence of those two tests in CPAN modules is almost entirely due
to the influence of CPANTS.
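For concreteness, here is the boilerplate that most of those
distributions ship more or less verbatim (the pattern recommended in
the Test::Pod and Test::Pod::Coverage documentation):

```perl
# Typical t/pod.t: checks every .pm/.pod file for POD syntax errors.
use Test::More;
eval "use Test::Pod 1.00";
plan skip_all => "Test::Pod 1.00 required for testing POD" if $@;
all_pod_files_ok();
```

```perl
# Typical t/pod_coverage.t: checks that every public sub is documented.
use Test::More;
eval "use Test::Pod::Coverage 1.00";
plan skip_all => "Test::Pod::Coverage 1.00 required" if $@;
all_pod_coverage_ok();
```

Note that this boilerplate at least skips rather than fails when the
test modules are absent; the false negatives presumably come from
version skew and stricter checks on end-user machines.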
Despite the criticisms above, the CPANTS POD tests have ultimately
succeeded: they have convinced authors to do a better job of
documenting all methods, or marking private methods as such. I think
few would dispute that the POD tests, in particular, have had a net
positive effect on the quality of CPAN.
===
Now begins a huge digression on encouraging good behavior vs.
discouraging bad behavior, leading to a recommendation for CPANTS.
One flaw in the language of Adam's assertion is that he doesn't
properly distinguish the goals and metrics of CPANTS. Discouraging a
specific bad behavior is just a way of encouraging other unspecified
behavior, which could be good or bad. IF FEASIBLE, it's always
better to encourage good behavior. The danger is not metrics that
encourage good behavior, but instead metrics that encourage a
specific good behavior when there are a multitude of equally-valid
good behaviors. In that case, discouraging the bad behaviors is the
best you can do. I believe that's what Adam was trying to say.
I'm going to continue with the specific example of POD Kwalitee. The
CPANTS goal is (obviously) to encourage higher quality
documentation. However, that's a hard thing for a computer to
measure. So, instead we try to discourage specific bad behaviors:
POD syntax errors and undocumented subroutines.
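That second behavior is mechanically easy to detect. A sketch using
Pod::Coverage (the module underneath Test::Pod::Coverage); the package
name here is purely a placeholder:

```perl
# Sketch: list undocumented ("naked") public subroutines in a package.
# 'My::Module' is illustrative; the module must already be installed
# or otherwise loadable.
use Pod::Coverage;

my $pc = Pod::Coverage->new(package => 'My::Module');
if (defined $pc->coverage) {
    printf "POD coverage: %.0f%%\n", 100 * $pc->coverage;
    print "undocumented: $_\n" for $pc->naked;
}
else {
    print "unrated: ", $pc->why_unrated, "\n";
}
```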
Let me run through an exercise. In the first step, consider how one
would arrive at the need for CPANTS POD tests:
Goal: encourage high-quality CPAN packages
  Assertion: high-quality packages have high-quality documentation
    Assertion: high-quality documentation is parseable by doc tools
      Subgoal: discourage invalid POD
        Measure: Is the POD valid for each module in the package?
    Assertion: high-quality documentation describes every public
      subroutine
      Subgoal: discourage undocumented subs
        Measure: Does each module in the package have documentation
          for every public sub?
The next step in the exercise is how to implement those measures. The
current CPANTS uses simple proxies: namely, we assume that if there
is a t/*pod.t file present then the POD is valid, and if there is a
t/*pod_coverage.t present then all subroutines are documented.
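In other words, the proxy measures reduce to file-existence tests;
something along these lines (a sketch of the idea, not the actual
CPANTS code):

```perl
# Sketch of the proxy metric: kwalitee credit is granted for the mere
# presence of the test files, regardless of what they do when run.
sub has_proper_pod_tests {
    my ($dist_dir) = @_;
    my @pod_tests      = glob("$dist_dir/t/*pod.t");
    my @coverage_tests = glob("$dist_dir/t/*pod_coverage.t");
    return (@pod_tests && @coverage_tests) ? 1 : 0;
}
```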
Note that my subgoals are stated as discouraging bad behavior. It's
always easier to test for failures than successes (case in point:
governments usually create laws, not commandments). The CPANTS POD
tests, however, check for good behavior ("Thou shalt add pod.t to thy
package") instead of checking for bad behavior ("Thou shalt not
include invalid POD in thy module").
Wouldn't it be better to just measure the POD validity directly
instead of using a proxy for that measurement? As an outsider, I'll
guess that CPANTS resorts to the proxies for these reasons, in order
of importance:
1) reliability
2) ease of implementation
3) speed of evaluation
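For the record, the direct measurement is not hard in principle; here
is a sketch using the core Pod::Checker module:

```perl
# Sketch: count POD syntax errors in a file directly, instead of
# inferring validity from the presence of a t/*pod.t file.
use Pod::Checker;

sub pod_error_count {
    my ($file) = @_;
    my $checker = Pod::Checker->new(-warnings => 0);
    $checker->parse_from_file($file, \*STDERR);
    return $checker->num_errors();   # -1 means the file has no POD
}
```

The hard part is not the measurement itself but making its results
cheap, reliable, and reproducible for authors, which is where the
three reasons above come in.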
Certainly, CPANTS wants to avoid false negatives at all costs. Its
impact on the community is purely voluntary, so it wants to avoid
antagonizing authors. If CPANTS mistakenly says that your module has
incomplete POD coverage when you *know* that you have documented
every method, you're going to be annoyed. Some authors may decline
to participate in CPANTS if they get annoyed enough. So, false
negatives are perilous to the success of the entire project.
I believe the main reason that CPANTS measures t/*pod*.t existence
instead of directly running Test::Pod/Test::Pod::Coverage is that the
latter is harder for the author to judge consistently before he/she
uploads to CPAN. But, with the improved availability of offline
CPANTS analysis (via Module::CPANTS::Analyse), it should be feasible
for authors to get rid of more complex false negatives before
uploading to CPAN.
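Concretely, that author-time check might look like this
(cpants_lint.pl ships with Module::CPANTS::Analyse; the exact script
name and invocation may vary by version, and the tarball name is a
placeholder):

```shell
# Build the release tarball, then analyse it offline with the same
# code that cpants.perl.org runs.
perl Makefile.PL
make dist
cpants_lint.pl My-Module-0.01.tar.gz
```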
So, as a technological expedient, CPANTS is encouraging a sub-optimal
good behavior (adding t/*pod*.t to CPAN releases) in the process of
trying to discourage a bad behavior. To fix this, we need to remove
the need for the expedient. That means letting CPANTS perform more
complicated analyses and letting authors test those analyses offline
exactly as they would be performed online on cpants.perl.org.
Thus, I finally get to an action item: CPANTS should encourage
authors to run Module::CPANTS::Analyse offline before uploading to
CPAN. I assert that if we can convince authors to perform more
thorough tests of their packages at author-time, then the quality of
CPAN will improve. And the more closely the metrics match our real
quality goals, the bigger the quality delta we will achieve.
Chris
--
Chris Dolan, Software Developer, Clotho Advanced Media Inc.
608-294-7900, fax 294-7025, 1435 E Main St, Madison WI 53703
vCard: http://www.chrisdolan.net/ChrisDolan.vcf
Clotho Advanced Media, Inc. - Creators of MediaLandscape Software
(http://www.media-landscape.com/) and partners in the revolutionary
Croquet project (http://www.opencroquet.org/)