I have refined the generated statistics in AWB, and the result is shown 
below for CKM archetypes (snapshot June 2011). Notes:

  * the stats are over the 197 archetypes validated by the ADL 1.5
    compiler (note that this is more strict than any of the ADL 1.4
    tools). Another 80 archetypes did not pass, most for trivial reasons
    of 1.5 validity like repeated occurrences and cardinality constraints.
  * the stats are obviously skewed toward the actual archetypes that
    have been done and are currently valid, which is more heavily
    OBSERVATION and EVALUATION related types.
  * the first group of stats is aggregated ontology codes, e.g. 3794
    at-code definitions over those 197 archetypes (I will add in average
    computation, but it is easy enough to see that its about 19 at-code
    average per archetype).
      o this is a guide to costs and scale of translation and binding work
  * in the RM metrics stats group, these are aggregated counts of
    archetyped nodes of any kind, i.e. there are 4327 nodes of any kind
    of C_OBJECT. This is an average of around 21.5 nodes per archetype
      o this is potentially a guide to complexity and maintainability
  * in the RM breakdown stats group, each row indicates how many times
    that RM class was mentioned (i.e. constrained) in the 197 valid
    archetypes. E.g. the ADDRESS class was constrained 7 times.
      o for each class, the number of occurrences of constraints on each
        attribute is available on the next level breakout (see further down)
          + what this tells us (usually) is that a relatively small
            subset of the total number of attributes defined for a given
            type (e.g. DV_QUANTITY) are archetyped
          + within the DATA_VALUE types, it also tells us what DV_XXX
            types are actually needed, according to clinical modellers,
            e.g. DV_DATE, DV_DURATION and DV_DATE_TME are all used.
      o not surprisingly CLUSTERs, ELEMENTS, ITEM_TREE and a few other
        classes figure prominently
      o re: the recent debates about ITEM_STRUCTURE simplification, note
        that ITEM_SINGLE and ITEM_LIST types turn up in the stats (maybe
        these classes are candidates for turning into ITEM_TREEs, or
        maybe they are evidence that we need to retain ITEM_LIST?)

There is more work to go here obviously, and I won't try to draw any 
hard conclusions right from what we see here. In terms of tooling the 
next steps are:

  * add a few further stats like average, max, min
  * some cleanup in visual presentation
  * generate an XML report form of this data with a .css for use with
    normal browsers
  * publish a new beta release of the Workbench, including all this
    functionality

Any suggestions, feedback is of course welcome.






-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: iidhecfe.png
Type: image/png
Size: 25395 bytes
Desc: not available
URL: 
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: jgafeceb.png
Type: image/png
Size: 12176 bytes
Desc: not available
URL: 
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment-0001.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: gbccjicj.png
Type: image/png
Size: 5658 bytes
Desc: not available
URL: 
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment-0002.png>

Reply via email to