More RM statistics

Thomas Beale Mon, 24 Oct 2011 15:43:04 +0100

I have refined the generated statistics in AWB, and the result is shown 
below for CKM archetypes (snapshot June 2011). Notes:

* the stats are over the 197 archetypes validated by the ADL 1.5
compiler (note that this is more strict than any of the ADL 1.4
tools). Another 80 archetypes did not pass, most for trivial reasons
of 1.5 validity like repeated occurrences and cardinality constraints.
* the stats are obviously skewed toward the actual archetypes that
have been done and are currently valid, which is more heavily
OBSERVATION and EVALUATION related types.
* the first group of stats is aggregated ontology codes, e.g. 3794
at-code definitions over those 197 archetypes (I will add in average
computation, but it is easy enough to see that its about 19 at-code
average per archetype).
o this is a guide to costs and scale of translation and binding work
* in the RM metrics stats group, these are aggregated counts of
archetyped nodes of any kind, i.e. there are 4327 nodes of any kind
of C_OBJECT. This is an average of around 21.5 nodes per archetype
o this is potentially a guide to complexity and maintainability
* in the RM breakdown stats group, each row indicates how many times
that RM class was mentioned (i.e. constrained) in the 197 valid
archetypes. E.g. the ADDRESS class was constrained 7 times.
o for each class, the number of occurrences of constraints on each
attribute is available on the next level breakout (see further down)
+ what this tells us (usually) is that a relatively small
subset of the total number of attributes defined for a given
type (e.g. DV_QUANTITY) are archetyped
+ within the DATA_VALUE types, it also tells us what DV_XXX
types are actually needed, according to clinical modellers,
e.g. DV_DATE, DV_DURATION and DV_DATE_TME are all used.
o not surprisingly CLUSTERs, ELEMENTS, ITEM_TREE and a few other
classes figure prominently
o re: the recent debates about ITEM_STRUCTURE simplification, note
that ITEM_SINGLE and ITEM_LIST types turn up in the stats (maybe
these classes are candidates for turning into ITEM_TREEs, or
maybe they are evidence that we need to retain ITEM_LIST?)

There is more work to go here obviously, and I won't try to draw any
hard conclusions right from what we see here. In terms of tooling the
next steps are:

* add a few further stats like average, max, min
* some cleanup in visual presentation
* generate an XML report form of this data with a .css for use with
normal browsers
* publish a new beta release of the Workbench, including all this
functionality

Any suggestions, feedback is of course welcome.

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: iidhecfe.png
Type: image/png
Size: 25395 bytes
Desc: not available
URL:
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: jgafeceb.png
Type: image/png
Size: 12176 bytes
Desc: not available
URL:
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment-0001.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: gbccjicj.png
Type: image/png
Size: 5658 bytes
Desc: not available
URL:
<http://lists.openehr.org/mailman/private/openehr-technical_lists.openehr.org/attachments/20111024/2516d7e5/attachment-0002.png>

More RM statistics

Reply via email to