Hello Vergand,

One of my jobs is running a parameter vocabulary that currently has over 27,000 
entries.  Much of its bulk is due to the assignment of multiple parameter names 
for each step in a numeric sequence - such as radiation wavelengths or sediment 
grain-size expressed as percentiles.

Consider a scenario where you start with a small group of standard percentiles 
- say 5, 25, 50, 75, 95.  You set up a parameter name for each of these in the 
first instance, which is easy. Then along comes another user who wants to 
describe data with percentiles at a resolution of 1 per cent.  So another 95 
parameter names need to be set up.  Then along comes another user who wants a 
resolution of 0.1 per cent.  I start drowning in names and nobody can find 
anything.

However, had I followed Jonathan's second solution all I would need to do as a 
vocabulary manager is set up one concept to describe the percentile axis, which 
covers every user from those who use a handful of percentiles to those whose 
percentile resolution requirements are beyond the bounds of my imagination.

I know Jonathan's first option was based on propogation of cell methods and not 
standard names.  However, these still need managing and if they become 
excessively abundant they also become difficult to navigate.

Cheers, Roy.

________________________________________
From: Vegard Bønes [vegard.bo...@met.no]
Sent: 15 November 2011 13:17
To: Lowry, Roy K.
Cc: cf-metadata@cgd.ucar.edu; Jonathan Gregory
Subject: Re: [CF-metadata] standards for probabilities

Dear Roy,

Can you be a bit more concrete about why you prefer the second alternative?


-- Vegard


----- Original Message -----
Fra: "Roy K. Lowry" <r...@bodc.ac.uk>
Til: "Jonathan Gregory" <j.m.greg...@reading.ac.uk>, "Vegard B??nes" 
<vegard.bo...@met.no>
Kopi: cf-metadata@cgd.ucar.edu
Sendt: 15. november 2011 11:17:01
Emne: RE: [CF-metadata] standards for probabilities

Dear Jonathan,

I prefer your second alternative.  It's not what I do, but it's what I wish I 
did!!

Cheers, Roy.

________________________________________
From: cf-metadata-boun...@cgd.ucar.edu [cf-metadata-boun...@cgd.ucar.edu] On 
Behalf Of Jonathan Gregory [j.m.greg...@reading.ac.uk]
Sent: 15 November 2011 10:11
To: Vegard B??nes
Cc: cf-metadata@cgd.ucar.edu
Subject: Re: [CF-metadata] standards for probabilities

Dear Vegard

> I want to express such things as "25th percentile precipitation amount" 
> (based on ensemble data), and probability that air temperature will be within 
> 2.5 degrees of the forecast. How should I do this?

You are right, this case has not yet been dealt with, although the guidelines
for construction of standard names foresee that needs like this might arise!

If the quantity is a precipitation_amount, it's fine to use that standard
name. The question is how to record that is the 25th percentile. Two possible
ways to do this would be:

* To extend the possible syntax of cell_methods so that it can describe
percentiles. It is already possible to indicate a median in cell_methods, and
that is a particular percentile. The advantage of this way of doing it would
be that you would record whether the distribution of precipitation amounts
being considered was for time-variation, or spatial variation, or some other
kind of variation. Obviously you could have a probability distribution with
percentiles for many different independent variables.

* To use a size-1 or scalar coordinate variable to record the probability,
with a new standard_name, perhaps
cumulative_distribution_function_of_precipitation_amount.
The value of this coordinate would be 0.25 for the 25th percentile. The
advantage of this method would be that you could have several different
percentiles in the same variable, by having a multivalued probability coord.
If you wanted to be specific about what the independent variable was, that
would have to be included in the standard name as well e.g.
cumulative_distribution_function_of_precipitation_amount_over_time.

What do you think?

Cheers

Jonathan
_______________________________________________
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata--
This message (and any attachments) is for the recipient only. NERC
is subject to the Freedom of Information Act 2000 and the contents
of this email and any reply you make may be disclosed by NERC unless
it is exempt from release under the Act. Any material supplied to
NERC may be stored in an electronic records management system.
_______________________________________________
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata

Reply via email to