Dear Dan Sorry to be a nuisance, but I still do not see the need for at_or_above_threshold If the data is multiples of some unit (like yours is), a value might exactly equal the threshold. If it isn't discretised (like model data usually isn't, except by the precision of floating-point numbers), the value is very unlikely to equal the threshold exactly. Would it not be OK to understand the existing phrase "above_threshold" as meaning "at_or_above_threshold"? It seems redundant to have both in the standard_name table, because I don't think anyone would use the distinction between the two to label two quantities as being different.
If it's confusing to interpret "above_threshold" as "at_or_above_threshold" then let's rename all the affected standard_names to say "at_or_above|below". But that is cumbersome. Though we try to avoid abbreviations in general in the standard name table, maybe we could use _geq_ and _leq_ in this case. However, I don't think it would be bad just to tweak the definition of "above". Cheers Jonathan ----- Forwarded message from "Hollis, Dan" <dan.hol...@metoffice.gov.uk> ----- > From: "Hollis, Dan" <dan.hol...@metoffice.gov.uk> > To: CF Metadata List <cf-metadata@cgd.ucar.edu> > CC: 'John Graybeal' <john.grayb...@marinexplore.com>, "Gregory, Jonathan" > <j.m.greg...@reading.ac.uk> > Subject: RE: [CF-metadata] Days of rain > Date: Thu, 18 Sep 2014 14:19:06 +0000 > > Hi all, > > Clearly there are pros and cons of both options. However after some > discussion with colleagues here we decided that we'd go with the option of > requesting a new standard name. I'd therefore like to request: > > number_of_days_with_lwe_thickness_of_precipitation_amount_at_or_above_threshold<javascript:void(0)> > > with the following definition: > > "Amount" means mass per unit area. The construction lwe_thickness_of_X_amount > or _content means the vertical extent of a layer of liquid water having the > same mass per unit area. "lwe" means liquid water equivalent. A variable > whose standard name has the form > number_of_days_with_X_at_or_below|above_threshold is a count of the number of > days on which the condition X_at_or_below|above_threshold is satisfied. It > must have a coordinate variable or scalar coordinate variable with the a > standard name of X to supply the threshold(s). It must have a climatological > time variable, and a cell_methods entry for within days which describes the > processing of quantity X before the threshold is applied. A number_of_days is > an extensive quantity in time, and the cell_methods entry for over days > should be "sum". > > > I hope that's acceptable but let me know if anyone spots any problems with > this. > > Thanks, > > Dan > > > ________________________________ > From: CF-metadata [mailto:cf-metadata-boun...@cgd.ucar.edu] On Behalf Of John > Graybeal > Sent: 04 September 2014 17:21 > To: Gregory, Jonathan > Cc: CF Metadata List > Subject: Re: [CF-metadata] Days of rain > > I don't really agree with your mathematical argument (and if that argument is > valid, then the boolean attribute _doesn't_ "in effect carry part of the > definition of the standard name"). > > But I'm fine with that approach, because users could always choose to add the > boolean attribute on their own, if it makes sense to them. It doesn't have to > be a part of the standard. But we would still need to add the explanatory > text for what the name really means. > > Or, we can add the new standard names. I guess Dan's call at this point as to > whether he wants to request that? > > John > > On Sep 4, 2014, at 06:03, Jonathan Gregory > <j.m.greg...@reading.ac.uk<mailto:j.m.greg...@reading.ac.uk>> wrote: > > Dear Dan, John et al. > > I'm not comfortable with a boolean attribute that in effect carries part > of the definition of the standard_name. This isn't consistent with the usual > treatment of standard_names, which completely define the quantity except for > those parts which are dealt with by coordinates or statistical reduction. (You > could maybe argue that a boolean attribute was like a scalar coord variable > supplying a reference value, as is required by some standard names, but that > feels like stretching a point to me.) > > If we really need to distinguish .gt. from .ge. I think we should have > distinct > standard names for them. But I still don't think we really need to distinguish > them. If the data were quantised, .gt. can include .eq. if the threshold is a > multiple of the quantum; if the data were continuous, it strictly means .gt. > Dan's data are anyway a mixture of quantised (for which a threshold of 0.2 mm > really means 0.2 mm) and rounded (for which 0.2 mm means 0.15 mm) so it's not > possible to be strict about how the threshold is applied. > > Best wishes > > Jonathan > > > ----- Forwarded message from "Hollis, Dan" > <dan.hol...@metoffice.gov.uk<mailto:dan.hol...@metoffice.gov.uk>> ----- > > From: "Hollis, Dan" > <dan.hol...@metoffice.gov.uk<mailto:dan.hol...@metoffice.gov.uk>> > To: 'John Graybeal' > <john.grayb...@marinexplore.com<mailto:john.grayb...@marinexplore.com>>, > Alison Pamment > <alison.pamm...@stfc.ac.uk<mailto:alison.pamm...@stfc.ac.uk>>, CF Metadata > List > <cf-metadata@cgd.ucar.edu<mailto:cf-metadata@cgd.ucar.edu>>, "Gregory, > Jonathan" > <j.m.greg...@reading.ac.uk<mailto:j.m.greg...@reading.ac.uk>> > Subject: RE: [CF-metadata] Days of rain > Date: Thu, 4 Sep 2014 10:12:32 +0000 > > On balance the boolean attribute looks like the better option to me, partly > for the reasons of discoverability that John highlights below, but also > because it can be applied to all similar variables. Although there may only > be nine 'threshold' names so far, and I would only need one more to be added, > it nevertheless still seems better to choose a solution that works for all > variables, both now and in the future. The boolean attribute seems to fit the > bill. > > Regards, > > Dan > > -----Original Message----- > From: John Graybeal [mailto:john.grayb...@marinexplore.com] > Sent: 03 September 2014 17:28 > To: Alison Pamment > Cc: Hollis, Dan; Gregory, Jonathan; CF Metadata List > Subject: Re: [CF-metadata] Days of rain > > While I agree it is not a big problem to use at_or_above_threshold in this > and whatever other standard names eventually are needed, discoverability > would be better with the boolean attribute ("comparison_includes_equality"?). > As a practical matter, people looking for, e.g., > number_of_days_with_lwe_thickness_of_precipitation_amount_above_threshold > data will want to discover data that is at_or_above_threshold too, but may > not think (or want) to search for the additional term using > at_or_above_threshold, even if they know of that possibility. > > That said, it's a minor point, I'm OK with either approach. I definitely > think names should not rely on a particular observation technology or > approach (e.g., quantization, rounding, measurement accuracy or technique). > > Will start a new thread re archives. > > john > > > > > On Sep 3, 2014, at 06:52, > alison.pamm...@stfc.ac.uk<mailto:alison.pamm...@stfc.ac.uk> wrote: > > Dear Dan, > > I agree that setting a threshold depending on how the data were collected > does not seem very satisfactory and it wouldn't allow you to combine readings > from a variety of instruments into a single data product. The definition of > 'greater|less than or equal to' is clearly not the same as 'greater|less > than' so I would argue that they are different quantities and should have > different standard names. Currently we have only nine 'threshold' names in > the table and personally I don't think it's a big problem to add one more of > the form > number_of_days_with_lwe_thickness_of_precipitation_amount_at_or_above_threshold. > The definition would of course need to make clear the difference from the > existing name and in fact the definitions should cross-reference one another > to make users aware of both. Do others agree? > > Regarding searching the mailing list archives, if I want to find a very > specific phrase within the email text I download the plain text file for the > appropriate year (available from the main archive page > http://mailman.cgd.ucar.edu/pipermail/cf-metadata/) and use my browser's > 'Find' function. It takes a few minutes to download but it can be a useful > way of pinpointing the thing you're looking for. > > Best wishes, > Alison > > ------ > Alison Pamment Tel: +44 1235 778065 > NCAS/British Atmospheric Data Centre Email: > alison.pamm...@stfc.ac.uk<mailto:alison.pamm...@stfc.ac.uk> > STFC Rutherford Appleton Laboratory > R25, 2.22 > Harwell Oxford, Didcot, OX11 0QX, U.K. > > > > -----Original Message----- > From: Hollis, Dan [mailto:dan.hol...@metoffice.gov.uk] > Sent: 03 September 2014 12:55 > To: Gregory, Jonathan; > cf-metadata@cgd.ucar.edu<mailto:cf-metadata@cgd.ucar.edu> > Subject: Re: [CF-metadata] Days of rain > > Hi Jonathan, > > For manually-read rain gauges the advice to the observer is simply to record > the measurement to one decimal place. For the thresholds of interest this > seems to me to be equivalent to saying the values have been rounded. > Therefore 0.2 mm does mean 0.15-0.25 mm, 1.0 mm means 0.95-1.05 mm, > and 10 mm means 9.95-10.05 mm. > > In contrast automated sites use a tipping bucket gauge (in the UK at least) > which constrains the observations to be multiples of the bucket size. I > believe that for all the data we use this is a nominal 0.2 mm i.e. > precipitation totals can be 0.0 mm, 0.2 mm, 0.4 mm etc, and values such as > 0.1 mm, 0.3 mm, 0.5 mm cannot be reported. Given that all we know is that > the bucket has tipped (i.e. has become full and caused the mechanism to tip > and empty the bucket) this implies that an observation of 0.2 mm actually > means 0.2 <= true value < 0.4 (because the bucket has not yet tipped a > second time). > > For our gridded climate datasets (rainfall total, days of rain etc) we use > data > from both types of gauge without correction or adjustment. I think we can > be fairly confident that uncertainties in the interpolation process will be > quite a bit larger than either the observation uncertainty or the differences > between the two observation types i.e. these types of subtlety are probably > 'in the noise' and to be honest not something I'd given much thought to. > > In conclusion I'm slightly reluctant to specify a threshold that tries to > reflect > how the observations have been gathered, partly because this is not the > same for all sites and partly because it could change in the future (e.g. if > we > were to adopt a different type of rain gauge). Personally I'd prefer to > describe what we do to the data once it has been collected, which in the > case of the 'days of rain' variables is to test if the value is greater than > or > equal to a threshold. > > Thoughts? > > Regards, > > Dan > > > > -----Original Message----- > From: CF-metadata [mailto:cf-metadata-boun...@cgd.ucar.edu] On Behalf > Of Jonathan Gregory > Sent: 01 September 2014 17:42 > To: cf-metadata@cgd.ucar.edu<mailto:cf-metadata@cgd.ucar.edu> > Subject: [CF-metadata] Days of rain > > Dear Dan > > We have several variables that we describe loosely as 'days of rain'. > Strictly speaking they are a count (e.g. for a calendar month) of the number > of days when the 24-hour precipitation total was greater than or equal to a > threshold. We currently generate grids for three thresholds - 0.2mm, 1.0mm > and 10.0mm. My intention is to use the following existing standard name: > > > number_of_days_with_lwe_thickness_of_precipitation_amount_above_thr > eshold > > My only slight problem is that the definition implies 'greater than' > whereas our variables are 'greater than or equal to' the threshold. Assuming > the observations have a precision of 0.1 mm ... > > I think it depends on how the data have been treated. Are they rounded to > the > nearest 0.1 mm? If so, a recorded value of 0.0 mm means an actual value in > the > range 0.00-0.05 mm, 0.1 mm means 0.05-0.15 mm, 0.2 mm means 0.15-0.25 > mm, etc., > and your threshold of 0.2 mm in recorded precipitation is actually a > threshold > of 0.15 mm. That is therefore what I would suggest as the coordinate value > for > the threshold. > > Best wishes > > Jonathan > _______________________________________________ > CF-metadata mailing list > CF-metadata@cgd.ucar.edu<mailto:CF-metadata@cgd.ucar.edu> > http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata > _______________________________________________ > CF-metadata mailing list > CF-metadata@cgd.ucar.edu > http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata > -- > Scanned by iCritical. > _______________________________________________ > CF-metadata mailing list > CF-metadata@cgd.ucar.edu<mailto:CF-metadata@cgd.ucar.edu> > http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata > > > ----- End forwarded message ----- > _______________________________________________ > CF-metadata mailing list > CF-metadata@cgd.ucar.edu<mailto:CF-metadata@cgd.ucar.edu> > http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata > ----- End forwarded message ----- _______________________________________________ CF-metadata mailing list CF-metadata@cgd.ucar.edu http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata