Re: [CF-metadata] standard names for stations

2011-09-16 Thread Jonathan Gregory
Dear Nan, John, Jeff

I think variable attributes are generally better than global attributes,
because it's possible or indeed likely that you might have data from different
sources in the same file. I prefer data variables to describe themselves. We
can provide global attributes as a default for the whole file, but it seems
safer to me to repeat the attributes per variable.

Following John's point, I agree that this station ID information will not be
usable by machine or guaranteed to be unique unless we standardise it in some
way. I don't know about this subject, but I am sure there are people who are
well-informed about it! If we want the station ID attribute(s) to be machine-
readable, we can propose a standard format for them. If we want an attr where
the data-writer can informally record ID information that another human could
interpret, we don't need a standard format. Which do we want - any views?

Best wishes

Jonathan
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


Re: [CF-metadata] standard names for stations

2011-09-16 Thread Lowry, Roy K.
Hi Jonathan,

My vote would go to something that is both human and machine-readable.  There 
are standards designed for this such as JSON. Some might consider this comes 
down on the side of the machine more than the human, but I find once I get my 
in I can read JSON much easier than XML.

If this is considered to be going over the top then we need to at least specify 
the order in which the component parts of the string are included and 
preferably we should specify a delimiter to separate the components.  Even an 
ultra-light standard such as this makes the life of any programmer writing code 
to parse the string infinitely easier.

Cheers, Roy.

-Original Message-
From: cf-metadata-boun...@cgd.ucar.edu 
[mailto:cf-metadata-boun...@cgd.ucar.edu] On Behalf Of Jonathan Gregory
Sent: 16 September 2011 15:32
To: cf-metadata@cgd.ucar.edu
Subject: Re: [CF-metadata] standard names for stations

Dear Nan, John, Jeff

I think variable attributes are generally better than global attributes,
because it's possible or indeed likely that you might have data from different
sources in the same file. I prefer data variables to describe themselves. We
can provide global attributes as a default for the whole file, but it seems
safer to me to repeat the attributes per variable.

Following John's point, I agree that this station ID information will not be
usable by machine or guaranteed to be unique unless we standardise it in some
way. I don't know about this subject, but I am sure there are people who are
well-informed about it! If we want the station ID attribute(s) to be machine-
readable, we can propose a standard format for them. If we want an attr where
the data-writer can informally record ID information that another human could
interpret, we don't need a standard format. Which do we want - any views?

Best wishes

Jonathan
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
-- 
This message (and any attachments) is for the recipient only. NERC
is subject to the Freedom of Information Act 2000 and the contents
of this email and any reply you make may be disclosed by NERC unless
it is exempt from release under the Act. Any material supplied to
NERC may be stored in an electronic records management system.
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


Re: [CF-metadata] standard names for stations

2011-09-16 Thread Kennedy, Paul
For what it is worth, we migrated a lot of our systems metadata and config to 
JSON and it works like a charm. The primary reason was to enable humans and 
computers to co exist better. We tried XML but JSON won the day for us. 
Regards
Pk




- Original Message -
From: cf-metadata-boun...@cgd.ucar.edu cf-metadata-boun...@cgd.ucar.edu
To: Jonathan Gregory j.m.greg...@reading.ac.uk; cf-metadata@cgd.ucar.edu 
cf-metadata@cgd.ucar.edu
Sent: Fri Sep 16 23:13:56 2011
Subject: Re: [CF-metadata] standard names for stations

Hi Jonathan,

My vote would go to something that is both human and machine-readable.  There 
are standards designed for this such as JSON. Some might consider this comes 
down on the side of the machine more than the human, but I find once I get my 
in I can read JSON much easier than XML.

If this is considered to be going over the top then we need to at least specify 
the order in which the component parts of the string are included and 
preferably we should specify a delimiter to separate the components.  Even an 
ultra-light standard such as this makes the life of any programmer writing code 
to parse the string infinitely easier.

Cheers, Roy.

-Original Message-
From: cf-metadata-boun...@cgd.ucar.edu 
[mailto:cf-metadata-boun...@cgd.ucar.edu] On Behalf Of Jonathan Gregory
Sent: 16 September 2011 15:32
To: cf-metadata@cgd.ucar.edu
Subject: Re: [CF-metadata] standard names for stations

Dear Nan, John, Jeff

I think variable attributes are generally better than global attributes,
because it's possible or indeed likely that you might have data from different
sources in the same file. I prefer data variables to describe themselves. We
can provide global attributes as a default for the whole file, but it seems
safer to me to repeat the attributes per variable.

Following John's point, I agree that this station ID information will not be
usable by machine or guaranteed to be unique unless we standardise it in some
way. I don't know about this subject, but I am sure there are people who are
well-informed about it! If we want the station ID attribute(s) to be machine-
readable, we can propose a standard format for them. If we want an attr where
the data-writer can informally record ID information that another human could
interpret, we don't need a standard format. Which do we want - any views?

Best wishes

Jonathan
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
-- 
This message (and any attachments) is for the recipient only. NERC
is subject to the Freedom of Information Act 2000 and the contents
of this email and any reply you make may be disclosed by NERC unless
it is exempt from release under the Act. Any material supplied to
NERC may be stored in an electronic records management system.
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


Re: [CF-metadata] Request for a region standard_name

2011-09-16 Thread Jonathan Gregory
Dear Jim

Following Don's comment, if you'd be happy with a standard region name of
contiguous_united_states, and since no-one else has objected, I think we
should agree to that. (We can omit of_america; I would expect people to know
where the united_kingdom is without of_great_britain_and_northern_ireland!)
I expect Alison will comment and/or add it to the table when she is able to.

Cheers

Jonathan
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


Re: [CF-metadata] standard names for stations

2011-09-16 Thread John Caron

Heres a few comments on this discussion from my POV:

1) to summarize whats already in CF1.6:

section A9.2:

It is strongly recommended that there should be a station variable 
(which may be of any type) with the attribute cf_role=”timeseries_id”, 
whose values uniquely identify the stations.
It is recommended that there should be station variables with 
standard_name attributes station_description, surface_altitude and 
“station_wmo_id” when applicable.


Since surface_altitude already exists, the other two are called out at 
the end:


New standard names to be added to the standard name table
- station_description : variable of character type containing a 
description of a time

series station
- station_wmo_id : variable of character or integer type, containing the 
WMO

identifier of an observing station

(i dont see this last part on the web site at

http://cf-pcmdi.llnl.gov/documents/cf-conventions/1.6

so here is the final version in pdf for reference:

http://www.unidata.ucar.edu/staff/caron/public/CFch9-may10.pdf

note that this is not a draft, but been accepted for 1.6. However, we 
can always amend and extend it for 1.7.)



2) the NetCDF Attribute Convention for Dataset Discovery is at

http://www.unidata.ucar.edu/software/netcdf-java/formats/DataDiscoveryAttConvention.html

but doesnt have anything about stations. it does have a naming 
authority which was intended to create globally unique dataset ids



3) the attribute cf_role=”timeseries_id” has the same effect as a 
standard name. our intention was to start to separate structural 
meatdata vs naming physical quantities via standard names. so 
cf_role=”timeseries_id” indicates a unique identifier for the station.



4) There is an important wrinkle introduced in 1.6 wrt the global vs 
variable attributes. The info for a particular station is associated by 
way of the station dimension, and all variables with just that 
dimension are station variables. The set of variables for a station 
are also associated by various mechanism involving dimensions. So:


1. any metadata intended to describe the station should be a station 
variable or an attribute on a station variable.
2. if the data, for example, came from multiple instruments, you might 
want to annotate the variables with that info, understanding that the 
variable is already associated with a specific station and must be 
consistent.


5) Generally i like the idea of richer metadata for stations and 
platforms etc, and a naming authority is a really good idea. In service 
of Getting Things Done, i would recommend that we agree on something 
that works for human readable metadata, and then start to experiment 
with machine readable versions, eg JSON.


whether the naming authority is part of the name or not is a bit of 
style, but ill say that i like it.


6) So what would be helpful would be to start with the existing new 
things in 1.6:


1) station variable (which may be of any type) with the attribute 
cf_role=”timeseries_id”, whose values uniquely identify the stations.

2) station variable with standard_name station_description
3) station variable with standard_name “station_wmo_id”

and propose clarification and extensions to that. The concrete proposal 
has come from Jeffery, so perhaps he wants to revise it based on 
feedback so far and propose another reversion?



___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


Re: [CF-metadata] Ragged Arrays?

2011-09-16 Thread Jonathan Gregory
Dear Chris

I was hoping that someone else might respond to this, because there was a
discussion not so long ago on this email list about a similar issue.

 I'm working on a format for storing the output from a particle
 tracking model. Once we're happy with it, we'll post here for
 feedback, but for the moment a question:
 
 As many particle tracking models create and destroy particles as the
 run goes on, we have decided that a ragged array representation is
 the way to go. But I'm having trouble figure out the standard way
 to represent ragged arrays.

You refer in your email to the discrete sampling geometries proposal and
I think that should be the way to do it, and I wonder if this is topologically
the same as one of the existing feature_types. The final agreed version of
that proposal, which will be in the next version of CF now being compiled,
is at http://www.unidata.ucar.edu/staff/caron/public/CFch9-feb25_jg.pdf
(see track ticket 37). Does it do what you need?

Cheers

Jonathan
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


Re: [CF-metadata] use of volume_* and *_optical_thickness in variable names

2011-09-16 Thread Jonathan Gregory
Dear Markus

Welcome to CF.

 1) To what degree are qualifications part of the standard_name? In the 
 guidelines for constructing standard_names, qualifications such as _in_air or 
 _due_to_dry_aerosol are separated from the standard name, while they are 
 included with some standard_names in the table. Would a suitable 
 standard_name be 
 surface_volume_scattering_coefficient_at_stp_in_air_due_to_pm1_dry_aerosol 
 or just volume_scattering_coefficient with the qualifications optional? 
 Would the qualifications I used in the example be correct?

The qualifications are part of the standard name. So far, we have decided to
approve each qualified standard name individually, although if they follow
existing patterns it is often easy to reach agreement.

 2) How are the terms _optical_depth and _optical_thickness used? I know there 
 are deep ideological divides about the correct use of these terms, and I 
 don't plan to re-open any possible previous discussions. I only would like to 
 know how these terms are used in the CF convention. I found the standard name 
 atmosphere_optical_thickness_due_to_ambient_aerosol in the standard_name 
 table, and the explanation The optical thickness is the integral along the 
 path of radiation of a volume scattering/absorption/attenuation coefficient. 
 Is this path meant as the slant path pointing, e.g., from the surface at the 
 sun, or along the vertical axis from the surface through the atmosphere?

I don't recall a previous discussion about this. Perhaps someone with relevant
expertise could comment. It might be that more precise names are needed if you
wish to draw the distinction.

 3) I found the standard_name 
 volume_extinction_coefficient_in_air_due_to_ambient_aerosol in the table. 
 In what sense qualifying is the term volume_, i.e. how would the 
 volume_extinction_coefficient be different from the 
 extinction_coefficient?

If I remember correctly, the terminology volume_*_coefficient was adopted
for those with units of m-1, in contrast with those for scattering by a
surface, which are dimensionless, and volume_*_function for those in m-1 sr-1.

Best wishes

Jonathan
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


Re: [CF-metadata] Request for a region standard_name

2011-09-16 Thread Jim Biard

Jonathan,

I'm fine with contiguous_united_states_of_america or 
contiguous_united_states.


Jim

On 9/16/2011 11:54 AM, Jonathan Gregory wrote:

Dear Jim

Following Don's comment, if you'd be happy with a standard region name of
contiguous_united_states, and since no-one else has objected, I think we
should agree to that. (We can omit of_america; I would expect people to know
where the united_kingdom is without of_great_britain_and_northern_ireland!)
I expect Alison will comment and/or add it to the table when she is able to.

Cheers

Jonathan
___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata


--
Jim Biard

Government Contractor, STG Inc.
Remote Sensing and Applications Division (RSAD)
National Climatic Data Center
151 Patton Ave.
Asheville, NC 28801-5001

jim.bi...@noaa.gov
828-271-4900

___
CF-metadata mailing list
CF-metadata@cgd.ucar.edu
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata