Martin Desruisseaux created SIS-418:
---------------------------------------
Summary: Decompose netCDF keywords attribute value
Key: SIS-418
URL: https://issues.apache.org/jira/browse/SIS-418
Project: Spatial Information Systems
Issue Type: Improvement
Components: Storage
Affects Versions: 0.8, 0.7, 0.6, 0.5, 0.4, 0.3
Reporter: Martin Desruisseaux

The {{keywords}} attribute in netCDF files sometimes contains a value like the one below:

{noformat}
:keywords = "EARTH SCIENCE > Oceans > Ocean Temperature > Sea Surface Temperature" ;
:keywords_vocabulary = "GCMD Science Keywords" ;
{noformat}

Currently, {{NetcdfMetadata}} stores the whole string in a single ISO 19115 {{Keyword}} value, whereas it should be split into several keywords. The GCMD keywords structure is described on the NASA [Global Change Master Directory (GCMD) Keywords|https://earthdata.nasa.gov/about/gcmd/global-change-master-directory-gcmd-keywords] web page. It uses the following hierarchical structure:

{quote}Category > Topic > Term > Variable > Detailed Variable{quote}

We may interpret the structure as below:

* We support only {{"EARTH_SCIENCE"}} (other possible values are {{"CONTINENT"}}, {{"Earth Remote Sensing Instruments"}}, _etc._)
* {{KeywordType.DISCIPLINE}} (note: would be wrong if the category above were {{"CONTINENT"}}, for example).
* {{KeywordType.THEME}} (note: would be {{KeywordType.INSTRUMENT}} if the category above were _"Earth Remote Sensing Instruments"_ rather than _"Earth Science"_).
* {{KeywordType.SUB_TOPIC_CATEGORY}}

We may also provide a {{KeywordClass}} with a reference to GCMD as an ontology.
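
The decomposition proposed above could be sketched roughly as follows. This is only an illustration, not the {{NetcdfMetadata}} code: the class {{GcmdKeywords}}, its {{Entry}} holder and its {{Type}} enum are hypothetical stand-ins (the enum merely mirrors the three {{KeywordType}} names listed above). The sketch assumes that the bullets map in order to the Category, Topic, Term and Variable levels, and that a "Detailed Variable" level, for which no type is proposed, is left untyped. It also assumes that a value with an unsupported category is kept as a single keyword, as is done today.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; not the NetcdfMetadata implementation. */
public class GcmdKeywords {
    /** Hypothetical stand-in for the three ISO 19115 KeywordType values named in the issue. */
    public enum Type { DISCIPLINE, THEME, SUB_TOPIC_CATEGORY }

    /** One decomposed keyword; type is null when no type was proposed for its level. */
    public static final class Entry {
        public final String value;
        public final Type type;
        Entry(String value, Type type) { this.value = value; this.type = type; }
    }

    /** Assumed mapping for the levels following Category: Topic, Term, Variable. */
    private static final Type[] LEVELS = { Type.DISCIPLINE, Type.THEME, Type.SUB_TOPIC_CATEGORY };

    /** Splits a "Category > Topic > Term > ..." string into typed keywords. */
    public static List<Entry> split(String keywords) {
        String[] parts = keywords.trim().split("\\s*>\\s*");
        List<Entry> result = new ArrayList<>();
        // Only the "EARTH SCIENCE" category is handled (underscore spelling accepted too);
        // any other value is kept as one untyped keyword.
        if (parts.length < 2 || !parts[0].replace('_', ' ').equalsIgnoreCase("EARTH SCIENCE")) {
            result.add(new Entry(keywords.trim(), null));
            return result;
        }
        // Skip the category itself and assign a type to each following level.
        for (int i = 1; i < parts.length; i++) {
            Type type = (i <= LEVELS.length) ? LEVELS[i - 1] : null;
            result.add(new Entry(parts[i], type));
        }
        return result;
    }

    public static void main(String[] args) {
        String value = "EARTH SCIENCE > Oceans > Ocean Temperature > Sea Surface Temperature";
        for (Entry e : split(value)) {
            System.out.println(e.type + ": " + e.value);
        }
    }
}
```

For the example attribute above, this yields "Oceans" as a discipline, "Ocean Temperature" as a theme and "Sea Surface Temperature" as a sub-topic category. Whether the category itself should also be kept as a keyword is left open here.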