If you approached this from a generic computer-science standpoint, it would seem natural to create a record structure at the finest spatial resolution. At each point in this structure, you would define a record such as:

    type Data_Record is record
       longitude     : double;
       latitude      : double;
       SST           : double;
       Chlorophyll_A : double;
       SLA           : double;
       Ekman_Pumping : double;
       ...
    end record;

If the data are not at the same resolution, then it will be necessary to interpolate between the measurements. Perhaps the simplest approach in that case would be to create a data structure for each field, provide pointers to the measured values, and then, for each point in the grid, assign to each measured value a weight that yields the interpolated value when the weighted average is taken.

This two-paragraph discussion lacks details, but it might be generalized to produce what you want.
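
A minimal sketch of that per-field weighting structure might look like the following (written in Java given the SIS context of the thread, using Java 16 records for brevity; all type names are illustrative and not taken from any existing library):

    import java.util.List;

    /** One measured value of a single field (e.g. SST) at a known position. */
    record Measurement(double longitude, double latitude, double value) {}

    /** A pointer to one measurement together with its interpolation weight. */
    record WeightedSample(Measurement source, double weight) {}

    /** One point of the finest-resolution grid, for a single field. */
    record GridPoint(double longitude, double latitude, List<WeightedSample> samples) {

        /** Weighted average of the contributing measurements. */
        double interpolatedValue() {
            double sum = 0.0, weightSum = 0.0;
            for (WeightedSample s : samples) {
                sum += s.weight() * s.source().value();
                weightSum += s.weight();
            }
            return weightSum == 0.0 ? Double.NaN : sum / weightSum;
        }
    }

The weights could come from any scheme (inverse distance, bilinear, etc.); the point is only that each grid point keeps references to the measurements that contribute to it.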

Bruce R. Barkstrom

On Fri, Apr 19, 2013 at 5:09 AM, Martin Desruisseaux <[email protected]> wrote:

> Hello Amila
>
> On 18/04/13 20:47, AMILA RANATUNGA wrote:
>
>> Thank you very much for the replies so far, here and on the SIS dev list
>> as well. They were really helpful. We are currently writing a research
>> paper on what kind of model should be used when building a geoscience
>> gateway. We intend to discuss the issues that a geoscientist faces during
>> his/her research and the features that should be added to such a gateway
>> to overcome them. Then we can consider them during our main project as
>> well. It would be appreciated if you could point us to any research papers
>> or resources that look at this domain through the eyes of a geoscientist.
>> Even a good case study would be really helpful.
>
> There is a chapter about this in my Ph.D. thesis, which I wrote 10 years
> ago [1], but it is in French... However, here are some points:
>
> Before starting to code open source software, I was trained on ERDAS
> Imagine. At that time, raster data could be either measurements (e.g.
> altitude in metres) or categories (land, forest, lake, etc.). However, for
> my work in oceanography I needed a mix of both in the same raster: Sea
> Surface Temperature (SST) measurements, together with some NaN
> (Not-a-Number) values indicating that a pixel was a cloud, land, etc. The
> software of 10 years ago did not allow that.
>
> My study correlated data from remote sensing images with fisheries data.
> From an OGC perspective, this is equivalent to getting WCS and sensor web
> services to work together. Raster and sensor data are two very different
> kinds of data, and work of the kind "I want all temperature data at the
> location and time of each sensor observation, and also all temperature
> data 10 days before the time of each sensor observation" was needed.
>
> On the remote sensing side, my study used four different kinds of data:
> Sea Surface Temperature (SST), chlorophyll-a concentration, Sea Level
> Anomaly (SLA) and Ekman pumping. Each kind of data has very different
> characteristics in terms of spatial and temporal coverage, resolution and
> format. Handling such heterogeneous sources of data was a challenge.
> Indeed, in my review of previous work, I saw many studies correlating fish
> populations with temperature, or correlating fish populations with
> chlorophyll, but I found no study correlating fish populations with many
> parameters taken together (e.g. some condition on temperature at the same
> time as some concentration of chlorophyll-a). Doing such a combined study
> was a big development effort. However, that was 10 years ago; I'm sure the
> situation is different now.
>
> For each time and location of a sensor observation, I needed to
> interpolate the temperature, chlorophyll-a, etc. measurements from the
> raster data at the sensor time, 5 days before, 10 days before, etc.;
> compute on the fly some derived quantities such as the gradient of
> temperature (i.e. apply the Sobel operator to rasters of SST data), again
> 0 days, 5 days, 10 days, etc. before; and handle the case of missing (NaN)
> values (e.g. if I got NaN when interpolating a value with bicubic
> interpolation, try again with bilinear interpolation, since it uses less
> data and thus reduces the risk of getting NaN). So having software that
> did the work automatically was crucial.
>
> Another way to explain the above paragraph would be to say that for each
> sensor, we create many (potentially hundreds of) "virtual sensors" derived
> from remote sensing data. For example, if you had a sensor measuring the
> temperature inside your car, it is like attaching "virtual sensors" to the
> real sensor, where the virtual sensors behave like any real sensor but use
> data from the remote sensing images. Of course we have to take into
> account that the car is moving, so the pixel requested from the remote
> sensing images is always changing.
>
> The amount of data produced in the above step was huge. Some statistical
> tools were needed for evaluating the coefficients of correlation between
> the above "virtual sensors" and the real ones, so we could trim the
> "virtual sensors" that did not seem relevant to our study. Again, because
> of the amount of data, automation is key.
>
> Not sure if it is of any help...
>
> Martin
>
> [1] http://horizon.documentation.ird.fr/exl-doc/pleins_textes/divers09-08/010035115.pdf
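
The interpolation fallback Martin describes above (try bicubic first, retry with bilinear when the wider neighbourhood hits a NaN cloud/land pixel) could be sketched roughly as follows; the Raster and Interpolation types are hypothetical placeholders, not an existing API:

    /** Hypothetical raster abstraction: interpolate() returns NaN when the
     *  interpolation neighbourhood contains NaN (cloud/land) pixels. */
    interface Raster {
        double interpolate(double x, double y, Interpolation method);
    }

    /** Declaration order is also the fallback order. */
    enum Interpolation { BICUBIC, BILINEAR }

    final class Sampling {
        /** Sample a raster at a sensor position, degrading the interpolation
         *  method when the wider stencil hits missing data. */
        static double sampleWithFallback(Raster raster, double x, double y) {
            for (Interpolation method : Interpolation.values()) {
                double value = raster.interpolate(x, y, method);
                if (!Double.isNaN(value)) {
                    return value;   // first method whose neighbourhood was cloud-free
                }
            }
            return Double.NaN;      // no usable data at this position
        }
    }

The same call would then be repeated at each temporal lag (0, 5, 10 days before the sensor time), and the Sobel-derived gradient rasters would simply be additional Raster instances computed on the fly.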

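
The trimming step at the end of Martin's message, keeping only the "virtual sensors" whose series correlate with the real sensor, amounts to something like the following sketch; the class, method names and the 0.3 threshold are purely illustrative:

    /** Editorial sketch of the screening step; not from any statistics library. */
    final class VirtualSensorScreening {

        /** Pearson correlation coefficient between two equal-length series. */
        static double pearson(double[] x, double[] y) {
            int n = x.length;
            double sx = 0, sy = 0, sxx = 0, syy = 0, sxy = 0;
            for (int i = 0; i < n; i++) {
                sx  += x[i];        sy  += y[i];
                sxx += x[i] * x[i]; syy += y[i] * y[i];
                sxy += x[i] * y[i];
            }
            double cov = sxy - sx * sy / n;
            return cov / Math.sqrt((sxx - sx * sx / n) * (syy - sy * sy / n));
        }

        /** Keep a virtual sensor only if its series correlates with the real one. */
        static boolean keep(double[] realSeries, double[] virtualSeries) {
            return Math.abs(pearson(realSeries, virtualSeries)) >= 0.3;  // illustrative threshold
        }
    }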