Re: [TMO] patient record normalization

Matthias Samwald Sat, 11 Sep 2010 03:07:16 -0700

I guess we should keep in mind that this discussion was (at leastoriginally) not about how units are represented on the Semantic Web, but howthey should be represented for a specific project: the TMO. Differentpeople, projects and communities will have different needs, and we will notbe able to achieve a consensus that will make everyone happy. Therefore, itmight be reasonable to focus on the specific case of TMO -- and maybe someof the consensus we reach there can be generalized to other areas.


David wrote:

the Mars Climate Orbiter was famously lost because one team assumed Metricunits and another team assumed English units

It is silly not to include explicit information about units, but it might beequally silly not to use SI units in a science or technology environment. Iguess it might be easy to say this as a continental European, but non-SIunits should be eradicated from sci/tech data. That might have more impacton interoperability than any standardized vocabularies, mapping algorithmsetc., and it might be simpler to implement in the long run.

However, I see one problem with requiring data providers to convert theirunits to standard units (besides the extra effort involved): in somesettings it might be important to capture the _original_ value and unit ofthe measurement, just for the sake of knowing the original datum. This mighteven be a legal requirement in some clinical settings. In my understanding,the goal of TMO is to be used in translational research, not clinicalpractice, and therefore this will probably not be an issue.


Mark wrote:

It speaks to a conversation that I had with my review committee thismorning about how The Web was built by simply being completely open.Anyone could (can) publish anything in any way they want, so long as theyadhere to the simple rules of HTML. I am very concerned that the SemanticWeb is not learning its lessons from the WWW. We are trying toinstitutionalize everything, and that simply doesn't work (it doesn'tscale!).

I guess the classic web and its tremendous global success is a goodinspiration, but I am not sure about how easily the principles of the webcan be translated into principles of the web of data. The 'anything goes'approach might just shift the problem from the data publishing phase to thedata consumption phase, which could result in the temporary belief of havingsolved the problem.Let me make a bold statement: there is no lack of biomedical RDF dataanymore. In fact, we are now in a situation where the same open dataset isoften RDFized several times by different groups. This growing number ofduplicated efforts is an interesting new development, and I might try todocument and analyze this trend when I find the time.Still, it is far from trivial to actually query these datasets, because oftheir heterogeneity. The answer is not to institutionalize everything, butto simply make RDF publishers better aware of concerns about overabundantheterogeneity and lack of transparency. And it could be a good reason toreduce sources of heterogeneity in a project that is under our control, suchas the TMO.


Cheers,
Matthias Samwald

// DERI Galway, Ireland
// Konrad Lorenz Institute for Evolution and Cognition Research, Austria
// http://samwald.info

Re: [TMO] patient record normalization

Reply via email to