Re: [Crm-sig] Issue: Solution for Dualism of E41 Appellation and rdfs:label

Martin Doerr Wed, 12 Sep 2018 16:55:28 +0300

Dear Richard,

I basically agree with your comments. Specifically however, I indeedwanted to say that the official definition of rdfs:label makes itexactly a subproperty of P1 (or shortcut of it) in any correct use ofRDFS. If we want to mix RDFS models, we should have an opinion abouttheir compatibility. Otherwise, we would have to regard them asalternative that cannot be compared with the CRM.

I am not happy with adding rdfs:label to instances of Appellation,because this would mean it is a name for a name and not the name. Iwould sympathize with George using rdfs:value, if it had the respectivesemantics.

What we need, to my opinion, is a property of Symbolic Object we maycall it "has symbolic content" or "has symbolic content inline" oranything better, which defines that the symbolic content *is identicalto* the Literal, *abstracted *to the "level of symbolic specificity"that the Literal implies and that conforms to the identity condition ofthe Symbolic Object, i.e., characters of a certain script, or whatever.That would make the meaning of the "value" unambiguous.

We may need add another property, such as "is contained in" or sopointing to a URL actually holding an instance of its content, againabstracted to the "level of symbolic specificity" that the file instanceimplies and that conforms to the identity condition of the Symbolic Object.

Whereas the shortcut interpretation is attractive, it is not exactly thesame. Using a shortcut, we say that the intermediate node is ofdifferent, independent nature from the terminal node. Here, we do notsay "Appellation" is related to something called "Literal". We say "thisAppellation IS itself what is in this Literal". That may or may not be areason to reject this interpretation.

We also have to distinguish Appellations and other Symbolic Objectswhich have multiple symbolic forms, i.e. spelling variants, versionsetc., from those *being one* symbolic form. The rdfs:value has no meansto express that. I believe we need yet another property "has symboliccontent variant". In that case, the URI is necessary, to my opinion.

I think the polymorphism we describe here, well studied inobject-oriented languages, is in the nature of Appellations. The problemfor me is, that the the respective KR models have NOT THOUGHT of thecase that such polymorphisms can occurr. Nevertheless, RDFS is tolerantenough to accept the Superproperty statement, but not to create a classwhich is either URI or *inline expanded* object.

This polymorphism occurs EXCLUSIVELY for Symbolic Objects with symbolsets a certain machine supports. Another reason not to use rdfs:value,because it does not give credit to the fact that only Symbolic Objectscan have such a "value".

I agree that we may over-think the point. As I mentioned, thesuperproperty statement I propose has no other effect than that I canget E41's and labels back by querying P1 only.


Opinions?

Best,

Martin

On 9/12/2018 9:56 AM, Richard Light wrote:

On 11/09/2018 20:02, Martin Doerr wrote:
Dear All,
Firstly, apologies, the RDF was wrong, it was intended to be P1 issuperproperty of rdfs:label.
I'm not sure that this is something we need to state at all, and Iworry that - if it is included in our RDFS Schema - it may bringunwanted side-effects. Isn't this saying that any instance ofrdfs:label is to be treated as an instance of P1? Bear in mind thatCRM data may co-exist in triple stores in company with other RDF data,which may well use rdfs:label for its own purposes. This assertionthat 'all rdfs:labels are P1 relationships' would then be applied tothis other data as well. This might well result in incorrect/spuriousresults when SPARQL queries are applied to the data.
In general, I suggest that we are ok to define sub-classes/propertiesof standard RDFS types, but we shouldn't definesuper-classes/properties of them. (I would welcome comments on thevalidity of this suggestion from someone who understands RDF betterthan me.)
Semantically, the range of rdfs:label, when used, is ontologically anAppellation in the sense of the CRM.
Agreed (see my reply from yesterday). The conclusion I draw from thisis that we can validly say:
E1 rdfs:label "string value" is a shortcut for the path 'E1 CRMEntity' 'P1 is identified by' 'E41 Appellation' ...
in exactly the same spirit as the similarly-worded note which we findin the definition of P1 itself. (Obviously, by using this shortcut, welose the information that this string value is an Appellation, butthat's the nature of short-cuts.)
I agree with George, that all RDF nodes should have a human readablelabel. They name the thing, even if it is a technical node.I would find it confusing to say, labels are not to be queried, onlyto be read, and the "real" names must have a URI,
regardless weather I have more to say about it.

I am really not a fan of punning, we definitely forbid it in the CRM.
The point with Appellations is that some, the simple ones, candirectly be represented in the machine, or be outside. The solutionto assign a URI in all cases, and then a value or label, does notmake the world easier. It is extremely bad performance. We talk hereabout implementation, not about ontology.You get simply a useless explosion of the graph for a purpose oftheoretic purity.
Agreed. What we need to do is to propose a simple way of expressingsimple Appellations in RDF. That is why my shortcut definition aboveends with '...': I don't think we have yet decided how to do this.
I've just been looking over the draft document we are trying to write,and it currently says that a fully-worked-out path will use 'P3 hasnote -> E62 string' to express the value of an E41 Appellation. This(i.e. the suggestion to use P3) comes from the definition of thesuperclass E90 Symbolic Object. A comment in our draft RDF documentquestions whether this is sufficiently precise, since P3 is simply "acontainer for all informal descriptions about an object that have notbeen expressed in terms of CRM constructs". I suggest that we needeither to use rdfs:value to hold the string value, or (better) todefine a CRM-specific subproperty of rdfs:value and use that. (Thissubproperty could be part of the published CRM, or it could just formpart of the 'RDF implementation' guidelines.) I don't think that weshould use rdfs:label here.
I don't think we should concern ourselves with URLs in our RDFguidance document. Any implementer of our RDF solutions can choose toassign a URL to represent any node in the structure, but it won'tchange the logic of the resulting RDF, or how it responds to SPARQLqueries.
Those claiming confusing should be more precise. Has someone lookedat query benchmarks? Has someone looked at graphical representationsof RDF graphs. Do they really look better?
So either we either ignore the issue, and write queries that collectnames either via P1, URI and a value/label, or via a label, becausethis is where names appear in RDF, we make no punning, but ourqueries implement exactly this meaning. So, we are not better, but doas if we wouldn't know.
Or, we describe the fact by punning, have one superproperty for allcases, which we can query, and stop thereby the discussion if labelsare allowed or not, and how they relate to appellations. The punningcomes in, because the range of the superproperty must comprise theranges of the subproperties. We can play a bit more, make the punningwith a superproperty of P1, and have both P1 and rdfs:labelsubproperties of it, if this is preferred.The solution I describe is just a logical representation of thesituation, not creating a different situation. It just says thatnames can be complex objects or simple literals.
As I said yesterday, I don't see how any punning strategy can makedifferently-structured RDF equivalent for the purposes of querying.Therefore, I think we will have to accept that if we allow more thanone way of representing a given statement in CRM RDF, we will have toconstruct queries which look explicitly for each of the possible patterns.
The problem is, that the RDF literals do have meaning beyond beingsymbol sequences.
Insofar as they have such meaning, I would argue that we define it(i.e. that meaning) by the CRM context in which we place thestring/literal value. I think there is a danger that we couldover-think this problem.
Richard
The punning does not introduce the problem. With or without, thequeries have to cope with names in either form.This holds similarly for space primitives and large geometry files,for short texts and equivalent files etc.
Opinions?

Best

Martin
--
*Richard Light*


_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig



--
--------------------------------------------------------------
 Dr. Martin Doerr              |  Vox:+30(2810)391625        |
 Research Director             |  Fax:+30(2810)391638        |
                               |  Email: mar...@ics.forth.gr |
                                                             |
               Center for Cultural Informatics               |
               Information Systems Laboratory                |
                Institute of Computer Science                |
   Foundation for Research and Technology - Hellas (FORTH)   |
                                                             |
               N.Plastira 100, Vassilika Vouton,             |
                GR70013 Heraklion,Crete,Greece               |
                                                             |
             Web-site: http://www.ics.forth.gr/isl           |
--------------------------------------------------------------

Re: [Crm-sig] Issue: Solution for Dualism of E41 Appellation and rdfs:label

Reply via email to