Re: [Crm-sig] Issue: Solution for Dualism of E41 Appellation and rdfs:label

Martin Doerr Fri, 14 Sep 2018 20:20:04 +0300

Dear Richard,

I'll shorten now:


On 9/14/2018 7:54 PM, Richard Light wrote:

My suggestion is that we define the "has symbolic content" property,and then put our energy into agreeing one or more subproperties ofrdf:value which meet the known recording requirements for culturalheritage information. By doing this, I suggest that we will havesolved the main problem which confronts implementors who want toexpress CRM in RDF.
Yep, subproperty of rdf:value is not bad.
I think the polymorphism we describe here, well studied inobject-oriented languages, is in the nature of Appellations. Theproblem for me is, that the the respective KR models have NOTTHOUGHT of the case that such polymorphisms can occurr.Nevertheless, RDFS is tolerant enough to accept the Superpropertystatement, but not to create a class which is either URI or *inlineexpanded* object.
This polymorphism occurs EXCLUSIVELY for Symbolic Objects withsymbol sets a certain machine supports. Another reason not to userdfs:value, because it does not give credit to the fact that onlySymbolic Objects can have such a "value".
I'm afraid you have lost me here. It would be very helpful to me(and might encourage others to join in the conversation) if youcould post one or two concrete examples of what you mean.
OK, in simple words: there are names which have an identity based ona certain sequence of characters. There are others, historicallyinteresting, which have a phonetic identity, and even that may vary.We collaborate with historians, that deal with family names in theAegean area around 1800, which have no standard spelling at all, noteven a preferred one. The different spelling variants have laterevolved into distinct family names. But in order to match instancesin the documents, we need both concepts of identity.
True, but any instance of the name in a document will only take oneconcrete form, not all of them. (For handwritten sources it may be amatter of judgement what that form actually is.) So you can recordthe form of name it exhibits (as a string), and then assert that it is(in your view) an attestation of the generic family name for which youhave a URI.

This is not true. We do have counterexamples. The name may take multipleforms in the same document.

Even my ancestors used "Derr" instead of "Dörr". Since the localdialect does not distinguish "e" and "ö", it is unclear if it is aspelling variant of the same phonetics or if the "ö" is anetymological misinterpretion, because "Dörr" has a linguistic meaningand the "e" in "Derr" may have another semantic root, but this is notwidely accepted.
So, the names that are not identical to a Literal must be representedusing a URI. That is what I mean by polymorphism. Also, if we want totalk about the name itself as a historical fact, we need a distinctidentity. All these cases are needed but rare for names.
There are perfectly good reasons for considering names to be worthy ofstudy and recording in their own right. I would argue that this isequally true whether the name in question has one, or many, possibleforms. So there is always an argument for minting a URI to representthe name as a Symbolic Object. Doing this allows you to makestatements, for example, about its genesis, its meaning, itshistorical distribution, etc., and means you can record specificinstances of the name as attestations of this Symbolic Object.
However, I would still argue that /instances /of the name should berecorded as strings - the actual value found in the resource in question.

Sure. this is another issue. And they can be multiple...

best,

Martin

For texts, it is the opposite. They are more often in files than inliterals.
On the other side, only Symbolic Objects can "reside" on computersand outside. Therefore the "punning" problem does only occur inconnection to Symbolic Objects. Only these can have a "value" in themachine, whereas rdfs:value may be about anything.
Thanks,

Richard

[1] https://www.w3.org/community/openannotation/
Best,

Martin
Best wishes,

Richard
I agree that we may over-think the point. As I mentioned, thesuperproperty statement I propose has no other effect than that Ican get E41's and labels back by querying P1 only.
Opinions?

Best,

Martin

On 9/12/2018 9:56 AM, Richard Light wrote:
On 11/09/2018 20:02, Martin Doerr wrote:
Dear All,
Firstly, apologies, the RDF was wrong, it was intended to be P1is superproperty of rdfs:label.
I'm not sure that this is something we need to state at all, and Iworry that - if it is included in our RDFS Schema - it may bringunwanted side-effects. Isn't this saying that any instance ofrdfs:label is to be treated as an instance of P1? Bear in mindthat CRM data may co-exist in triple stores in company with otherRDF data, which may well use rdfs:label for its own purposes. This assertion that 'all rdfs:labels are P1 relationships' wouldthen be applied to this other data as well. This might wellresult in incorrect/spurious results when SPARQL queries areapplied to the data.
In general, I suggest that we are ok to definesub-classes/properties of standard RDFS types, but we shouldn'tdefine super-classes/properties of them. (I would welcomecomments on the validity of this suggestion from someone whounderstands RDF better than me.)
Semantically, the range of rdfs:label, when used, isontologically an Appellation in the sense of the CRM.
Agreed (see my reply from yesterday). The conclusion I draw fromthis is that we can validly say:
E1 rdfs:label "string value" is a shortcut for the path 'E1 CRMEntity' 'P1 is identified by' 'E41 Appellation' ...
in exactly the same spirit as the similarly-worded note which wefind in the definition of P1 itself. (Obviously, by using thisshortcut, we lose the information that this string value is anAppellation, but that's the nature of short-cuts.)
I agree with George, that all RDF nodes should have a humanreadable label. They name the thing, even if it is a technical node.I would find it confusing to say, labels are not to be queried,only to be read, and the "real" names must have a URI,
regardless weather I have more to say about it.
I am really not a fan of punning, we definitely forbid it in theCRM.
The point with Appellations is that some, the simple ones, candirectly be represented in the machine, or be outside. Thesolution to assign a URI in all cases, and then a value or label,does not make the world easier. It is extremely bad performance.We talk here about implementation, not about ontology.You get simply a useless explosion of the graph for a purpose oftheoretic purity.
Agreed. What we need to do is to propose a simple way ofexpressing simple Appellations in RDF. That is why my shortcutdefinition above ends with '...': I don't think we have yetdecided how to do this.
I've just been looking over the draft document we are trying towrite, and it currently says that a fully-worked-out path will use'P3 has note -> E62 string' to express the value of an E41Appellation. This (i.e. the suggestion to use P3) comes from thedefinition of the superclass E90 Symbolic Object. A comment inour draft RDF document questions whether this is sufficientlyprecise, since P3 is simply "a container for all informaldescriptions about an object that have not been expressed in termsof CRM constructs". I suggest that we need either to userdfs:value to hold the string value, or (better) to define aCRM-specific subproperty of rdfs:value and use that. (Thissubproperty could be part of the published CRM, or it could justform part of the 'RDF implementation' guidelines.) I don't thinkthat we should use rdfs:label here.
I don't think we should concern ourselves with URLs in our RDFguidance document. Any implementer of our RDF solutions canchoose to assign a URL to represent any node in the structure, butit won't change the logic of the resulting RDF, or how it respondsto SPARQL queries.
Those claiming confusing should be more precise. Has someonelooked at query benchmarks? Has someone looked at graphicalrepresentations of RDF graphs. Do they really look better?
So either we either ignore the issue, and write queries thatcollect names either via P1, URI and a value/label, or via alabel, because this is where names appear in RDF, we make nopunning, but our queries implement exactly this meaning. So, weare not better, but do as if we wouldn't know.
Or, we describe the fact by punning, have one superproperty forall cases, which we can query, and stop thereby the discussion iflabels are allowed or not, and how they relate to appellations.The punning comes in, because the range of the superproperty mustcomprise the ranges of the subproperties. We can play a bit more,make the punning with a superproperty of P1, and have both P1 andrdfs:label subproperties of it, if this is preferred.The solution I describe is just a logical representation of thesituation, not creating a different situation. It just says thatnames can be complex objects or simple literals.
As I said yesterday, I don't see how any punning strategy can makedifferently-structured RDF equivalent for the purposes ofquerying. Therefore, I think we will have to accept that if weallow more than one way of representing a given statement in CRMRDF, we will have to construct queries which look explicitly foreach of the possible patterns.
The problem is, that the RDF literals do have meaning beyondbeing symbol sequences.
Insofar as they have such meaning, I would argue that we define it(i.e. that meaning) by the CRM context in which we place thestring/literal value. I think there is a danger that we couldover-think this problem.
Richard
The punning does not introduce the problem. With or without, thequeries have to cope with names in either form.This holds similarly for space primitives and large geometryfiles, for short texts and equivalent files etc.
Opinions?

Best

Martin
--
*Richard Light*


_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
--------------------------------------------------------------
  Dr. Martin Doerr              |  Vox:+30(2810)391625        |
  Research Director             |  Fax:+30(2810)391638        |
                                |  Email:mar...@ics.forth.gr  |
                                                              |
                Center for Cultural Informatics               |
                Information Systems Laboratory                |
                 Institute of Computer Science                |
    Foundation for Research and Technology - Hellas (FORTH)   |
                                                              |
                N.Plastira 100, Vassilika Vouton,             |
                 GR70013 Heraklion,Crete,Greece               |
                                                              |
              Web-site:http://www.ics.forth.gr/isl            |
--------------------------------------------------------------


_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
*Richard Light*


_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
--------------------------------------------------------------
  Dr. Martin Doerr              |  Vox:+30(2810)391625        |
  Research Director             |  Fax:+30(2810)391638        |
                                |  Email:mar...@ics.forth.gr  |
                                                              |
                Center for Cultural Informatics               |
                Information Systems Laboratory                |
                 Institute of Computer Science                |
    Foundation for Research and Technology - Hellas (FORTH)   |
                                                              |
                N.Plastira 100, Vassilika Vouton,             |
                 GR70013 Heraklion,Crete,Greece               |
                                                              |
              Web-site:http://www.ics.forth.gr/isl            |
--------------------------------------------------------------


_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
*Richard Light*


_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig



--
--------------------------------------------------------------
 Dr. Martin Doerr              |  Vox:+30(2810)391625        |
 Research Director             |  Fax:+30(2810)391638        |
                               |  Email: mar...@ics.forth.gr |
                                                             |
               Center for Cultural Informatics               |
               Information Systems Laboratory                |
                Institute of Computer Science                |
   Foundation for Research and Technology - Hellas (FORTH)   |
                                                             |
               N.Plastira 100, Vassilika Vouton,             |
                GR70013 Heraklion,Crete,Greece               |
                                                             |
             Web-site: http://www.ics.forth.gr/isl           |
--------------------------------------------------------------

Re: [Crm-sig] Issue: Solution for Dualism of E41 Appellation and rdfs:label

Reply via email to