Hi
Just a quick question on this. We develop the model
independently of technology. I can see that this discussion
is getting technical. I currently implement
propositions sets using RDF named graphs because we can and
it works but it is not stipulated. Rob suggests that there
are tech upgrades that might suit this issue better.
However, isn't it the case that we need to be able to
implement in different ways (I don't currently know much
about RDF*) depending on the systems we have? How is RDF*
implemented? - is it backwardly compatible with what we are
all using? Do we give more modelling credence to things
that everyone uses? etc., etc. But aren't these questions
the reason why we are technology independent? Given this,
my question is, - have we got to a stage when the modelling
now depends on a particular technology? Can someone
provide some clarification on this? Which solution is tech
independent? Are they all independent of this tech
discussion? One is at least.
D
On Thu, 11 May 2023 at 16:18, Martin Doerr via Crm-sig
<crm-sig@ics.forth.gr> wrote:
Dear Robert,
We have just created the new issue to discuss this in
detail. We should prepare a detailed analysis, citing
all pros and cons. May be we continue this discussion
better in a subgroup?
Named Graphs are not a very specific technology, if we
take the fact that all current triple stores are
actually implemented as quad stores, regardless whether
they call the construct "Named Graph" or "context". We
have used and implemented this feature, and it is very
performant. It runs on BlazeGraph as well. I think
their is not a simple answer to that. Performance can
become a major issue, when you have really a lot of data.
For the attribution of artists and "style of" vs
"school of" etc. of the collection management system of
the British Museum, the ResearchSpace Project had
created a set of subproperties of P14 carried out by,
which could be used as input for a roles vocabulary.
I did not propose to use Dig as is, but to consider the
construct. The W3C annotation model is very
interesting. We would need a connection to the Creation
Event of making an annotation, and whose opinion it is,
in order to make it CRM compatible. Why not allowing a
Named Graph as target? We should compare the segment
construct of the W3C annotation model with the METS
<area> types and extensions we used. The Dig model was
used to trace provenance of annotated area through
transformations of digital objects. That was very
important for exchanging research insights on 3D
models. To be discussed!
We can extend E13 to Proposition Sets, which would be
very important to describe consistently CRMinf and
generalized observations. That would then be most
effectively implementd via Named Graphs.
Opinions?
Best,
Martin
On 5/11/2023 3:41 PM, Robert Sanderson wrote:
If the intent is that the assertion is in the
discourse, and not a syntactic workaround for .1
properties that would be unnecessary if we had RDF* or
property graphs, then I would say E13 is exactly the
right approach to use. In comparison, I consider the
PC classes to be just that - a syntactic work around
needed in RDF and not part of the discourse. In LInked
Art, in a discussion around uncertain attribution of
artists and "style of" vs "school of", we posited the
need for a property on E13 for this scenario. (Also
the need for .1 on P11 for the same reason as we have
it on P14)
I would say that Dig's annotation is *not* the correct
approach for several reasons:
* Named Graphs are a very specific technology that
have never seen significant uptake and are likely
(IMO) to decrease in usage once RDF* is formalized.
* Dig needs to be updated, and Annotation is (I would
hope) likely to go away ... because ...
* ... it could just use the Web Annotation Data Model:
https://www.w3.org/TR/annotation-model/
(And reification has all the problems discussed in
this thread already)
Rob
On Thu, May 11, 2023 at 7:17 AM George Bruseker via
Crm-sig <crm-sig@ics.forth.gr> wrote:
Dear Martin,
I agree that E13 is a poor man's solution to a
complicated problem. But it is for some, the
solution available. Other solutions like Inf for
documenting historical argumentation and using
named graphs is great as a possibility. Using prov
o to represent the meta discursive level of the
provenance of the dataset as such great. But my
immediate interest was simple the humble ability
of E13 to be able to point to all statements that
can be made with precisely one link in CRM. I'll
keep watching the space!
Cheers,
G
On Thu, May 11, 2023 at 1:25 PM Martin Doerr
<mar...@ics.forth.gr> wrote:
Dear George,
I agree with you below about the historical
aspects. The annotation model has the same
historical aspect, but is not limited to a
single link.
Let us discuss!😁
Best,
Martin
On 5/11/2023 12:41 PM, George Bruseker wrote:
Dear Francesco, Martin,
Again for the record since I seem to be being
read at cross purposes, when I mention the
word 'provenance' I do not mean it in the
sense of dataset provenance (to which prov o
would apply). I mean that in the world to be
described (the real world of tables
charis cats dogs scholars ideas etc.) there
are real world events in which people
attribute things to things (see my
previous email). This is content of the world
to be represented in the semantic graph (not
a metagraph about the graph). This is
describable and is described in CIDOC CRM
using E13 and its friends. If you want to say
that there was a historical situation that
someone in your department said (likely in
the information system) that some attribute
related two things you can do this with E13
(or I have completely misunderstood the CIDOC
CRM). This happens all the time in art
history. One particular often arising case is
an argument about who played what role in
some object. Was Davinci the painter or was
it Simon? This is just a hum drum case of
needing to apply CIDOC CRM to real cases.
Since E13 is a mechanism for so doing on all
other statements, it would be a logical
continuation that it could be used also on .1
statements. But for technical reasons it
cannot, that is why I suggested a mild
technical solution that makes the technical
extension logically coherent. It is in this
sense that I mean provenance and not in the
metasense of the provenance of the data qua
data, also an exciting but other issue to my
mind.
Cheers,
George
On Thu, May 11, 2023 at 12:27 PM Martin Doerr
via Crm-sig <crm-sig@ics.forth.gr> wrote:
Dear Francesco,
This is an excellent paper.
I cite: "However, reification has no
formal semantics, and leads to a high
increase in the number of triples, hence,
it does not scale well. "
I agree with your proposals. Prov-O
mapping is a must for CRM-SIG.
Best,
Martin
On 5/10/2023 11:55 PM, Francesco Beretta
via Crm-sig wrote:
Dear Martin, George, All,
I would not dare to suggest some
solution of this complex issue but let
me hint to a couple of useful papers
(among many others):
Sikos, Leslie F., and Dean Philp,
‘Provenance-Aware Knowledge
Representation: A Survey of Data Models
and Contextualized Knowledge Graphs’,
/Data Science and Engineering/, 5.3
(2020), 293–316
<https://doi.org/10.1007/s41019-020-00118-0>
Hernández, Daniel, Aidan Hogan, and
Markus Krötzsch, ‘Reifying RDF: What
Works Well With Wikidata?’, in
/Proceedings of the 11th International
Workshop on Scalable Semantic Web
Knowledge Base Systems Co-Located with
14th International Semantic Web
Conference (ISWC 2015), Bethlehem, PA,
USA, October 11, 2015./, 2015, pp. 32–47
<http://ceur-ws.org/Vol-1457/SSWS2015_paper3.pdf>
Once again, I would like to suggest
carefully distinguishing between the CRM
domain of discourse, in which the E13
class is conceptualized, and the issue
of stating the provenance of the
information modelled in the discourse
domain, including instances of class E13
as part of the modelled domain.
For this last task (or domain of
discourse), it would seems reasonable
and in line with best practices to use
the PROV model and the corresponding
PROV-O ontology, a W3C recommendation.
Or providing a specific extension of the
CRM, compatible and aligned with the
PROV model. But using PROV-O seems a
good choice in order to facilitate
interoperability.
There remains the more fundamental
question of whether the current debate
about RDF implementation is not in fact
indicative of a more fundamental problem
related to properties of properties and
the implicit and richer information they
contain, which cannot be adequately
expressed in RDF without conceptualising
them in terms of actual classes. Aren't
these rather hybrid P(roperty)C(lasses),
especially if they should be declared as
subclasses of E1, to be considered as
/de facto/ classes and not just
properties? Because if they are just
statements, then adopting one or the
other form of existing RDF reifications
practices seems to be the good way to go.
Best
Francesco
Le 10.05.23 à 18:48, Martin Doerr via
Crm-sig a écrit :
Dear All,
I suggest to resolve the issue of
referring to the provenance of .1
properties more specifically:
Solution a: Add properties to E13 to
specify a .1 property. This may be more
effective than the double indirection
via PC class instance and 4 links of
the E13 construct.
Solution b: Use RDF reification for
this specific problem via the PC class.
We need to examine in both cases the
inferences we want to maintain about
the base property and its domain and
range, and what the relevant query
construct is.
Personally, I prefer solution c: Use
the annotation model of CRM Dig, which
goes via Named Graphs. This is much
more performant and logically clearer,
because Named Graphs are implemented as
direct references to property
identifier, and maintain a reference
count for each one. This is an
important logic in its own right.
Inferences about the .properties would
work in out ouf of a Named Graph,
whereas the reification may need
additional rules.
The query languages of Quad stores
support them explicitly.
The latest version of 3M supports Named
Graph definitions. This feature should
be tested.
I would rather discourage E13 in the
long term as a means to denote
provenance generally and recommend a
uniform use of Named Graphs. I am aware
that not all RDF encodings support the
feature. I that case we could resort to
reification.
Opinions?
Best,
Martin
On 5/9/2023 10:37 PM, Francesco Beretta
via Crm-sig wrote:
Dear Christian-Emil, All,
For the reasons I detailed in my other
email, I totally agree with your point
of view and would like to raise all
possible caveats to this kind of
mixing up quick and dirty
implementation solutions and
consistent conceptual modelling.
If we need more classes, even on a
provisional and experimental
perspective, I would strongly suggest
to produce them and document them as
such, with stable URIs, and then
refine progressively the ontology and
integrate it into the CRM family. Of
course, a nice place to do this is
ontome.net <http://ontome.net> 😉
Best
Francesco
Le 08.05.23 à 17:36, Christian-Emil
Smith Ore via Crm-sig a écrit :
Also: RDF(S) is an implementation
technology. We can assume that there
exists a implmentation function from
the CRM-FOL to RDF(S), but this may
not be a 1-1 function. Strange
constructs like the PC0(?) may not
have counterparts in CRM-FOL.
Changing the ontology on the bases of
special tricks used in the
implementation may not always be a
good idea, but may inspire us to make
well thought out and consistent
changes in the ontology.
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas
(FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:mar...@ics.forth.gr
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:mar...@ics.forth.gr
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig
--
Rob Sanderson
Senior Director for Digital Cultural Heritage
Yale University
--
------------------------------------
Dr. Martin Doerr
Honorary Head of the
Center for Cultural Informatics
Information Systems Laboratory
Institute of Computer Science
Foundation for Research and Technology - Hellas (FORTH)
N.Plastira 100, Vassilika Vouton,
GR70013 Heraklion,Crete,Greece
Vox:+30(2810)391625
Email:mar...@ics.forth.gr
Web-site:http://www.ics.forth.gr/isl
_______________________________________________
Crm-sig mailing list
Crm-sig@ics.forth.gr
http://lists.ics.forth.gr/mailman/listinfo/crm-sig