Re: Semantics of rdfs:seeAlso (Was: Is it best practices to use a rdfs:seeAlso link to a potentially multimegabyte PDF?)

Kingsley Idehen Thu, 13 Jan 2011 10:07:17 -0800

On 1/13/11 12:04 PM, Nathan wrote:

Hi Kinglsey,
Kingsley Idehen wrote:
When our engine describes entities it can publish these descriptionsusing variety of structured data formats that include RDF. The samething applies on the data consumption side. Basically, RDF formatsare options re. Linked Data (the concept).
A generic problem here, when using non RDF types with Linked Data overHTTP, is that there's currently no way to indicate that a resourceis/has a set of machine readable "linked data" variants, in many casesit is useful to publish and consume with linked data in CSV format andrelated (as you well note) - but without prior out of band knowledgethat the representation contains, or is, linked data, the machines arepretty much screwed. Typically the RDF variants don't have thisproblem because the media type sets the expectation, so you can connegon an RDF type and know your getting back "linked data", you can't dothis with CSV and related with any expectation that you'll get back"linked data" - thus, if there was some way to mark the set ofrepresentations given upon dereferencing a URI as linked data,containing rdf, rdfable 3 tuples, or a view thereof, it'd be a lotfriendlier to the web of data in general.


So what happens to RDFa in (X)HTML? Even worse, no DOCTYPE declarations?
What about various JSON dialects for Linked Data graphs?
How about N-Triples? Ditto TriX and others?

In my world view I see realities such as:

1. Spreadsheet and other desktop productivity users opening up a URL(directly or indirectly via WebDAV mounted to filesystem) -- this is amassive realm for Linked Data exploitation

2. Starting FYN (follow-your-nose) patterns in ODE, Sponger etc.. thatmight start from an RDF resource but eventually encounter resources thataren't RDF based.


Thus, I believe we have to consider:

1. Client side heuristics on the parts of Linked Data apps that dealwith data format heterogeneity atop underlying S-P-O / E-A-V homogeneityre. propositions embedded in data.

A typical approach would be to register new mediatypes, +variantkinds, for instance text/rdf+csv or such like, but these typeswouldn't be well known throughout the internet, served correctly bydefault in the likes of apache, or handed off to the correct consumingprograms by user agents - I'll leave it there, without a proposal, butsome indication to the machine would/will be needed to make thisapproach friendlier for the web.

Remember a Linked Data Server can say (via HTTP): all I have is a CSV(or other non RDF format) based representation of the RDF (viamediatype) based Data you requested :-)

If you look closer, we are revisiting the issue of: where does"resource" stop. Is it at the container or content level? In my worldview, the content matters. Yes, mediatypes help, but ultimately we haveto be much more open about the concept of Linked Data. Of course, aclient (e.g. Tabulator) can say: I don't understand what you sent meetc..., which is fine, but it shouldn't be the basis for defining whatLinked Data (the concept) is all about.

Again, I have no problems with RDF based Linked Data as a variation ofthe Linked Data concept. I just want clarity more than anything else.Being provincial about Linked Data (via RDF format specificity) isn'tgoing to increase comprehension and adoption momentum.

and as an aside: I do worry a little that there may be someoverloading of terms going on here, Linked Data (the concept) andLinked Data (the protocol) - I'm unsure exactly how to define LinkedData (the concept) but assuming you're referring to a broad range ofEAV variant 3-Tuple based data with URIs.

The concept of Linked Data is old. Linked Data at InterWeb scalecourtesy of HTTP ubiquity is an immensely valuable (and mega cool!)contemporary spin on an old concept. What else can I say? I guessGoogle's your friend re. historic research on the subject: Linked Data :-)

TimBL (as far as I know) has never claimed to have invented the conceptof Linked Data. He dropped a note (subject: Linked Data) explaining howyou can leverage AWWW as an effective mechanism for producing LinkedData at InterWeb scale.



Kingsley


Best,

Nathan



--

Regards,

Kingsley Idehen 
President&  CEO
OpenLink Software
Web: http://www.openlinksw.com
Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca: kidehen

Re: Semantics of rdfs:seeAlso (Was: Is it best practices to use a rdfs:seeAlso link to a potentially multimegabyte PDF?)

Reply via email to