Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Sat, 2011-06-18 at 23:05 -0500, Pat Hayes wrote: Really (sorry to keep raining on the parade, but) it is not as simple as this. Look, it is indeed easy to not bother distinguishing male from female dogs. One simply talks of dogs without mentioning gender, and there is a lot that can be said about dogs without getting into that second topic. But confusing web pages, or documents more generally, with the things the documents are about, now that does matter a lot more, simply because it is virtually impossible to say *anything* about documents-or-things without immediately being clear which of them - documents or things - one is talking about. And there is a good reason why this particular confusion is so destructive. Unlike the dogs-vs-bitches case, the difference between the document and its topic, the thing, is that one is ABOUT the other. This is not simply a matter of ignoring some potentially relevant information (the gender of the dog) because one is temporarily not concerned with it: it is two different ways of using the very names that are the fabric of the descriptive representations themselves. It confuses language with language use, confuses language with meta-language. It is like saying giraffe has seven letters rather than giraffe has seven letters. I don't think that analogy holds. I don't think this is any sort of meta-language confusion. I agree that (for many applications) documents are more semantically distant from dogs than female dogs are from male dogs, but I see that as merely a difference of degree -- and one that is application dependent -- and not of kind. Semantically, they are all just relations, and some of these relations are important to some applications, and others to others: :x ex:isPrimarySubjectOf :d . :y ex:isSecondarySubjectOf :d . :m ex:isOppositeSexOf :f . To my mind, those all look pretty similar in nature. Maybe this does not break Web architecture, but it certainly breaks **semantic** architecture. I don't think that's true. But I think my comments will get a bit deeper into semantic web architectural issues than will interest most LOD readers, so I've moved my explanation to the AWWSW list instead: http://lists.w3.org/Archives/Public/public-awwsw/2011Jun/0006.html [ . . . ] So far, http-range-14 is the only viable suggestion I have seen for how to do this. If anyone has a better one, let us discuss it. But just blandly assuming that it will all come out in the wash is a bad idea. It won't. I agree with both of these sentiments though. -- David Booth, Ph.D. http://dbooth.org/ Opinions expressed herein are those of the author and do not necessarily reflect those of his employer.
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 20 Jun 2011, at 02:48, Melvin Carvalho wrote: On 19 June 2011 20:42, Henry Story henry.st...@bblfish.net wrote: On 19 Jun 2011, at 20:15, Danny Ayers wrote: Only personal Henry, but have you tried the Myers-Briggs thing - I think you used to be classic INTP/INTF - but once you got WebID in your sails it's very different. These things don't really allow for change. Is there a page where I can find this out in one click? Looks like those pages ask all kinds of questions that require detailed and complicated answers. I am surprised anyone ever answers those things. It's certainly more complex than the Object/Document distinction ;-) Myers Briggs is based on the Jungian analysis of mythology and personality types, with a few additions. Myths being public dreams, and dreams being private myths. I think Jung based a lot of his thinking on the iChing which he mentions a lot and for the most famous german translation of which he even wrote an introduction. As it happens the iChing does require of you just one click. It asks the universe to answer what change you are undergoing. http://wengu.tartarie.com/wg/wengu.php?no=0l=Yijing So as opposed to personality analysis which is fixed apparently, this is easy to use - though quite complex and open to interpretation. The personality types are the lens from which we interpret the inner and outer universal symbols. e.g. Intuitively / Analytically / Senses / Feeling. But the symbols themselves are often the more fascinating parts. yes, the inner/outer distinction also comes from the iChing. The upper trigram represents the outer, the lower trigram represents the inner. Btw, these trigrams even appear in the UTF-8 symbols. An interesting parallel here is the relation to Jung's archetypes of the unconscious and WebID. Both in your dreams, and in mythology, you have symbols where are metaphors that reference some universal concept. WebID is of course a reference to the self ( foaf : Person ). Perhaps to the self that is only as much as the strength of his links to others. As many of the myths we live with today are 100s of years out of date, and people are searching for something new, perhaps WebID can become a modern symbol, to determine or even evangelize the new personality type of society, post information revolution :) I see it more as a tool that by tying in people into the network will grow the network of semantic web supporters and evangelisers. Perhaps it can become mythical. The URL should be by now :-) Henry Only slightly off-topic, very relevant here, need to pin down WebID in a sense my dogs can understand. Ok. So you need to give each of your dogs and cats a webid enabled RDFID chip that can publish webids to other animals with similarly equipped chips when they sniff them. From the frequence and length of sniffs you can work out the quality of the relationships. On coming home for food, this data could be uploaded automatically to your web server to their foaf file. These relationships could then be used to allow their pals access to parts of your house. For example good friends of your dog, could get a free meal once a week. You could also use that to tie up friendship with their owners, by the master-of-pet relationships, and give them special ability to tag their pet photos. Masters of my dogs friends could be potential friends. If you get these pieces working right you could set up a business with a strong viral potential, perhaps the strongest on the net. Here to make my point: The Myers-Briggs thing is intuitively rubbish. But with only one or two posts in the ground, it does seem you can extrapolate. On 19 June 2011 19:52, Henry Story henry.st...@bblfish.net wrote: On 19 Jun 2011, at 19:44, Danny Ayers wrote: I am of the view that this has been discussed to death, and that any mailing list that discusses this is short of real things to do. I confess to talking bollocks when I should be coding. yeah, me too. Though now you folks managed to get me interested in this problem! (sigh) Henry Social Web Architect http://bblfish.net/ -- http://danny.ayers.name Social Web Architect http://bblfish.net/ Social Web Architect http://bblfish.net/
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/20/11 8:31 AM, Henry Story wrote: Perhaps it can become mythical. The URL should be by now:-) The URI :-) -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/20/11 10:39 AM, Henry Story wrote: On 20 Jun 2011, at 10:51, Kingsley Idehen wrote: On 6/20/11 8:31 AM, Henry Story wrote: Perhaps it can become mythical. The URL should be by now:-) The URI :-) Perhaps we should write it URi to get a bit of Apple magic. Pronounced your-eye, Neat and implicit ! it does indeed have this you--i tension, which French philosopher of technology Bernard Stiegler, elaborating on work by Gilbert Simondon from the 1960ies (especially his book L'Individuation psychique et collective to appear in translation this summer in English I am told) takes as fundamental to our understanding of the self. There is no individual first and social after or on the side, or vice versa. Individuation is a process of integration of the social via history passed down through oral traditions initially (by memorisation of poems such as the Iliad), through alphabetic writing - the most important technological transformation brought on by the ancient greeks; ancient greece where kids had to go to school to learn to read and write, so they could learn the laws of the city written on the walls of Athens for all to see - and then in the 20th century through radio, and television. This learning one's societies past and learning who one is, is the same process that then allows one to distinguish oneself from the social and within it. Individuation cannot happen without the social background, just as the social can flourish only to the extent that it gives the individual a place to distinguish himself within it. Societies themselves are individualised by how they distinguish themselves from others... Yep! So your WebID indeed identifies you, but within the space of your social and conceptual relations. Amen! WebID is a very powerful demonstration of what makes Linked Data so powerful. It solves real problems that can't be fixed objectively (distributed fashion without central control) in the Information Space dimension. Kingsley Henry -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen Social Web Architect http://bblfish.net/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/20/11 1:48 AM, Melvin Carvalho wrote: Myers Briggs is based on the Jungian analysis of mythology and personality types, with a few additions. Myths being public dreams, and dreams being private myths. The personality types are the lens from which we interpret the inner and outer universal symbols. e.g. Intuitively / Analytically / Senses / Feeling. But the symbols themselves are often the more fascinating parts. An interesting parallel here is the relation to Jung's archetypes of the unconscious and WebID. Both in your dreams, and in mythology, you have symbols where are metaphors that reference some universal concept. WebID is of course a reference to the self ( foaf : Person ). As many of the myths we live with today are 100s of years out of date, and people are searching for something new, perhaps WebID can become a modern symbol, to determine or even evangelize the new personality type of society, post information revolution:) Nicely stated! I knew it would only take a little Star Wars mythology, dimensionality, and WebID to smoke you out :-) -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 20 June 2011 10:51, Kingsley Idehen kide...@openlinksw.com wrote: On 6/20/11 8:31 AM, Henry Story wrote: Perhaps it can become mythical. The URL should be by now:-) The URI :-) The mythical URI, perfect. -- http://danny.ayers.name
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 June 2011 20:42, Henry Story henry.st...@bblfish.net wrote: Ok. So you need to give each of your dogs and cats a webid enabled RDFID chip To inject a little reality: Sashapooch has got an embedded RFID (not yet RDFID!) tag, not sure but I think it became Italian law. Basilhound being a bit older, before this stuff came in, has a (sloppy) tattoo on his tummy, something like LOLU51. I assume the chip in Sasha has a similar string in it. But the idea is great - in the same way QR codes are most useful when they include a URL, putting one in the RFID tag of animals makes a lot of sense. Simple use case: when the critter wanders off, you can easily contact the owner. I know the RFID chips are now really cheap commodities, what I don't know about is the scanners - are they yet affordable enough that you could say include one in a mobile phone? Cheers, Danny. -- http://danny.ayers.name
Re: Fwd: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Point taken, I forget where I am sometimes, will try harder. My apologies. On 19 June 2011 21:06, Nathan nat...@webr3.org wrote: Danny Ayers wrote: I feel very guilty being in threads like this. Shit fuck smarter people than me. Just minor, and I can hardly talk as I swear most often in different settings, but I am a little surprised to see this language around here. I quite like having an arena where these words don't arise in the general conversation. Ack you know what I'm saying - nothing personal, but I'd personally appreciate not seeing them too frequently around here :) Best! -- http://danny.ayers.name
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/20/11 14:01 , Danny Ayers wrote: On 19 June 2011 20:42, Henry Storyhenry.st...@bblfish.net wrote: Ok. So you need to give each of your dogs and cats a webid enabled RDFID chip To inject a little reality: Sashapooch has got an embedded RFID (not yet RDFID!) tag, not sure but I think it became Italian law. Basilhound being a bit older, before this stuff came in, has a (sloppy) tattoo on his tummy, something like LOLU51. I assume the chip in Sasha has a similar string in it. But the idea is great - in the same way QR codes are most useful when they include a URL, putting one in the RFID tag of animals makes a lot of sense. Simple use case: when the critter wanders off, you can easily contact the owner. I know the RFID chips are now really cheap commodities, what I don't know about is the scanners - are they yet affordable enough that you could say include one in a mobile phone? Danny, At least one of my phones (the Nexus S) has one, though not very useful yet; It does react to Dutch public transport chip cards and my Finnish passport with Unknown tag type. Yrjänä Cheers, Danny. -- Mr. Yrjana Rankka| gh...@openlinksw.com Developer, Virtuoso Team | http://www.openlinksw.com | Making Technology Work For You
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Point taken Pat but I have been in the same ring as you for many years, but to progress the Web can't we just take our hands off the wheel, let it go where it wants. (Not that I have any influence, and realistically you neither Pat). I'm now just back from a sabbatical, but right now would probably be a good time to take one. If these big companies do engage on the microdata front, it's great. I'm sure it's been said before, why don't we get pornographers working hard on their metadata on visuals, because they work for Google/Bing whatever. The motivation right now might not be towards Tim's day one goals of sharing some stuff between departments at CERN, but that's irrelevant in the longer term. Getting the the Web as an infrastructure for data seems like a significant step in human evolution. And it's a no-brainer. But getting from where we are to there is tricky. Honestly, I don't care. It'll happen, my remaining lifespan or about 50 on top, there will be another, big, revolution. Society is already so different, just with little mobile phones. /gak I'm no going to speculate, we're heading for a major change. Cheers, Danny. On 19 June 2011 06:05, Pat Hayes pha...@ihmc.us wrote: Really (sorry to keep raining on the parade, but) it is not as simple as this. Look, it is indeed easy to not bother distinguishing male from female dogs. One simply talks of dogs without mentioning gender, and there is a lot that can be said about dogs without getting into that second topic. But confusing web pages, or documents more generally, with the things the documents are about, now that does matter a lot more, simply because it is virtually impossible to say *anything* about documents-or-things without immediately being clear which of them - documents or things - one is talking about. And there is a good reason why this particular confusion is so destructive. Unlike the dogs-vs-bitches case, the difference between the document and its topic, the thing, is that one is ABOUT the other. This is not simply a matter of ignoring some potentially relevant information (the gender of the dog) because one is temporarily not concerned with it: it is two different ways of using the very names that are the fabric of the descriptive representations themselves. It confuses language with language use, confuses language with meta-language. It is like saying giraffe has seven letters rather than giraffe has seven letters. Maybe this does not break Web architecture, but it certainly breaks **semantic** architecture. It completely destroys any semantic coherence we might, in some perhaps impossibly optimistic vision of the future, manage to create within the semantic web. So yes indeed, the Web will go on happily confusing things with documents, partly because the Web really has no actual contact with things at all: it is entirely constructed from documents (in a wide sense). But the SEMANTIC Web will wither and die, or perhaps be still-born, if it cannot find some way to keep use and mention separate and coherent. So far, http-range-14 is the only viable suggestion I have seen for how to do this. If anyone has a better one, let us discuss it. But just blandly assuming that it will all come out in the wash is a bad idea. It won't. Pat On Jun 18, 2011, at 1:51 PM, Danny Ayers wrote: On 17 June 2011 02:46, David Booth da...@dbooth.org wrote: I agree with TimBL that it is *good* to distinguish between web pages and dogs -- and we should encourage folks to do so -- because doing so *does* help applications that need this distinction. But the failure to make this distinction does *not* break the web architecture any more than a failure to distinguish between male dogs and female dogs. Thanks David, a nice summary of the most important point IMHO. Ok, I've been trying to rationalize the case where there is a failure to make the distinction, but that's very much secondary to the fact that nothing really gets broken. Cheers, Danny. http://danny.ayers.name IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola (850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 Jun 2011, at 06:05, Pat Hayes wrote: Really (sorry to keep raining on the parade, but) it is not as simple as this. Look, it is indeed easy to not bother distinguishing male from female dogs. One simply talks of dogs without mentioning gender, and there is a lot that can be said about dogs without getting into that second topic. But confusing web pages, or documents more generally, with the things the documents are about, now that does matter a lot more, simply because it is virtually impossible to say *anything* about documents-or-things without immediately being clear which of them - documents or things - one is talking about. And there is a good reason why this particular confusion is so destructive. Unlike the dogs-vs-bitches case, the difference between the document and its topic, the thing, is that one is ABOUT the other. This is not simply a matter of ignoring some potentially relevant information (the gender of the dog) because one is temporarily not concerned with it: it is two different ways of using the very names that are the fabric of the descriptive representations themselves. It confuses language with language use, confuses language with meta-language. It is like saying giraffe has seven letters rather than giraffe has seven letters. Maybe this does not break Web architecture, but it certainly breaks **semantic** architecture. It completely destroys any semantic coherence we might, in some perhaps impossibly optimistic vision of the future, manage to create within the semantic web. So yes indeed, the Web will go on happily confusing things with documents, partly because the Web really has no actual contact with things at all: it is entirely constructed from documents (in a wide sense). But the SEMANTIC Web will wither and die, or perhaps be still-born, if it cannot find some way to keep use and mention separate and coherent. The way to do this is to build applications where this thing matters. So for example in the social web we could build a slightly more evolved like protocol/ontology, which would be decentralised for one, but would also allow one to distinguish documents, from other parts of documents and things. So one could then say that one wishes to bring people's attention to a well written article on a rape, rather than having to like the rape. Or that one wishes to bring people's attention to the content of an article without having to like the style the article is written in. If such applications take hold, and there is a way the logic of using these applications is made to work where these distinctions become useful and visible to the end user, then there will be millions of vocal supporters of this distinction - which we know exists, which programmers know exists, which pretty much everyone knows exists, but which people new to the semweb web, like the early questioners of the viability of the mouse and the endless debates about that animal, will question because they can't feel in their bones the reality of this thing. So far, http-range-14 is the only viable suggestion I have seen for how to do this. Well hash uris are of course a lot easier to understand. http-range-14 is clearly a solution which is good to know about but that will have an adoption problem. If anyone has a better one, let us discuss it. I am of the view that this has been discussed to death, and that any mailing list that discusses this is short of real things to do. One could argue much more fruitfully on DocumentObject ontologies, and it would be interesting to see where that leads one. But just blandly assuming that it will all come out in the wash is a bad idea. It won't. Well these are logical necessities you are speaking of. So it will come out in the wash. Just like 2+2=4, those who wish to ignore it will loose out in a number of transactions. So the fun thing is that we can find completely coherent ontologies that don't brake the semweb and that would allow Richard Cyganiak to write http://richard.cyganiak.de/ a foaf:Document; dofoaf:name Richard Cyganiak; dc:title: Richard Cyganiak's homepage; dofoaf:knows http://bblfish.net/ . It looks like here that the document has been confused with the object, but in fact the relations are designed so that they indirectly refer to something else. Now it is not clear that this is easier or less confusing to write than pure foaf. But it does make it look like what Danny wants to have is happening, namely that the document refers to the thing too - assuming a document only refers to one thing. But that is already the main problem. Even an image never refers to one thing only. Take a simple image of the eiffel tower: there can be cars in it, there can be birds, mice, rats (ratatouille), and many other creatures jumping around on people's heads. The higher the resolution the more things that picture can be said to refer to. So to know which is the
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/19/11 7:43 AM, Danny Ayers wrote: Point taken Pat but I have been in the same ring as you for many years, but to progress the Web can't we just take our hands off the wheel, let it go where it wants. (Not that I have any influence, and realistically you neither Pat). I'm now just back from a sabbatical, but right now would probably be a good time to take one. If these big companies do engage on the microdata front, it's great. I'm sure it's been said before, why don't we get pornographers working hard on their metadata on visuals, because they work for Google/Bing whatever. The motivation right now might not be towards Tim's day one goals of sharing some stuff between departments at CERN, but that's irrelevant in the longer term. Getting the the Web as an infrastructure for data seems like a significant step in human evolution. And it's a no-brainer. But getting from where we are to there is tricky. Honestly, I don't care. It'll happen, my remaining lifespan or about 50 on top, there will be another, big, revolution. Society is already so different, just with little mobile phones. /gak I'm no going to speculate, we're heading for a major change. Danny, Do you agree with HTTP-range-14 finding or not? My gripe with HTTP-range-14 is all about aesthetic matters re. language and anecdote choices, not the core concept it attempts to articulate. If you clearly state your gripe in similar terms there could be a chance of yourself and Pat actually realizing that you are in agreement. Personally, I've always assumed you clearly groked why Name and Address disambiguation is vital re. Web's data space dimension. I am suspecting that you are saying: we should find ways to co-exist with initiatives (e.g. schema.org) that haven't addressed these matters, just yet etc.. Note: many are grappling with how to construct viable business models from Linked Data, thus in some cases you will have services that look like they don't care about Name and Address disambiguation on the outside, courtesy of their publicly accessible resources, while in reality they understand these matters very well and have put them you use for a while. Remember, a URI doesn't have to be public :-) I think the debate will ultimately be more about getting these big players to share their more powerful URIs with the public via services and apps from communities like this that make the opportunity costs of these big players palpable :-) Kingsley Cheers, Danny. On 19 June 2011 06:05, Pat Hayespha...@ihmc.us wrote: Really (sorry to keep raining on the parade, but) it is not as simple as this. Look, it is indeed easy to not bother distinguishing male from female dogs. One simply talks of dogs without mentioning gender, and there is a lot that can be said about dogs without getting into that second topic. But confusing web pages, or documents more generally, with the things the documents are about, now that does matter a lot more, simply because it is virtually impossible to say *anything* about documents-or-things without immediately being clear which of them - documents or things - one is talking about. And there is a good reason why this particular confusion is so destructive. Unlike the dogs-vs-bitches case, the difference between the document and its topic, the thing, is that one is ABOUT the other. This is not simply a matter of ignoring some potentially relevant information (the gender of the dog) because one is temporarily not concerned with it: it is two different ways of using the very names that are the fabric of the descriptive representations themselves. It confuses language with language use, confuses language with meta-language. It is like saying giraffe has seven letters rather than giraffe has seven letters. Maybe this does not break Web architecture, but it certainly breaks **semantic** architecture. It completely destroys any semantic coherence we might, in some perhaps impossibly optimistic vision of the future, manage to create within the semantic web. So yes indeed, the Web will go on happily confusing things with documents, partly because the Web really has no actual contact with things at all: it is entirely constructed from documents (in a wide sense). But the SEMANTIC Web will wither and die, or perhaps be still-born, if it cannot find some way to keep use and mention separate and coherent. So far, http-range-14 is the only viable suggestion I have seen for how to do this. If anyone has a better one, let us discuss it. But just blandly assuming that it will all come out in the wash is a bad idea. It won't. Pat On Jun 18, 2011, at 1:51 PM, Danny Ayers wrote: On 17 June 2011 02:46, David Boothda...@dbooth.org wrote: I agree with TimBL that it is *good* to distinguish between web pages and dogs -- and we should encourage folks to do so -- because doing so *does* help applications that need this distinction. But the failure to make this distinction does *not*
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Hi Hugh, By the way, as is well-known I think, a lot of people use and therefore must be happy with URIs that are not Range-14 compliant, such as http://www.w3.org/2000/01/rdf-schema . Your general point that there is non-compliant data out there that people are still able to make use of is probably right, but that specific example is compliant - those are all (even the ontology URI) hash-URIs. Dave
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 Jun 2011, at 13:05, Hugh Glaser wrote: A step too far? Hi. I've sort of been waiting for someone to say: I have a system that consumes RDF from the world out there (eg dbpedia), and it would break and be unfixable if the sources didn't do 303 or #. Plenty of people saying they can't express what they want without it. And plenty of people saying they can't write some code that they might not be able to understand some RDF they receive properly. But no actual examples in the wild (at least as far as I can tell in a lot of messages). This might be for quite a few reasons, such as: 1) There are no such consuming systems; 2) The existing consuming systems would not break. Number (1) would be too embarrassing, and is wrong because I have some, so I'll think about number (2). As you point out there are some consuming systems but they are not very distributed: you know ahead of time what you will find there, and so you can adapt your parsing for the few special cases. At that level the XML crowd/JSON crowd are right - rdf does not give you much. In fact it makes it easier to do things wrong. So we should be supporting more RESTful XML that can be GRDDLed with X-SPARQL. The semweb gives you a lot more when things get even more distributed, such as when everyone starts having foaf files on billions of computers. At that point nobody will want to tweak their app for the specific data at one site. Also one will want to be careful of the difference between documents and things, for the same reason I pointed out with the like button in Facebook. So for the moment the errors don't appear, because we are few consumers and few producers, and we can work around mistakes manually on a case by case basis. To get a real linked data application you need: 1- data that is produced in a completely decentralised way 2- data that is linked between those decentralised nodes 3- data that is consumed, and where the consumption has real world effects Number 3 is the recursive feedback piece that will make 1 and 2 come to a point of stability, or meta-stability, as we are dealing with self organising systems here. This can be done with the social web. We need systems where you publishing data means that I can do something, learn something about you, and so on... but without you ever knowing ahead of time what software or services we are using. (( The Twitters and other Web2.0 folks have made their life easy by centralising data publishing and consumption as much as possible. For systems like there is no real communication problem: there is a central dictator and he says what the meaning of the terms go. As things evolve that part even escapes him - the way office document formats escaped M$ - because of the huge number of people and software dependent on the initial meaning produced.)) If I write things out wrong, your software should be able to let me know about it. Just as if we organise to meet but we give each other the wrong address, we will end up missing the meeting. If this were not so then giving out addresses and organising meetings would be a very different exercise. There seem to be some axes in the discussion: publish / consume long/medium term / shorter term ideal / pragmatic Interestingly, we don't seem to have a strong theory / practice axis, which is great. yes, my point has been we need to work on small vocabularies, widely distributed, widely used, to kick start the rest of the system As a publisher, I/we have had to work pretty hard to conform to really quite complex requirements for publishing RDF as Linked Data; not just Range-14, but voiD, sitemaps and various bits and pieces that Kingsley always tells me to do in the RDF. As a consumer, it has been pretty simple: Well guv, thanks for the URI, here's some RDF. It has always been something of a source of angst (if not actual pain) to me that none of the extra work I put into publishing RDF is ever used by me or anyone else, as far as I know. In fact, some of the sites I consume actually don't do things properly - I might have had to change my consuming systems to cope with this, but I don't, because they already cope fine. Why is it not a problem? One obvious reason is that the consuming application is actually looking for specific knowledge about things. And as pointed out above they are not that distributed, and the consequences of things going wrong on a lot of the open data stack is not that big yet. I don't have a consuming system that is considering both lexical and animal subjects, and so confusion does not arise. Also you are probably not putting up reasoners yet. In fact, it is the predicates that tend to distinguish satisfactorily for me (as has been pointed out by some people). Thus, if I get a triple that says the URI that would resolve to my Facebook page foaf:knows the URI that would resolve to your Facebook page, I (my system)
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/19/11 12:05 PM, Hugh Glaser wrote: A step too far? Hi. I've sort of been waiting for someone to say: I have a system that consumes RDF from the world out there (eg dbpedia), and it would break and be unfixable if the sources didn't do 303 or #. Plenty of people saying they can't express what they want without it. And plenty of people saying they can't write some code that they might not be able to understand some RDF they receive properly. But no actual examples in the wild (at least as far as I can tell in a lot of messages). This might be for quite a few reasons, such as: 1) There are no such consuming systems; 2) The existing consuming systems would not break. Number (1) would be too embarrassing, and is wrong because I have some, so I'll think about number (2). There seem to be some axes in the discussion: publish / consume long/medium term / shorter term ideal / pragmatic Interestingly, we don't seem to have a strong theory / practice axis, which is great. As a publisher, I/we have had to work pretty hard to conform to really quite complex requirements for publishing RDF as Linked Data; not just Range-14, but voiD, sitemaps and various bits and pieces that Kingsley always tells me to do in the RDF. As a consumer, it has been pretty simple: Well guv, thanks for the URI, here's some RDF. It has always been something of a source of angst (if not actual pain) to me that none of the extra work I put into publishing RDF is ever used by me or anyone else, as far as I know. Er. we use it :-) The problem with this whole Linked Data thing is that its truly Ninja tech. The killer conductor of value is the LINK. This lethal weapon applies to all dimensions of the Web: 1. Information Space 2. Data Space 3. Knowledge Space. Trouble is, where do we find strong anecdotes for a cross dimensional lethal weapon? I try to use Stars Wars and the FORCE at times, but even that doesn't quite nail what we are dealing with here. Thus, we could take another approach i.e., embrace and extend what we know is anomalous since the AWWW architecture (FORCE) actually lets us do this anyway. In fact, some of the sites I consume actually don't do things properly - I might have had to change my consuming systems to cope with this, but I don't, because they already cope fine. Exactly! You are using the FORCE :-) Why is it not a problem? One obvious reason is that the consuming application is actually looking for specific knowledge about things. I don't have a consuming system that is considering both lexical and animal subjects, and so confusion does not arise. You have a Data Space dimension app. The Information Space dimension doesn't interfere with your world view. This is key in many ways. For instance, imagine if your app was of the Information Space dimension instead, the effect would be very close to what we see today re. those that see Name and Address disambiguation as impractical overkill since nothing breaks in the world they experience. In fact, it is the predicates that tend to distinguish satisfactorily for me (as has been pointed out by some people). Yep! The Data Space realm lets you Describe anything with clarity, and even when unclear, agents can ultimately agree to disagree without obliteration. Thus, if I get a triple that says the URI that would resolve to my Facebook page foaf:knows the URI that would resolve to your Facebook page, I (my system) will happily interpret that as one person (or whatever) foaf:knows the other. I certainly don't want to go and resolve these to find out to what the URIs actually resolve. And if I did, what would I do about it? Ignore it? As you would in code generally, encounter an exception, and decide if you avoid making it a critical fault :-) In fact, as has also been mentioned, you can define domains, ranges and restrictions for as long as you like, but it is quite possible and likely that the users of URIs will continue blissfully unaware of any of this, in exactly the same way that they continue unaware that there might be something ambiguous about the URIs they are using. Yes, when they operate in the Information Space dimension. By the way, as is well-known I think, a lot of people use and therefore must be happy with URIs that are not Range-14 compliant, such as http://www.w3.org/2000/01/rdf-schema . In the Information Space dimension, yes. In that dimension it doesn't matter. When we help people publish, it really is tough to engage them long enough to care about the complex issues, and they often get it wrong - I am engaged with quite a few people who are now publishing serious amounts of interesting RDF where I have contacted them to try to help. The status of the conversations is that they have fixed what they can, and are now thinking (for a long time) about how they might configure their systems to do it properly - but they may never get there. I will still want to use their
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 Jun 2011, at 14:04, Kingsley Idehen wrote: Er. we use it :-) The problem with this whole Linked Data thing is that its truly Ninja tech. The killer conductor of value is the LINK. This lethal weapon applies to all dimensions of the Web: 1. Information Space 2. Data Space 3. Knowledge Space. Trouble is, where do we find strong anecdotes for a cross dimensional lethal weapon? I try to use Stars Wars and the FORCE at times, but even that doesn't quite nail what we are dealing with here. Thus, we could take another approach i.e., embrace and extend what we know is anomalous since the AWWW architecture (FORCE) actually lets us do this anyway. That's a fun way of describing things. But we have to be careful not to hype things too much, or we risk being tied into the 1980 AI hype space, and then nobody will listen anymore. Perhaps a more scientific way to express this is within the language of self-organising systems. There is a lot of research there which is relevant to us. http://en.wikipedia.org/wiki/Self_organising_systems I am a bit new to this area. Any books I must read? Henry Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Thanks Henry. Just to be clear on one point: On 19 Jun 2011, at 12:44, Henry Story wrote: snip / When we help people publish, it really is tough to engage them long enough to care about the complex issues, and they often get it wrong - I am engaged with quite a few people who are now publishing serious amounts of interesting RDF where I have contacted them to try to help. The status of the conversations is that they have fixed what they can, and are now thinking (for a long time) about how they might configure their systems to do it properly - but they may never get there. I will still want to use their RDF. yes, in these case by case scenarios it is easy for you to write special case filters. And we could do the same thing with HTML whenever we browse the web too. But the web had an application: the browser that lead to feedback effects that increased the coherence of the system. snip / But I don't write special case filters - if I did it would not consider it Semantic Web. I simply follow my nose to use the URI (or in fact usually via an owl:sameas in a sameas store), and they work. It all works because my code that consumes the retrieved RDF to build the data enrichment by inference (things like the communities of practice), and things like my fresnel lenses, restrict any ambiguity by looking for the predicates, etc. they care about. RDF can be a long way short of what we want it to be without having to treat it as special cases.
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 12 Jun 2011, at 14:40, Danny Ayers wrote: [snip] Aside from containing a different bunch of bits because of the encoding, sasha-photo.jpg could be a lossy-compressed version of sasha-photo.gif, containing less pixel information yet sharing many characteristics. All ok so far..? If so, from this we can determine that a representation of a resource need not be complete in terms of the information it contains to fulfill the RDF statement and the HTTP contract. A photo and a graph work in essentially the same way. They both set restrictions on possible worlds of which they are true. A photo restricts the number of possible worlds to those that are visually equivalent to the picture taken. A graph is true of all the possible worlds where those relations holds - which is usually infinitely large. In either case the meaning of a graph or document is a set of possible worlds. A set is an object - one can speak of it - but a very different kind of object from what you may think of as what appears in the picture. As such there is indeed a fundamental logical difference between a document and objects in the world. And that also explains why a photo is not clearly about one thing or another - though of course given that it is a restriction on the way things can be, it limits the things the document could be about. As stated in a previous mail, the same photo can be about the eiffel tower, a sunset, a beautiful view of Paris, a vacation experience, a friend that appears in the picture, a murder that was commited at that moment,... The photo remains the same in all those descriptions, and it can be tagged in all those ways, which is why it is good to have names for each of those things that are different from the photo. Each of those should have definite descriptions to help identify the referents from the description. Henry Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 Jun 2011, at 13:04, Kingsley Idehen wrote: On 6/19/11 12:05 PM, Hugh Glaser wrote: A step too far? Hi. I've sort of been waiting for someone to say: I have a system that consumes RDF from the world out there (eg dbpedia), and it would break and be unfixable if the sources didn't do 303 or #. Plenty of people saying they can't express what they want without it. And plenty of people saying they can't write some code that they might not be able to understand some RDF they receive properly. But no actual examples in the wild (at least as far as I can tell in a lot of messages). This might be for quite a few reasons, such as: 1) There are no such consuming systems; 2) The existing consuming systems would not break. Number (1) would be too embarrassing, and is wrong because I have some, so I'll think about number (2). There seem to be some axes in the discussion: publish / consume long/medium term / shorter term ideal / pragmatic Interestingly, we don't seem to have a strong theory / practice axis, which is great. As a publisher, I/we have had to work pretty hard to conform to really quite complex requirements for publishing RDF as Linked Data; not just Range-14, but voiD, sitemaps and various bits and pieces that Kingsley always tells me to do in the RDF. As a consumer, it has been pretty simple: Well guv, thanks for the URI, here's some RDF. It has always been something of a source of angst (if not actual pain) to me that none of the extra work I put into publishing RDF is ever used by me or anyone else, as far as I know. Er. we use it :-) Er, I'm not sure you do :-) You certainly consume it, and a very nice job you do to. But the use is more than generic browsers - it suggest to me that something useful might happen as a result of the consumption (perhaps I learn that I can ask Jim to introduce me to Mary, as he knows her better than anyone else I know). These things are usually called applications, or possibly services. They tend to be reasonably domain-specific, as generic things tend not to be easy to sue, or even fit for purpose for end users. Sorry if I have missed stuff. The problem with this whole Linked Data thing is that its truly Ninja tech. The killer conductor of value is the LINK. This lethal weapon applies to all dimensions of the Web: 1. Information Space 2. Data Space 3. Knowledge Space. Trouble is, where do we find strong anecdotes for a cross dimensional lethal weapon? I try to use Stars Wars and the FORCE at times, but even that doesn't quite nail what we are dealing with here. Thus, we could take another approach i.e., embrace and extend what we know is anomalous since the AWWW architecture (FORCE) actually lets us do this anyway. In fact, some of the sites I consume actually don't do things properly - I might have had to change my consuming systems to cope with this, but I don't, because they already cope fine. Exactly! You are using the FORCE :-) Why is it not a problem? One obvious reason is that the consuming application is actually looking for specific knowledge about things. I don't have a consuming system that is considering both lexical and animal subjects, and so confusion does not arise. You have a Data Space dimension app. The Information Space dimension doesn't interfere with your world view. This is key in many ways. For instance, imagine if your app was of the Information Space dimension instead, the effect would be very close to what we see today re. those that see Name and Address disambiguation as impractical overkill since nothing breaks in the world they experience. In fact, it is the predicates that tend to distinguish satisfactorily for me (as has been pointed out by some people). Yep! The Data Space realm lets you Describe anything with clarity, and even when unclear, agents can ultimately agree to disagree without obliteration. Thus, if I get a triple that says the URI that would resolve to my Facebook page foaf:knows the URI that would resolve to your Facebook page, I (my system) will happily interpret that as one person (or whatever) foaf:knows the other. I certainly don't want to go and resolve these to find out to what the URIs actually resolve. And if I did, what would I do about it? Ignore it? As you would in code generally, encounter an exception, and decide if you avoid making it a critical fault :-) In fact, as has also been mentioned, you can define domains, ranges and restrictions for as long as you like, but it is quite possible and likely that the users of URIs will continue blissfully unaware of any of this, in exactly the same way that they continue unaware that there might be something ambiguous about the URIs they are using. Yes, when they operate in the Information Space dimension. By the way, as is well-known I think, a lot of people use and therefore must be happy with URIs
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
particular confusion is so destructive. Unlike the dogs-vs-bitches case, the difference between the document and its topic, the thing, is that one is ABOUT the other. This is not simply a matter of ignoring some Could it be exactly the other way around? that documents and things described in it are easy to distinguis EXACTLY becouse one is about the other, no one can possibly mess them up/except for idiotic computer algorithms from the 70s that limits themselves to simbolic AI techniques. Otherwise you seem to say that its more difficult to distinguish between a dog and a bitch than it is to distinguish between a dog and a stream of bytes in return to an HTTP request, and that seems a bit funny? look if someone points me at a facebook URL i know its about a person and not about the damn page (which has 2000 ways to change every time that url is resolved anyway. certainly breaks **semantic** architecture. It completely destroys any semantic coherence we might, in some perhaps impossibly optimistic vision of the future, manage to create within the semantic web. So yes indeed, the Web will go on happily confusing things with documents, partly because the Web really has no actual contact with things at all: it is entirely constructed from documents (in a wide sense). But the SEMANTIC Web will wither and die, or perhaps be still-born, if it cannot find some way to keep use and mention separate and coherent. i mean we can go on and tell oursellf we cant possibly write applications that know or understand what facebook URL is about. but dont be surprised as less and less people will be willing to listen as more and more applications (Eg.. all the stuff based on schema.org) pop up never knowing there was this problem... (not in general. of course there is in general, but for their specific use cases) Gio
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Nathan wrote: Henry Story wrote: On 19 Jun 2011, at 18:27, Giovanni Tummarello wrote: but dont be surprised as less and less people will be willing to listen as more and more applications (Eg.. all the stuff based on schema.org) pop up never knowing there was this problem... (not in general. of course there is in general, but for their specific use cases) The question is if schema.org makes the confusion, or if the schemas published there use a DocumentObject ontology where the distinctions are clear but the rule is that object relationships are in fact going via the primary topic of the document. I have not looked at the schema, but it seems that before arguing that they are inconsistent one should see if there is not a consistent interpretation of what they are doing. Sorry, I'm missing something - from what I can see, each document has a number of items, potentially in a hierarchy, and each item is either anonymous, or has an @itemid. Where's the confusion between Document and Primary Subject? Or do you mean from the Schema.org side, where each Type and Property has a dereferencable URI, which currently happens to also eb used for the document describing the Type/Property?
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Absolutely, Pat. Well said. This is really important. Can we please stop the madness of confusing things with documents about them and do what we want to do cleanly and in an efficient way. Tim On 2011-06 -19, at 00:05, Pat Hayes wrote: Really (sorry to keep raining on the parade, but) it is not as simple as this. Look, it is indeed easy to not bother distinguishing male from female dogs. One simply talks of dogs without mentioning gender, and there is a lot that can be said about dogs without getting into that second topic. But confusing web pages, or documents more generally, with the things the documents are about, now that does matter a lot more, simply because it is virtually impossible to say *anything* about documents-or-things without immediately being clear which of them - documents or things - one is talking about. And there is a good reason why this particular confusion is so destructive. Unlike the dogs-vs-bitches case, the difference between the document and its topic, the thing, is that one is ABOUT the other. This is not simply a matter of ignoring some potentially relevant information (the gender of the dog) because one is temporarily not concerned with it: it is two different ways of using the very names that are the fabric of the descriptive representations themselves. It confuses language with language use, confuses language with meta-language. It is like saying giraffe has seven letters rather than giraffe has seven letters. Maybe this does not break Web architecture, but it certainly breaks **semantic** architecture. It completely destroys any semantic coherence we might, in some perhaps impossibly optimistic vision of the future, manage to create within the semantic web. So yes indeed, the Web will go on happily confusing things with documents, partly because the Web really has no actual contact with things at all: it is entirely constructed from documents (in a wide sense). But the SEMANTIC Web will wither and die, or perhaps be still-born, if it cannot find some way to keep use and mention separate and coherent. So far, http-range-14 is the only viable suggestion I have seen for how to do this. If anyone has a better one, let us discuss it. But just blandly assuming that it will all come out in the wash is a bad idea. It won't. Pat
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 Jun 2011, at 18:27, Giovanni Tummarello wrote: but dont be surprised as less and less people will be willing to listen as more and more applications (Eg.. all the stuff based on schema.org) pop up never knowing there was this problem... (not in general. of course there is in general, but for their specific use cases) The question is if schema.org makes the confusion, or if the schemas published there use a DocumentObject ontology where the distinctions are clear but the rule is that object relationships are in fact going via the primary topic of the document. I have not looked at the schema, but it seems that before arguing that they are inconsistent one should see if there is not a consistent interpretation of what they are doing. Henry Gio Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 Jun 2011, at 18:58, Nathan wrote: Nathan wrote: Henry Story wrote: On 19 Jun 2011, at 18:27, Giovanni Tummarello wrote: but dont be surprised as less and less people will be willing to listen as more and more applications (Eg.. all the stuff based on schema.org) pop up never knowing there was this problem... (not in general. of course there is in general, but for their specific use cases) The question is if schema.org makes the confusion, or if the schemas published there use a DocumentObject ontology where the distinctions are clear but the rule is that object relationships are in fact going via the primary topic of the document. I have not looked at the schema, but it seems that before arguing that they are inconsistent one should see if there is not a consistent interpretation of what they are doing. Sorry, I'm missing something - from what I can see, each document has a number of items, potentially in a hierarchy, and each item is either anonymous, or has an @itemid. Where's the confusion between Document and Primary Subject? Or do you mean from the Schema.org side, where each Type and Property has a dereferencable URI, which currently happens to also eb used for the document describing the Type/Property? Well I can't really tell because I don't know what the semantics of those annotations are, or how they function. Without those it is difficult to tell if they have made a mistake. If there is no way of translating what they are doing into a system that does not make the confusion, then one could explore what the cost of that will be to them. If the confusion is strong then there will be limitations in what they can express that way. It will then be a matter of working out what those limitations are and then offering services that allow one to go further than what they are proposing. At the very least the good thing is that they are not bringing the confusion into the RDF space, since they are using their own syntax and ontologies. There may also be an higher way to fix this so that they could return a 20x (x-some new number) which points to the document URL (but returns the representation immediately, a kind of efficient HTTP-range-14 version) So there are a lot of options. Currently their objects are tied to an html document. What are the json crowd going to think? In any case there is a problem of translation that has to be dealt with first. Henry Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 June 2011 12:37, Henry Story henry.st...@bblfish.net wrote: [snip pat] The way to do this is to build applications where this thing matters. So for example in the social web we could build a slightly more evolved like protocol/ontology, which would be decentralised for one, but would also allow one to distinguish documents, from other parts of documents and things. So one could then say that one wishes to bring people's attention to a well written article on a rape, rather than having to like the rape. Or that one wishes to bring people's attention to the content of an article without having to like the style the article is written in. I would have come down on you like a ton of bricks for that Henry, if it wasn't for seeing to-and-fro on Facebook about some Nazi-inspired club (Slimelight, for the record). On FB there is no way to express your sentiments. Like/blow to smithereens. If such applications take hold, and there is a way the logic of using these applications is made to work where these distinctions become useful and visible to the end user, then there will be millions of vocal supporters of this distinction - which we know exists, which programmers know exists, which pretty much everyone knows exists, but which people new to the semweb web, like the early questioners of the viability of the mouse and the endless debates about that animal, will question because they can't feel in their bones the reality of this thing. So far, http-range-14 is the only viable suggestion I have seen for how to do this. Well hash uris are of course a lot easier to understand. http-range-14 is clearly a solution which is good to know about but that will have an adoption problem. I am of the view that this has been discussed to death, and that any mailing list that discusses this is short of real things to do. I confess to talking bollocks when I should be coding. Cheers, Danny. -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 Jun 2011, at 19:44, Danny Ayers wrote: I am of the view that this has been discussed to death, and that any mailing list that discusses this is short of real things to do. I confess to talking bollocks when I should be coding. yeah, me too. Though now you folks managed to get me interested in this problem! (sigh) Henry Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
I thought forever that if we see iniquities we are duty-bound to stand in the way. But that don't seem to change anything. Let the crap rain forth, if you really need to make sense of it the blokes on this list will do it. Activity is GOOD, no matter how idiotic. Decisions made on very different premises than anyone around here would promote. Sorry, I'm of the opinion that the Web approach is the winner. Alas it also seems lowest common denominator. Cheers, Danny. On 19 June 2011 19:36, Henry Story henry.st...@bblfish.net wrote: On 19 Jun 2011, at 18:58, Nathan wrote: Nathan wrote: Henry Story wrote: On 19 Jun 2011, at 18:27, Giovanni Tummarello wrote: but dont be surprised as less and less people will be willing to listen as more and more applications (Eg.. all the stuff based on schema.org) pop up never knowing there was this problem... (not in general. of course there is in general, but for their specific use cases) The question is if schema.org makes the confusion, or if the schemas published there use a DocumentObject ontology where the distinctions are clear but the rule is that object relationships are in fact going via the primary topic of the document. I have not looked at the schema, but it seems that before arguing that they are inconsistent one should see if there is not a consistent interpretation of what they are doing. Sorry, I'm missing something - from what I can see, each document has a number of items, potentially in a hierarchy, and each item is either anonymous, or has an @itemid. Where's the confusion between Document and Primary Subject? Or do you mean from the Schema.org side, where each Type and Property has a dereferencable URI, which currently happens to also eb used for the document describing the Type/Property? Well I can't really tell because I don't know what the semantics of those annotations are, or how they function. Without those it is difficult to tell if they have made a mistake. If there is no way of translating what they are doing into a system that does not make the confusion, then one could explore what the cost of that will be to them. If the confusion is strong then there will be limitations in what they can express that way. It will then be a matter of working out what those limitations are and then offering services that allow one to go further than what they are proposing. At the very least the good thing is that they are not bringing the confusion into the RDF space, since they are using their own syntax and ontologies. There may also be an higher way to fix this so that they could return a 20x (x-some new number) which points to the document URL (but returns the representation immediately, a kind of efficient HTTP-range-14 version) So there are a lot of options. Currently their objects are tied to an html document. What are the json crowd going to think? In any case there is a problem of translation that has to be dealt with first. Henry Social Web Architect http://bblfish.net/ -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Only personal Henry, but have you tried the Myers-Briggs thing - I think you used to be classic INTP/INTF - but once you got WebID in your sails it's very different. These things don't really allow for change. Only slightly off-topic, very relevant here, need to pin down WebID in a sense my dogs can understand. The Myers-Briggs thing is intuitively rubbish. But with only one or two posts in the ground, it does seem you can extrapolate. On 19 June 2011 19:52, Henry Story henry.st...@bblfish.net wrote: On 19 Jun 2011, at 19:44, Danny Ayers wrote: I am of the view that this has been discussed to death, and that any mailing list that discusses this is short of real things to do. I confess to talking bollocks when I should be coding. yeah, me too. Though now you folks managed to get me interested in this problem! (sigh) Henry Social Web Architect http://bblfish.net/ -- http://danny.ayers.name
Re: Fwd: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Danny Ayers wrote: I feel very guilty being in threads like this. Shit fuck smarter people than me. Just minor, and I can hardly talk as I swear most often in different settings, but I am a little surprised to see this language around here. I quite like having an arena where these words don't arise in the general conversation. Ack you know what I'm saying - nothing personal, but I'd personally appreciate not seeing them too frequently around here :) Best!
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/19/11 1:39 PM, Henry Story wrote: On 19 Jun 2011, at 14:04, Kingsley Idehen wrote: Er. we use it :-) The problem with this whole Linked Data thing is that its truly Ninja tech. The killer conductor of value is the LINK. This lethal weapon applies to all dimensions of the Web: 1. Information Space 2. Data Space 3. Knowledge Space. Trouble is, where do we find strong anecdotes for a cross dimensional lethal weapon? I try to use Stars Wars and the FORCE at times, but even that doesn't quite nail what we are dealing with here. Thus, we could take another approach i.e., embrace and extend what we know is anomalous since the AWWW architecture (FORCE) actually lets us do this anyway. That's a fun way of describing things. Fun is one mechanism for stimulating attention an route to unveiling new insights and innovations :-) But we have to be careful not to hype things too much, or we risk being tied into the 1980 AI hype space, and then nobody will listen anymore. I certainly don't have that in mind. The only tweak I would make is: s/Ninja/Jedi, since Star Wars and its underlying mythology remains a great source of anecdotal material to me when I try to explain what's happening across the WWW's many dimensions. Perhaps a more scientific way to express this is within the language of self-organising systems. Every audience might not be scientifically inclined, at least not in an obvious way. Thus, as you can see, there isn't one way. We have to find and accommodate a plethora of narratives and associated anecdotes. There is a lot of research there which is relevant to us. http://en.wikipedia.org/wiki/Self_organising_systems Nice find, that is certainly representative of what's happening. Kingsley I am a bit new to this area. Any books I must read? Henry Social Web Architect http://bblfish.net/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/19/11 2:26 PM, Hugh Glaser wrote: On 19 Jun 2011, at 13:04, Kingsley Idehen wrote: On 6/19/11 12:05 PM, Hugh Glaser wrote: A step too far? Hi. I've sort of been waiting for someone to say: I have a system that consumes RDF from the world out there (eg dbpedia), and it would break and be unfixable if the sources didn't do 303 or #. Plenty of people saying they can't express what they want without it. And plenty of people saying they can't write some code that they might not be able to understand some RDF they receive properly. But no actual examples in the wild (at least as far as I can tell in a lot of messages). This might be for quite a few reasons, such as: 1) There are no such consuming systems; 2) The existing consuming systems would not break. Number (1) would be too embarrassing, and is wrong because I have some, so I'll think about number (2). There seem to be some axes in the discussion: publish / consume long/medium term / shorter term ideal / pragmatic Interestingly, we don't seem to have a strong theory / practice axis, which is great. As a publisher, I/we have had to work pretty hard to conform to really quite complex requirements for publishing RDF as Linked Data; not just Range-14, but voiD, sitemaps and various bits and pieces that Kingsley always tells me to do in the RDF. As a consumer, it has been pretty simple: Well guv, thanks for the URI, here's some RDF. It has always been something of a source of angst (if not actual pain) to me that none of the extra work I put into publishing RDF is ever used by me or anyone else, as far as I know. Er. we use it :-) Er, I'm not sure you do :-) You certainly consume it, and a very nice job you do to. But the use is more than generic browsers - it suggest to me that something useful might happen as a result of the consumption (perhaps I learn that I can ask Jim to introduce me to Mary, as he knows her better than anyone else I know). Yes, and this is coming. Basically, as part of the WebID (powerful Linked Data and FOAF exploitation) Henry, I, and others are working on use of our respective efforts for semantically enhanced friending. We not only handle friending we also handle notifications such that from a single blog post, address book entry, calendar item creation of change etc., notices get progagated, but all of this is driven by a WebID (a personal URI). In addition to all of this, we have WebID based ACLs for powerful resource sharing etc.. We (at OpenLink) have even extended S/MIME with WebID which makes a world of difference re. helping folks regain control of their in-boxes and basically fixing email. These things are usually called applications, or possibly services. Yes, that's the key to the matter. Make apps that make a difference via the standards we promote. Put differently, promote our beliefs via apps that illuminate standards that be believe in and promote. They tend to be reasonably domain-specific, as generic things tend not to be easy to sue, or even fit for purpose for end users. Sorry if I have missed stuff. Address Books, Calendars, Blogs, Discussion Forums, Comments, Pingbacks, In-boxes Drop-boxes, Photo Albums, and Galleries etc.. all benefit immensely from Linked Data, we just need more applications as a few have existed in isolation for a while :-) Re. apps., one of the real problems with Linked Data is that the LINK is the key too everything. That said, when dealing with Apps., most think about UI first, and that's where matters can get confusing real fast i.e., some attempts at visualization utterly compromise Linked Data's essence. Likewise, slapping UI on to Linked Data with illuminating its essence in mind also introduces its own set of problems. [SNIP] -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/19/11 5:56 PM, Nathan wrote: Henry Story wrote: On 19 Jun 2011, at 18:27, Giovanni Tummarello wrote: but dont be surprised as less and less people will be willing to listen as more and more applications (Eg.. all the stuff based on schema.org) pop up never knowing there was this problem... (not in general. of course there is in general, but for their specific use cases) The question is if schema.org makes the confusion, or if the schemas published there use a DocumentObject ontology where the distinctions are clear but the rule is that object relationships are in fact going via the primary topic of the document. I have not looked at the schema, but it seems that before arguing that they are inconsistent one should see if there is not a consistent interpretation of what they are doing. Sorry, I'm missing something - from what I can see, each document has a number of items, potentially in a hierarchy, and each item is either anonymous, or has an @itemid. Where's the confusion between Document and Primary Subject? Put differently, are they conflating things i.e., leaving the beholder to make the distinction outside AWWW. Yes, they are, but purely because this effort is Information Space dimension based :-) Time for a video [1]. Links: 1. http://www.youtube.com/watch?v=JkxieS-6WuA -- imaging the 10th dimension -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/19/11 6:36 PM, Henry Story wrote: On 19 Jun 2011, at 18:58, Nathan wrote: Nathan wrote: Henry Story wrote: On 19 Jun 2011, at 18:27, Giovanni Tummarello wrote: but dont be surprised as less and less people will be willing to listen as more and more applications (Eg.. all the stuff based on schema.org) pop up never knowing there was this problem... (not in general. of course there is in general, but for their specific use cases) The question is if schema.org makes the confusion, or if the schemas published there use a DocumentObject ontology where the distinctions are clear but the rule is that object relationships are in fact going via the primary topic of the document. I have not looked at the schema, but it seems that before arguing that they are inconsistent one should see if there is not a consistent interpretation of what they are doing. Sorry, I'm missing something - from what I can see, each document has a number of items, potentially in a hierarchy, and each item is either anonymous, or has an @itemid. Where's the confusion between Document and Primary Subject? Or do you mean from the Schema.org side, where each Type and Property has a dereferencable URI, which currently happens to also eb used for the document describing the Type/Property? Well I can't really tell because I don't know what the semantics of those annotations are, or how they function. Without those it is difficult to tell if they have made a mistake. If there is no way of translating what they are doing into a system that does not make the confusion, then one could explore what the cost of that will be to them. If the confusion is strong then there will be limitations in what they can express that way. It will then be a matter of working out what those limitations are and then offering services that allow one to go further than what they are proposing. At the very least the good thing is that they are not bringing the confusion into the RDF space, since they are using their own syntax and ontologies. There may also be an higher way to fix this so that they could return a 20x (x-some new number) which points to the document URL (but returns the representation immediately, a kind of efficient HTTP-range-14 version) So there are a lot of options. Currently their objects are tied to an html document. What are the json crowd going to think? Microdata as espoused by schema.org, via actual Microdata spec, includes a rules for making JSON representations. Irrespective, the conflation of entity Name and representation Address ultimately remains. But again, in the Information Space realm these ambiguities are the norm. Thus, it ultimately boils down to bridge vocabularies and ontologies to solve this problem re. Data Space dimension exploitation. Personally, I just don't loose sleep over schema.org, its a great contribution that ultimately simplifies comprehension of the Data Space dimension. Remember, we humans don't do well with prevention, we prefer cure (via pills ideally) that are immediately available once calamities manifest :-) In any case there is a problem of translation that has to be dealt with first. Yes-ish. Kingsley Henry Social Web Architect http://bblfish.net/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Self-star Systems (was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...] )
+1 to Netlogo! Regards, Dave On Jun 19, 2011, at 18:52, John Erickson wrote: Henry Story asked... Perhaps a more scientific way to express this is within the language of self-organising systems. There is a lot of research there which is relevant to us. http://en.wikipedia.org/wiki/Self_organising_systems I am a bit new to this area. Any books I must read? I responded to Henry personally with the following, which Henry suggested I send to the list... snip Caution: The study of self-organizing systems will keep you up all night with its coolness! ;) You asked for some book recommendations; these are a few on my shelf. I think the only required reading is Out of Control, which will blow you mind, and the rest will just complement that ;) 1. Kevin Kelly, Out of Control 2. Mikhail Prokopenko (ed), Advances in Applied Self-Organizing Systems 3. Ozalp Babaoglu, et.al., Self-star Properties in Complex Information Systems 4. Yaneer Bar-Yam, Dynamics of Complex Systems 5. Martin A. Nowak, Evolutionary Dynamics: Exploring the Equations of Life An example chapter from Nowak: Evolutionary Graph Theory ;) BTW, if you haven't already, install NetLego http://ccl.northwestern.edu/netlogo/ immediately. Serious work going on there, but very accessible! /snip -- John S. Erickson, Ph.D. http://bitwacker.com olyerick...@gmail.com Twitter: @olyerickson Skype: @olyerickson
Re: WebID and pets -- was: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 19 June 2011 20:42, Henry Story henry.st...@bblfish.net wrote: On 19 Jun 2011, at 20:15, Danny Ayers wrote: Only personal Henry, but have you tried the Myers-Briggs thing - I think you used to be classic INTP/INTF - but once you got WebID in your sails it's very different. These things don't really allow for change. Is there a page where I can find this out in one click? Looks like those pages ask all kinds of questions that require detailed and complicated answers. I am surprised anyone ever answers those things. It's certainly more complex than the Object/Document distinction ;-) Myers Briggs is based on the Jungian analysis of mythology and personality types, with a few additions. Myths being public dreams, and dreams being private myths. The personality types are the lens from which we interpret the inner and outer universal symbols. e.g. Intuitively / Analytically / Senses / Feeling. But the symbols themselves are often the more fascinating parts. An interesting parallel here is the relation to Jung's archetypes of the unconscious and WebID. Both in your dreams, and in mythology, you have symbols where are metaphors that reference some universal concept. WebID is of course a reference to the self ( foaf : Person ). As many of the myths we live with today are 100s of years out of date, and people are searching for something new, perhaps WebID can become a modern symbol, to determine or even evangelize the new personality type of society, post information revolution :) Only slightly off-topic, very relevant here, need to pin down WebID in a sense my dogs can understand. Ok. So you need to give each of your dogs and cats a webid enabled RDFID chip that can publish webids to other animals with similarly equipped chips when they sniff them. From the frequence and length of sniffs you can work out the quality of the relationships. On coming home for food, this data could be uploaded automatically to your web server to their foaf file. These relationships could then be used to allow their pals access to parts of your house. For example good friends of your dog, could get a free meal once a week. You could also use that to tie up friendship with their owners, by the master-of-pet relationships, and give them special ability to tag their pet photos. Masters of my dogs friends could be potential friends. If you get these pieces working right you could set up a business with a strong viral potential, perhaps the strongest on the net. Here to make my point: The Myers-Briggs thing is intuitively rubbish. But with only one or two posts in the ground, it does seem you can extrapolate. On 19 June 2011 19:52, Henry Story henry.st...@bblfish.net wrote: On 19 Jun 2011, at 19:44, Danny Ayers wrote: I am of the view that this has been discussed to death, and that any mailing list that discusses this is short of real things to do. I confess to talking bollocks when I should be coding. yeah, me too. Though now you folks managed to get me interested in this problem! (sigh) Henry Social Web Architect http://bblfish.net/ -- http://danny.ayers.name Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 16 June 2011 22:39, Pat Hayes pha...@ihmc.us wrote: Not only do I not follow your reasoning, I don't even know what it is you are saying. The document is a valid *representation* of the car, yes of course. That's all that's necessary to square this circle. But as valid as the car itself? So you think a car is a representation of itself? Or are you drawing a contrast between the 'named car resource' and the car itself? ??? All HTTP delivers is representations of named resources. (I very much do think a car is a representation of itself in HTTP terms, in the same way a document is, but it isn't necessary here). Maybe it would be best if we just dropped this now. I gather that you were offering me a way to make semantic sense of something, but Im not getting any sense at all out of this discussion, I am afraid. I'll be delighted to drop it, I thought we were getting stuck in a tar pit but your statement above is the er, oil, that gets us out. Cheers, Danny. -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 17 June 2011 02:46, David Booth da...@dbooth.org wrote: I agree with TimBL that it is *good* to distinguish between web pages and dogs -- and we should encourage folks to do so -- because doing so *does* help applications that need this distinction. But the failure to make this distinction does *not* break the web architecture any more than a failure to distinguish between male dogs and female dogs. Thanks David, a nice summary of the most important point IMHO. Ok, I've been trying to rationalize the case where there is a failure to make the distinction, but that's very much secondary to the fact that nothing really gets broken. Cheers, Danny. http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Really (sorry to keep raining on the parade, but) it is not as simple as this. Look, it is indeed easy to not bother distinguishing male from female dogs. One simply talks of dogs without mentioning gender, and there is a lot that can be said about dogs without getting into that second topic. But confusing web pages, or documents more generally, with the things the documents are about, now that does matter a lot more, simply because it is virtually impossible to say *anything* about documents-or-things without immediately being clear which of them - documents or things - one is talking about. And there is a good reason why this particular confusion is so destructive. Unlike the dogs-vs-bitches case, the difference between the document and its topic, the thing, is that one is ABOUT the other. This is not simply a matter of ignoring some potentially relevant information (the gender of the dog) because one is temporarily not concerned with it: it is two different ways of using the very names that are the fabric of the descriptive representations themselves. It confuses language with language use, confuses language with meta-language. It is like saying giraffe has seven letters rather than giraffe has seven letters. Maybe this does not break Web architecture, but it certainly breaks **semantic** architecture. It completely destroys any semantic coherence we might, in some perhaps impossibly optimistic vision of the future, manage to create within the semantic web. So yes indeed, the Web will go on happily confusing things with documents, partly because the Web really has no actual contact with things at all: it is entirely constructed from documents (in a wide sense). But the SEMANTIC Web will wither and die, or perhaps be still-born, if it cannot find some way to keep use and mention separate and coherent. So far, http-range-14 is the only viable suggestion I have seen for how to do this. If anyone has a better one, let us discuss it. But just blandly assuming that it will all come out in the wash is a bad idea. It won't. Pat On Jun 18, 2011, at 1:51 PM, Danny Ayers wrote: On 17 June 2011 02:46, David Booth da...@dbooth.org wrote: I agree with TimBL that it is *good* to distinguish between web pages and dogs -- and we should encourage folks to do so -- because doing so *does* help applications that need this distinction. But the failure to make this distinction does *not* break the web architecture any more than a failure to distinguish between male dogs and female dogs. Thanks David, a nice summary of the most important point IMHO. Ok, I've been trying to rationalize the case where there is a failure to make the distinction, but that's very much secondary to the fact that nothing really gets broken. Cheers, Danny. http://danny.ayers.name IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 17/06/11 01:46, David Booth wrote: I agree with TimBL that it is *good* to distinguish between web pages and dogs -- and we should encourage folks to do so -- because doing so *does* help applications that need this distinction. But the failure to make this distinction does *not* break the web architecture any more than a failure to distinguish between male dogs and female dogs. We've been encouraging people to do so. Most do not have the time to invest in complexity that they percieve no benefit from adding. We need to reward people for good semantics by making sure there's tools and apps which add value for their business and activities. -- Christopher Gutteridge -- http://id.ecs.soton.ac.uk/person/1248 / Lead Developer, EPrints Project, http://eprints.org/ / Web Projects Manager, ECS, University of Southampton, http://www.ecs.soton.ac.uk/ / Webmaster, Web Science Trust, http://www.webscience.org/
Re: Squaring the HTTP-range-14 circle
On Thu, 2011-06-16 at 21:22 -0400, Tim Berners-Lee wrote: On 2011-06 -16, at 16:41, Ian Davis wrote: The problem here is that there are so few things that people want to say about web pages compared with the multitude of things they want to say about every other type of thing in existence. Well, that is a wonderful new thing. For a long while it was difficult to put data on the web, while there is quite a lot of metadata. Wonderful idea that the semantic web may be beating the document web hands down but that's not totally clear that we should trash the use of URIs for use to refer to documents as do in the document web. I'm sure Ian wasn't claiming the data web is beating the document web and equally sure that you don't really think he was :) FWIW my experience is also that most of the data that people want to publish *in RDF* is about things rather than web pages. Clearly there *are* good use cases for capturing web page metadata in RDF but I've not seen that many in-the-wild cases where people wanted to publish data about *both* the web page and the thing. That's why Ian's Back to Basics suggestion works for me [as a fall back from just use #]. My interpretation is that, unlike most of this thread, it wasn't saying use URIs ambiguously but saying the interpretation of the URI is up to the publisher and is discovered from the data not from the protocol response, it is legitimate to use a http-no-# URI to denote a thing if that is what you really want to do. Thus if I want to publish a table of e.g. population statistics at http://foobar.gov.uk/datasets/population then I can do so and use that URI within the RDF data as denoting the data set. As publisher I'm saying this is a qb:DataSet not a web page, anything that looks like a web page when you point a browser at it is just a rendering related to that data and that rendering isn't being given a separate URI so you can talk about it, sorry about that. If you use HTTP 200 for something different, then you break my ability to look at a page, review it, and then express my review in RDF, using the page's URI as the identifier. Not quite. It is saying that you can't give a review for my http://foobar.gov.uk/datasets/population web page because the RDF returned by the URI says it denotes a dataset not the web page. You can still review the dataset itself. You can review other web pages which don't return RDF data saying they are something other than a web page. [As an aside, I would claim that most reviews are in fact about things - restaurants, books, music - not about the web pages.] Dave
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/17/11 1:46 AM, David Booth wrote: I agree with TimBL that it is*good* to distinguish between web pages and dogs -- and we should encourage folks to do so -- because doing so *does* help applications that need this distinction. But the failure to make this distinction does*not* break the web architecture any more than a failure to distinguish between male dogs and female dogs. Instead of *break* what about compromising or undermining flexibility implicit in AWWW? This is tantamount to obscuring the WWW potential relative to its broad user constituency. Re. schema.org, I don't regard their effort as breaking, compromising, or undermining AWWW. I simply believe they are taking baby steps that are 100% defined by their current business models. Rightly or wrongly so, they have to protect their business models. In a sense, the same applies to academia and its model where grant funding is vital to research projects. What is dangerous though, is encouraging people to misuse and misunderstand AWWW. Names and Addresses are distinct items. AWWW essence depends on preserving this vital distinction. When there are more applications (+1 to Henry's comment about focusing on Linked Data apps and viral patterns) this lower level matter will vapourize. Although not present (I am too young) I am certain similar arguments arose during the early days of silicon based computing between OS developers and programming language developers. I certainly know these conversations did arise when Spreadsheets vendors tackled Cell Reference functionality. There are many useful cases in plain sight that many overlook re. power of URIs as data conductors, integrators, and access mechanisms. I think (based on my experience with this community and industry at large) that there is too much focus on reinventing too many parts of the consumption stack, from scratch. The key is to be useful but introduce usefulness unobtrusively if you really seek uptake. Naturally, this requires understanding of what already exists (i.e., domain and subject matter knowledge) and functionality areas addressed by existing solutions. Sorry, but if all you do is program, you cannot really understand the reality of end-users. I like to make reference to Apple as a great anecdote because they've risen from near demise to the vanguard of modern computing by exploiting the InterWeb from the inside out, they don't see the Web as simply being about HTML. They understand that its a linked information space and future data space. They utilize this insight internally in a manner that just manifests as being useful to its ever growing customer base. Remember, there's a lot of old NeXTStep still underlying what Apple does. Also remember, the WWW was built on an NeXT machine with a lot of inspiration from how its innards worked. Believe it or not, we are still playing catch up (circa. 20011) with NeXTStep and Unix in general re. really smart and useful Linked Data apps :-) Embrace history and the future gets clearer and much more exciting. We have an unbelievable opportunity within grasp. We can embrace and extend (in a good way) what we may perceive as imperfections by others (e.g. schema.org). As Pat stated in an earlier post, these imperfections present opportunities that might even span decades before the behemoths out there hit their respective opportunity cost thresholds. Once said thresholds are hit they will respond accordingly via product fixes and/or enterprise acquisitions etc.. Contrary to popular belief, I will state once again that HTTP 303 is the poster child for ingenuity inherent in the HTTP protocol and the AWWW. Yes, we could also up the semantic smarts on clients and let a retrieved resource disambiguate Names and Addresses, but that only adds a burden to a target audience that's already challenged re: 1. recognizing linked data structures via directed graphs 2. recognizing that linked data structures have always been about links and that HTTP URIs are a powerful vehicle for expanding this concept to InterWeb scales 3. recognizing that de-reference (indirection) and address-of operations are achievable via URIs and cost-effectively so via HTTP URIs due to WWW ubiquity 4. understanding that RDF is *an option* for linked data structures at InterWeb scales, you can use other syntaxes without losing access to really useful stuff like RDFS and OWL semantics (which also suffers from over emphasis on RDF at expense of core syntax agnostic concepts). Links: 1. http://en.wikipedia.org/wiki/Spreadsheet#Cells 2. http://en.wikipedia.org/wiki/Spreadsheet#Named_cells . -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
On 6/17/11 12:35 PM, Dave Reynolds wrote: If you use HTTP 200 for something different, then you break my ability to look at a page, review it, and then express my review in RDF, using the page's URI as the identifier. Not quite. It is saying that you can't give a review for my http://foobar.gov.uk/datasets/population web page because the RDF returned by the URI says it denotes a dataset not the web page. You can still review the dataset itself. You can review other web pages which don't return RDF data saying they are something other than a web page. Let's look at this from a slight different angle. What does HTTP 200 OK mean? I believe it's how a server indicates to a client that an Address (it created) is functional . I believe Tim is saying: HTTP 200 OK is integral to the Web in a general sense. This is behavior backed into AWWW that underlies the ubiquitous WWW albeit the information space dimension re., Linked Documents. An HTML resource is still a resource, and 200 OK doesn't care about the resource type. As I stated in an earlier post, handling indirection on the server (this is basically what we did in our very first Linked Data server implementation, pre. Banff 2007) puts a burden on the clients i.e., it really sets an expectation that the client is willing and capable of doing Name and Address disambiguation by analyzing the data returned. Now, if an application commits 100% to self-describing data expressed in graph form, serialized in a variety of representations, that would work, but in reality this is actually worse than what we are grappling with right now re. paths of least resistance en route to broadening and accelerating Linked Data uptake. Thus, like all things, its at best just another option with some consequences that could ultimately compromise the big picture goal. -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
On Fri, Jun 17, 2011 at 12:35 PM, Dave Reynolds dave.e.reyno...@gmail.com wrote: On Thu, 2011-06-16 at 21:22 -0400, Tim Berners-Lee wrote: On 2011-06 -16, at 16:41, Ian Davis wrote: The problem here is that there are so few things that people want to say about web pages compared with the multitude of things they want to say about every other type of thing in existence. Well, that is a wonderful new thing. For a long while it was difficult to put data on the web, while there is quite a lot of metadata. Wonderful idea that the semantic web may be beating the document web hands down but that's not totally clear that we should trash the use of URIs for use to refer to documents as do in the document web. I'm sure Ian wasn't claiming the data web is beating the document web and equally sure that you don't really think he was :) Yes, absolutely. FWIW my experience is also that most of the data that people want to publish *in RDF* is about things rather than web pages. Clearly there *are* good use cases for capturing web page metadata in RDF but I've not seen that many in-the-wild cases where people wanted to publish data about *both* the web page and the thing. That's why Ian's Back to Basics suggestion works for me [as a fall back from just use #]. My interpretation is that, unlike most of this thread, it wasn't saying use URIs ambiguously but saying the interpretation of the URI is up to the publisher and is discovered from the data not from the protocol response, it is legitimate to use a http-no-# URI to denote a thing if that is what you really want to do. Yes, that's exactly what I am saying. Thus if I want to publish a table of e.g. population statistics at http://foobar.gov.uk/datasets/population then I can do so and use that URI within the RDF data as denoting the data set. As publisher I'm saying this is a qb:DataSet not a web page, anything that looks like a web page when you point a browser at it is just a rendering related to that data and that rendering isn't being given a separate URI so you can talk about it, sorry about that. If you use HTTP 200 for something different, then you break my ability to look at a page, review it, and then express my review in RDF, using the page's URI as the identifier. Not quite. It is saying that you can't give a review for my http://foobar.gov.uk/datasets/population web page because the RDF returned by the URI says it denotes a dataset not the web page. You can still review the dataset itself. You can review other web pages which don't return RDF data saying they are something other than a web page. [As an aside, I would claim that most reviews are in fact about things - restaurants, books, music - not about the web pages.] Quite. When a facebook user clicks the Like button on an IMDB page they are expressing an opinion about the movie, not the page. Ian
Re: Squaring the HTTP-range-14 circle
On 2011-06 -17, at 08:51, Ian Davis wrote: If you use HTTP 200 for something different, then you break my ability to look at a page, review it, and then express my review in RDF, using the page's URI as the identifier. Not quite. It is saying that you can't give a review for my http://foobar.gov.uk/datasets/population web page because the RDF returned by the URI says it denotes a dataset not the web page. You can still review the dataset itself. You can review other web pages which don't return RDF data saying they are something other than a web page. [As an aside, I would claim that most reviews are in fact about things - restaurants, books, music - not about the web pages.] Quite. When a facebook user clicks the Like button on an IMDB page they are expressing an opinion about the movie, not the page. BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. I am really not sure that I want to give up the ability in my browser to bookmark a page about something -- the IMDB page a about a movie, rather than the movie itself. When the cost os just fixing Microdata syntax to make it easy to say things about the subject of a page. Tim
Re: Squaring the HTTP-range-14 circle
On Fri, Jun 17, 2011 at 2:04 PM, Tim Berners-Lee ti...@w3.org wrote: On 2011-06 -17, at 08:51, Ian Davis wrote: If you use HTTP 200 for something different, then you break my ability to look at a page, review it, and then express my review in RDF, using the page's URI as the identifier. Not quite. It is saying that you can't give a review for my http://foobar.gov.uk/datasets/population web page because the RDF returned by the URI says it denotes a dataset not the web page. You can still review the dataset itself. You can review other web pages which don't return RDF data saying they are something other than a web page. [As an aside, I would claim that most reviews are in fact about things - restaurants, books, music - not about the web pages.] Quite. When a facebook user clicks the Like button on an IMDB page they are expressing an opinion about the movie, not the page. BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. Sure. All these use cases stand and can co-exist. I can look at the data in any of those responses, or data I glean from elsewhere, to figure out if the URI I'm accessing refers to the content I received or the subject of that content. That model works for any protocol BTW. I am really not sure that I want to give up the ability in my browser to bookmark a page about something -- the IMDB page a about a movie, rather than the movie itself. OK, we differ here then. I would prefer to bookmark the movie because that's what I'm interested in. The page will change over the years but the movie will still persist. Today you have no choice because your conceptual model does not give a URI to the movie and doesn't see the need to generate 2 URIs. When the cost os just fixing Microdata syntax to make it easy to say things about the subject of a page. i don't think this has anything to do with microdata. Ian
Re: Squaring the HTTP-range-14 circle
On 17 Jun 2011, at 15:04, Tim Berners-Lee wrote: AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. Indeed I have had a few people on Facebook comment that they were very unhappy not being able to distinguish between what the object of a like is. Such as when one likes a page about the death of a friend, or about some child tortured in some distant county. Of course FB was right to start with such a simple relation. Just as the web started with the a href=../a link. One starts with the simplest relations that make no clear distinction between what is liked and then following the pressure from the community, and business opportunities, one adds distinctions in the order of which is the most profitable to add next. I am sure Facebook is very greatful to academia for having shown that it will find it no trouble to move to liking objects and being able to distinguish those from web pages. But their job is to build tools that generate huge markets in order to build profit, so they will only increase the subtlety of their distinctions as their business cases require them. We can build ontologies that follow a similar path, starting from ontologies that don't require someone to distinguish between pages and things named by them. It will be interesting to work out how far one can go with that and at what point it breaks down conceptually. So with the like button, it does not allow one to distinguish the liking of an article or the death of a friend. But the procedural value of like - easy notification system - was big enough to build out a huge market - within the conceptual limitations of the relation. In any case it does not seem that this has anything to do with architectural limitations of the Web, since it is easy I think even in RDF to do both. Henry Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle
On Fri, Jun 17, 2011 at 2:04 PM, Tim Berners-Lee ti...@w3.org wrote: Not quite. It is saying that you can't give a review for my http://foobar.gov.uk/datasets/population web page because the RDF returned by the URI says it denotes a dataset not the web page. You can still review the dataset itself. You can review other web pages which don't return RDF data saying they are something other than a web page. [As an aside, I would claim that most reviews are in fact about things - restaurants, books, music - not about the web pages.] Quite. When a facebook user clicks the Like button on an IMDB page they are expressing an opinion about the movie, not the page. BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. As an additional point, a review _is_ a seperate thing, it's not a web page. It is often contained within a webpage. It seems you are conflating the two here. Reviews and comments can be and often are syndicated across multiple sites so clearly any liking of the review needs to flow with it. Ian
Re: Squaring the HTTP-range-14 circle
Hi, On 17 June 2011 14:04, Tim Berners-Lee ti...@w3.org wrote: On 2011-06 -17, at 08:51, Ian Davis wrote: ... Quite. When a facebook user clicks the Like button on an IMDB page they are expressing an opinion about the movie, not the page. BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. Well even that's debatable. I just had to go and check whether Amazon reviews and Facebook comments actually do have their own pages. That's because I've never seen them presented as anything other than objects within another container, either in a web page or a mobile app. So I think you could argue that when people are linking and marking things as useful, they're doing that on a more general abstraction, i.e. the Work (to borrow FRBR terminology) not the particular web page. And that's presumably the way that Facebook and Amazon see it too because that data is associated with the status or review in whichever medium I look at it (page or app). Cheers, L. -- Leigh Dodds Programme Manager, Talis Platform Mobile: 07850 928381 http://kasabi.com http://talis.com Talis Systems Ltd 43 Temple Row Birmingham B2 5LS
Re: Squaring the HTTP-range-14 circle
On 6/17/11 2:18 PM, Ian Davis wrote: I am really not sure that I want to give up the ability in my browser to bookmark a page about something -- the IMDB page a about a movie, rather than the movie itself. OK, we differ here then. I would prefer to bookmark the movie because that's what I'm interested in. Yes, and you have that right as an individual using the Web. Same applies to those that want to bookmark a Page about the Movie. Thus, the eternal challenge remains: how does a system inherently cater for natural variations inherent in individuals. This is where AWWW scores big time re. Web as a Global Space for Information and Data :-) The page will change over the years but the movie will still persist. Yes, and I may be interested in understanding the evolution of the page over the years. The Page is as valid a Data Object as its Subject Matter. This is the crux of the matter. The system has to handle our individuality, as per earlier comment. Today you have no choice because your conceptual model does not give a URI to the movie and doesn't see the need to generate 2 URIs. Today, we don't have the options in question because a majority of Web users are still only utilizing its Information Space dimension. -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
Hi, On 6/17/2011 4:11 PM, Leigh Dodds wrote: Hi, On 17 June 2011 14:04, Tim Berners-Leeti...@w3.org wrote: On 2011-06 -17, at 08:51, Ian Davis wrote: ... Quite. When a facebook user clicks the Like button on an IMDB page they are expressing an opinion about the movie, not the page. BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. Well even that's debatable. I just had to go and check whether Amazon reviews and Facebook comments actually do have their own pages. That's because I've never seen them presented as anything other than objects within another container, either in a web page or a mobile app. So I think you could argue that when people are linking and marking things as useful, they're doing that on a more general abstraction, i.e. the Work (to borrow FRBR terminology) not the particular web page. Well, that is obviously the level where the (abstract) information resource is located (can be located), or? ;) Cheers, Bob PS: cf., e.g., http://odontomachus.wordpress.com/2011/02/13/frbr-and-the-web/ ;)
Re: Squaring the HTTP-range-14 circle
On 6/17/11 2:55 PM, Ian Davis wrote: BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. As an additional point, a review_is_ a seperate thing, it's not a web page. It is often contained within a webpage. It seems you are conflating the two here. Reviews and comments can be and often are syndicated across multiple sites so clearly any liking of the review needs to flow with it. Yes, it is a separate thing representable as a Data Object. Now the obvious question: what is a Web Page? Isn't that a sourced from Data at an Address that's streamed to a client that uses a specific data presentation metaphor as basis for user comprehension? Are the following identical or different, re. URI functionality ? 1. http://dbpedia.org/resource/Linked_Data 2. http://dbpedia.org/page/Linked_Data 3. http://dbpedia.org/data/Linked_Data.json . I may want to bookmark: http://dbpedia.org/page/Linked_Data, I may also be interested in its evolution over time via services lime memento [1] . The thing is that re. WWW we have an Information Space dimension and associated patterns that's preceded the Data Space dimension and emerging patterns that we (this community) are collectively trying to crystallize, in an unobtrusive manner. Links: 1. http://www.mementoweb.org/guide/quick-intro/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
On 6/17/11 3:11 PM, Leigh Dodds wrote: I just had to go and check whether Amazon reviews and Facebook comments actually do have their own pages. That's because I've never seen them presented as anything other than objects within another container, either in a web page or a mobile app. So I think you could argue that when people are linking and marking things as useful, they're doing that on a more general abstraction, i.e. the Work (to borrow FRBR terminology) not the particular web page. You have to apply context to your statement above. Is the context: WWW as an Information space or Data Space? These contexts can co-exist, but we need to allow users context-switch, unobtrusively. Thus, they have to co-exist, and that's why we have to leverage what the full URI abstraction delivers. As stated earlier, it doesn't mean others will follow or understand immediately, you need more than architecture for that; hence the need for a broad spectrum of solutions that do things properly. -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
Small typo changed the meaning of what I was saying: On Fri, Jun 17, 2011 at 2:18 PM, Ian Davis li...@iandavis.com wrote: OK, we differ here then. I would prefer to bookmark the movie because that's what I'm interested in. The page will change over the years but the movie will still persist. Today you have no choice because your conceptual model does not give a URI to the movie and doesn't see the need to generate 2 URIs. But I meant to write: Today you have no choice because your conceptual model does not give a URI to the movie and [the publisher] doesn't see the need to generate 2 URIs. Of course I recognise your conceptual model sees the need for multiple URIs... :) Ian
Re: Squaring the HTTP-range-14 circle
Hi all, This thread seems to me to be classic neat vs. scruffy argument [1]. I used to be a neat, when I was young, foolish and of course selfish. Now that I am old enough to see others' points of view, I have become scruffy. Either that, or I'm just tired of trying to force others to do things my way. The Web is a scruffy place and that is a feature, not a bug. Regards, Dave [1] http://en.wikipedia.org/wiki/Neats_vs._scruffies On Jun 17, 2011, at 10:27, Kingsley Idehen wrote: On 6/17/11 2:55 PM, Ian Davis wrote: BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. As an additional point, a review_is_ a seperate thing, it's not a web page. It is often contained within a webpage. It seems you are conflating the two here. Reviews and comments can be and often are syndicated across multiple sites so clearly any liking of the review needs to flow with it. Yes, it is a separate thing representable as a Data Object. Now the obvious question: what is a Web Page? Isn't that a sourced from Data at an Address that's streamed to a client that uses a specific data presentation metaphor as basis for user comprehension? Are the following identical or different, re. URI functionality ? 1. http://dbpedia.org/resource/Linked_Data 2. http://dbpedia.org/page/Linked_Data 3. http://dbpedia.org/data/Linked_Data.json . I may want to bookmark: http://dbpedia.org/page/Linked_Data, I may also be interested in its evolution over time via services lime memento [1] . The thing is that re. WWW we have an Information Space dimension and associated patterns that's preceded the Data Space dimension and emerging patterns that we (this community) are collectively trying to crystallize, in an unobtrusive manner. Links: 1. http://www.mementoweb.org/guide/quick-intro/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
On 6/17/11 3:27 PM, Kingsley Idehen wrote: Yes, it is a separate thing representable as a Data Object. Now the obvious question: what is a Web Page? Isn't that a sourced from Data at an Address that's streamed to a client that uses a specific data presentation metaphor as basis for user comprehension? Are the following identical or different, re. URI functionality ? 1. http://dbpedia.org/resource/Linked_Data 2. http://dbpedia.org/page/Linked_Data 3. http://dbpedia.org/data/Linked_Data.json . I may want to bookmark: http://dbpedia.org/page/Linked_Data, I may also be interested in its evolution over time via services lime memento [1] . The thing is that re. WWW we have an Information Space dimension and associated patterns that's preceded the Data Space dimension and emerging patterns that we (this community) are collectively trying to crystallize, in an unobtrusive manner. Links: 1. http://www.mementoweb.org/guide/quick-intro/ Meant to say: ** Yes, it is a separate thing representable as a Data Object. Now the obvious question: what is a Web Page? Isn't that Data streamed from an Address (provided by a server) to a client that uses a specific data presentation metaphor as basis for user comprehension? *** Are the following identical or different, re. URI functionality ? 1. http://dbpedia.org/resource/Linked_Data 2. http://dbpedia.org/page/Linked_Data 3. http://dbpedia.org/data/Linked_Data.json . I may want to bookmark: http://dbpedia.org/page/Linked_Data, I may also be interested in its evolution over time via services lime memento [1] . The thing is that re. WWW we have an Information Space dimension and associated patterns that's preceded the Data Space dimension and emerging patterns that we (this community) are collectively trying to crystallize, in an unobtrusive manner. Links: 1. http://www.mementoweb.org/guide/quick-intro/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
Hi, On 17 June 2011 15:32, Kingsley Idehen kide...@openlinksw.com wrote: On 6/17/11 3:11 PM, Leigh Dodds wrote: I just had to go and check whether Amazon reviews and Facebook comments actually do have their own pages. That's because I've never seen them presented as anything other than objects within another container, either in a web page or a mobile app. So I think you could argue that when people are linking and marking things as useful, they're doing that on a more general abstraction, i.e. the Work (to borrow FRBR terminology) not the particular web page. You have to apply context to your statement above. Is the context: WWW as an Information space or Data Space? I can't answer that because I don't know what you mean by those terms. It's just a web of resources as far as I'm concerned. Cheers, L. -- Leigh Dodds Programme Manager, Talis Platform Mobile: 07850 928381 http://kasabi.com http://talis.com Talis Systems Ltd 43 Temple Row Birmingham B2 5LS
Re: Squaring the HTTP-range-14 circle
On 6/17/11 3:36 PM, David Wood wrote: Hi all, This thread seems to me to be classic neat vs. scruffy argument [1]. I used to be a neat, when I was young, foolish and of course selfish. Now that I am old enough to see others' points of view, I have become scruffy. Either that, or I'm just tired of trying to force others to do things my way. The Web is a scruffy place and that is a feature, not a bug. May I say: it accommodates scruffiness because of its architecture :-) Kingsley Regards, Dave [1] http://en.wikipedia.org/wiki/Neats_vs._scruffies On Jun 17, 2011, at 10:27, Kingsley Idehen wrote: On 6/17/11 2:55 PM, Ian Davis wrote: BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. As an additional point, a review_is_ a seperate thing, it's not a web page. It is often contained within a webpage. It seems you are conflating the two here. Reviews and comments can be and often are syndicated across multiple sites so clearly any liking of the review needs to flow with it. Yes, it is a separate thing representable as a Data Object. Now the obvious question: what is a Web Page? Isn't that a sourced from Data at an Address that's streamed to a client that uses a specific data presentation metaphor as basis for user comprehension? Are the following identical or different, re. URI functionality ? 1. http://dbpedia.org/resource/Linked_Data 2. http://dbpedia.org/page/Linked_Data 3. http://dbpedia.org/data/Linked_Data.json . I may want to bookmark: http://dbpedia.org/page/Linked_Data, I may also be interested in its evolution over time via services lime memento [1] . The thing is that re. WWW we have an Information Space dimension and associated patterns that's preceded the Data Space dimension and emerging patterns that we (this community) are collectively trying to crystallize, in an unobtrusive manner. Links: 1. http://www.mementoweb.org/guide/quick-intro/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
On 6/17/11 3:44 PM, Leigh Dodds wrote: Hi, On 17 June 2011 15:32, Kingsley Idehenkide...@openlinksw.com wrote: On 6/17/11 3:11 PM, Leigh Dodds wrote: I just had to go and check whether Amazon reviews and Facebook comments actually do have their own pages. That's because I've never seen them presented as anything other than objects within another container, either in a web page or a mobile app. So I think you could argue that when people are linking and marking things as useful, they're doing that on a more general abstraction, i.e. the Work (to borrow FRBR terminology) not the particular web page. You have to apply context to your statement above. Is the context: WWW as an Information space or Data Space? I can't answer that because I don't know what you mean by those terms. It's just a web of resources as far as I'm concerned. Cheers, L. Links that will help you with terminology. 1. http://linkeddatabook.com/editions/1.0/ -- Web as Global Data Space 2. http://www.xent.com/FoRK-archive/dec00/0608.html -- Web as Information Space (courtesy of quick Google search). -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
On Jun 17, 2011, at 6:35 AM, Dave Reynolds wrote: [As an aside, I would claim that most reviews are in fact about things - restaurants, books, music - not about the web pages.] Or about the weather in Oacala, for example. Pat Dave IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Re: Squaring the HTTP-range-14 circle
On Fri, Jun 17, 2011 at 11:22 AM, Pat Hayes pha...@ihmc.us wrote: On Jun 17, 2011, at 6:35 AM, Dave Reynolds wrote: [As an aside, I would claim that most reviews are in fact about things - restaurants, books, music - not about the web pages.] Or about the weather in Oacala, for example. Pat And is the weather part of the essence of the resource? -Alan
Re: Squaring the HTTP-range-14 circle
On 6/17/11 3:52 PM, Kingsley Idehen wrote: On 6/17/11 3:44 PM, Leigh Dodds wrote: Hi, On 17 June 2011 15:32, Kingsley Idehenkide...@openlinksw.com wrote: On 6/17/11 3:11 PM, Leigh Dodds wrote: I just had to go and check whether Amazon reviews and Facebook comments actually do have their own pages. That's because I've never seen them presented as anything other than objects within another container, either in a web page or a mobile app. So I think you could argue that when people are linking and marking things as useful, they're doing that on a more general abstraction, i.e. the Work (to borrow FRBR terminology) not the particular web page. You have to apply context to your statement above. Is the context: WWW as an Information space or Data Space? I can't answer that because I don't know what you mean by those terms. It's just a web of resources as far as I'm concerned. Cheers, L. Links that will help you with terminology. 1. http://linkeddatabook.com/editions/1.0/ -- Web as Global Data Space 2. http://www.xent.com/FoRK-archive/dec00/0608.html -- Web as Information Space (courtesy of quick Google search). Leigh, In addition to the above, and bearing in mind Harry's contribution to this conversation, here is one of his presentations in which slide #2 makes reference to WWW as an Information Space [1]. Links: 1. http://www.ibiblio.org/hhalpin/homepage/presentations/interface/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
Tim Berners-Lee wrote: And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. yes, common use case, many sites give karma to comments / reviews and have links to them both in and out of context. When the cost os just fixing Microdata syntax to make it easy to say things about the subject of a page. far from expert on microdata, but @itemid may well cater for this.
Re: Squaring the HTTP-range-14 circle
Ian Davis wrote: As an additional point, a review _is_ a seperate thing, it's not a web page. It is often contained within a webpage. It seems you are conflating the two here. Reviews and comments can be and often are syndicated across multiple sites so clearly any liking of the review needs to flow with it. so the like data needs to be webized and exposed easily. also, realtime updates on / streams of such data would come in very useful. permissions and visibility would looked at though, so probably authentication via webid or other would be needed too.
Re: Squaring the HTTP-range-14 circle
Kingsley Idehen wrote: On 6/17/11 3:11 PM, Leigh Dodds wrote: I just had to go and check whether Amazon reviews and Facebook comments actually do have their own pages. That's because I've never seen them presented as anything other than objects within another container, either in a web page or a mobile app. So I think you could argue that when people are linking and marking things as useful, they're doing that on a more general abstraction, i.e. the Work (to borrow FRBR terminology) not the particular web page. You have to apply context to your statement above. Is the context: WWW as an Information space or Data Space? These contexts can co-exist, but we need to allow users context-switch, unobtrusively. Thus, they have to co-exist, and that's why we have to leverage what the full URI abstraction delivers. As stated earlier, it doesn't mean others will follow or understand immediately, you need more than architecture for that; hence the need for a broad spectrum of solutions that do things properly. and UX challenges, indeed if the ux was addressed first for the functionality, then whatever was implemented could be webized and standardized - could be a good way to force innovation in this area.
Re: Squaring the HTTP-range-14 circle
could also term it constrained vs diverse :) David Wood wrote: Hi all, This thread seems to me to be classic neat vs. scruffy argument [1]. I used to be a neat, when I was young, foolish and of course selfish. Now that I am old enough to see others' points of view, I have become scruffy. Either that, or I'm just tired of trying to force others to do things my way. The Web is a scruffy place and that is a feature, not a bug. Regards, Dave [1] http://en.wikipedia.org/wiki/Neats_vs._scruffies On Jun 17, 2011, at 10:27, Kingsley Idehen wrote: On 6/17/11 2:55 PM, Ian Davis wrote: BUT when the click a Like button on a blog they are expressing they like the blog, not the movie it is about. AND when they click like on a facebook comment they are saying they like the comment not the thing it is commenting on. And on Amazon people say I found this review useful to like the review on the product being reviewed, separately from rating the product. So there is a lot of use out there which involves people expressing stuff in general about the message not its subject. As an additional point, a review_is_ a seperate thing, it's not a web page. It is often contained within a webpage. It seems you are conflating the two here. Reviews and comments can be and often are syndicated across multiple sites so clearly any liking of the review needs to flow with it. Yes, it is a separate thing representable as a Data Object. Now the obvious question: what is a Web Page? Isn't that a sourced from Data at an Address that's streamed to a client that uses a specific data presentation metaphor as basis for user comprehension? Are the following identical or different, re. URI functionality ? 1. http://dbpedia.org/resource/Linked_Data 2. http://dbpedia.org/page/Linked_Data 3. http://dbpedia.org/data/Linked_Data.json . I may want to bookmark: http://dbpedia.org/page/Linked_Data, I may also be interested in its evolution over time via services lime memento [1] . The thing is that re. WWW we have an Information Space dimension and associated patterns that's preceded the Data Space dimension and emerging patterns that we (this community) are collectively trying to crystallize, in an unobtrusive manner. Links: 1. http://www.mementoweb.org/guide/quick-intro/ -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Danny Ayers wrote: On 16 June 2011 02:26, Pat Hayes pha...@ihmc.us wrote: If you agree with Danny that a description can be a substitute for the thing it describes, then I am waiting to hear how one of you will re-write classical model theory to accommodate this classical use/mention error. You might want to start by reading Korzybski's 'General Semantics'. IANAL, but I have heard of the use/mention thing, quite often. I don't honestly know whether classical model theory needs a rewrite, but I'm sure it doesn't on the basis of this thread. I also don't know enough to know whether it's applicable - from your reaction, I suspect not. As a publisher of information on the Web, I'm pretty much free to say what I like (cf. Tim's Design Notes). Fish are bicycles. But that isn't very useful. But if I say Sasha is some kind of weird Collie-German Shepherd cross, that has direct relevance to Sasha herself. More, the arcs in my description between Sasha and her parents have direct correspondence with the arcs between Sasha and her parents. There is information common to the reality and the description (at least in human terms). The description may, when you stand back, be very different in its nature to the reality, but if you wish to make use of the information, such common aspects are valuable. We've already established that HTTP doesn't deal with any kind of one true representation. Data about Sasha's parentage isn't Sasha, but it's closer than a non-committal 303 or rdfs:seeAlso. There's nothing around HTTP that says it can't be given the same name, and it's a darn sight more useful than a wave-over-there redirect or a random fish/bike association. I can't see anything it breaks either. You could use the same name for both if each name was always coupled to a universe, specified by the predicate, and you cut out type information from data, such that: x-sasha :animalname sasha ; :created 2011 . was read as: Animal(x-sasha) :animalname sasha . Document(x-sasha) :created 2011 . the ability to do this could be pushed on to ontologies, with domain and range and restrictions specifying universes and boundaries - but it's a big change. really, different names for different things is quite simple to stick to, and considering most (virtually all) documents on the web have several different elements and identifiable things, the one page one subject thing isn't worth spending too much time focusing on as a generic use case, as any solution based on it won't apply to the web at large which is very diverse and packed full of lots of potentially identifiable things. best, nathan
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Alan Ruttenberg wrote: Pat's knows something about the history of what's known to work and what isn't. You ignore that history at the peril of your ideas simply not working. well said, although I think we could bracket yourself in that category too :)
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 17 Jun 2011, at 22:42, Nathan wrote: You could use the same name for both if each name was always coupled to a universe, specified by the predicate, and you cut out type information from data, such that: x-sasha :animalname sasha ; :created 2011 . was read as: Animal(x-sasha) :animalname sasha . Document(x-sasha) :created 2011 . the ability to do this could be pushed on to ontologies, with domain and range and restrictions specifying universes and boundaries - but it's a big change. No its quite simple in fact, as I pointed out in a couple of e-mails in this thread. You just need to be careful when creating relations that certain relations are in fact inferred relations between primary topics. really, different names for different things is quite simple to stick to, yes, but there are a lot of people who say it is too complicated. I don't find it so, but perhaps it is for their use cases. I say that we describe the option they like, find out what the limitations are they will fall have, and document it. Then next time we can refer others to that discovery. So limitations to look for would be limitations as to the complexity of the data created. The other limitations is that even on simple blog pages there are at least three or four things on the page. and considering most (virtually all) documents on the web have several different elements and identifiable things, indeed. the one page one subject thing isn't worth spending too much time focusing on as a generic use case, as any solution based on it won't apply to the web at large which is very diverse and packed full of lots of potentially identifiable things. agree. But it is one of those things that newbies feel the urge to do, and will keep on wanting to do. So perhaps for them one should have special simple ontologies or guides for how to build these ObjectDocument ontologies. In any case this seems to be the type of thing the microformats people were (are?) doing. Henry best, nathan Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Henry Story wrote: On 17 Jun 2011, at 22:42, Nathan wrote: You could use the same name for both if each name was always coupled to a universe, specified by the predicate, and you cut out type information from data, such that: x-sasha :animalname sasha ; :created 2011 . was read as: Animal(x-sasha) :animalname sasha . Document(x-sasha) :created 2011 . the ability to do this could be pushed on to ontologies, with domain and range and restrictions specifying universes and boundaries - but it's a big change. No its quite simple in fact, as I pointed out in a couple of e-mails in this thread. You just need to be careful when creating relations that certain relations are in fact inferred relations between primary topics. I'd agree, but anything that involves being careful is pretty much doomed to failure on the web :p really, different names for different things is quite simple to stick to, yes, but there are a lot of people who say it is too complicated. I don't find it so, but perhaps it is for their use cases. I say that we describe the option they like, find out what the limitations are they will fall have, and document it. Then next time we can refer others to that discovery. So limitations to look for would be limitations as to the complexity of the data created. The other limitations is that even on simple blog pages there are at least three or four things on the page. there's also a primary limitation of the programming languages developers are using, if they've got locked in stone classes and objects, or even just structures, then the dynamics of RDF can be pretty hard to both understand mentally, and use practically. and considering most (virtually all) documents on the web have several different elements and identifiable things, indeed. the one page one subject thing isn't worth spending too much time focusing on as a generic use case, as any solution based on it won't apply to the web at large which is very diverse and packed full of lots of potentially identifiable things. agree. But it is one of those things that newbies feel the urge to do, and will keep on wanting to do. So perhaps for them one should have special simple ontologies or guides for how to build these ObjectDocument ontologies. In any case this seems to be the type of thing the microformats people were (are?) doing. hmm.. microformats seems to be pretty focussed on describing multiple items on one page, however the singularity is present in that they focussed on being described using a single Class Blueprint style, one class, a predetermined set of properties belonging to the class, and a simple chained heirarchy - this stems from most OO based languages. With a bit of trickery you can use RDF and OWL the same way, it just means you have different views over the data, where you can see Human(x) with a set of properties, or Male(x) with another set, or Administrator(x) with yet another set. This is less about the data published and more about how it's consumed viewed and processed though. Quite sure something can be done with that, where the simple version of the data uses a basic schema.org like ontology, and advanced usage is more RDF like using multiple ontologies. The views thing would be a way to merge the two approaches.. Best, Nathan
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Wed, Jun 15, 2011 at 6:24 AM, Richard Cyganiak rich...@cyganiak.dewrote: On 15 Jun 2011, at 01:07, Alan Ruttenberg wrote: Google won't scrap schema.org because your thought experiment proved that it's not “semantically clear.” Richard, that wasn't the point. You mocked the idea that semantically clear could be defined. I responded with an attempt. I have no interest in theoretical discussions that are detached from application. I assume you mean you are not interested in discussions of theory that are detached from application. In any case this is a non-sequitor. The definition is offered because some, including myself, think that there are important classes of applications for which it is an essential ingredient of success (like some of the ones I need to build), and because you implied that defining what we meant was not feasible. I think that we are beyond the point where that kind of extremely idealised account is useful for evaluating web technologies. We will agree to disagree then. Perhaps in another thread you will say what *will* be useful for evaluating web technologies. Adoption trends, ergonomics, fit with the existing technology ecosystem, existence of migration paths, marketability, potential of network effects. Does what the technology *accomplishes* fit in there somewhere? Looking at the above, one might conclude that a successful Ponzi scheme of some sort would score well. Regards, Alan
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 16 Jun 2011, at 07:05, Alan Ruttenberg wrote: I think that we are beyond the point where that kind of extremely idealised account is useful for evaluating web technologies. We will agree to disagree then. Perhaps in another thread you will say what *will* be useful for evaluating web technologies. Adoption trends, ergonomics, fit with the existing technology ecosystem, existence of migration paths, marketability, potential of network effects. Does what the technology *accomplishes* fit in there somewhere? Web technologies are never about accomplishing anything new; they are about taking something that already works on a small and local scale, and making it work across the internet with its loosely coordinated actors. Looking at the above, one might conclude that a successful Ponzi scheme of some sort would score well. :-) If you want to look at it that way, standards, like anything that exhibits network effects, are a bit like a ponzi scheme: once you're inside, you benefit from getting others in your vicinity on board. The difference is that “late adopters” in a ponzi scheme are the suckers who lose their investment; while late adopters of a standard get the largest benefit at the smallest cost. Best, Richard
Re: Squaring the HTTP-range-14 circle
On 15 Jun 2011, at 23:54, Francois-Paul Servant wrote: And here you and Pat and Alan (and TimBL, for that matter) are preaching that we can't use this one billion of fantastic free URIs to identify things because it wouldn't make semantic sense. do you mean that it's OK to use wikipedia URIs instead of dbPedia ones? I think it's ok. Should we stop using dbPedia URIs? When publishing data, it's good to use URIs that resolve to structured data; this is what weaves data into a web. Therefore, I prefer DBpedia or Freebase URIs. Best, Richard
Re: Squaring the HTTP-range-14 circle
I disagree with this post very strongly, and it is hard to know where to start, and I am surprised to see it. On 2011-06 -13, at 07:41, Richard Cyganiak wrote: On 13 Jun 2011, at 09:59, Christopher Gutteridge wrote: The real problem seems to me that making resolvable, HTTP URIs for real world things was a clever but dirty hack and does not make any semantic sense. Well, you worry about *real-world things*, but even people who just worry about *documents* have said for two decades that the web is broken because it conflates names and addresses. No, some people didn't get the architecture in that they had learned systems where there that there was a big distinction between names and address, and they had different properties, and then they came across URIs which had properties of both. And they keep proposing things like URNs and info: URIs and tag: URIs and XRIs and DOIs to fix that and to separate the naming concern from the address concern. And invariably, these things fizzle around in their little niche for a while and then mostly die, because this aspect that you call a “clever but dirty hack” is just SO INCREDIBLY USEFUL. And being useful trumps making semantic sense. I agree ... except that ther URI architectre being like names and like addresses isn't a clever but dirty hack. You then connect this with the idea of using HTTP URIs for real-world things, which is a separate queston. This again is a question of architecture. Of design of a system. We can make it work either way. We have to work out which is best. I don't think 303 is a quick and dirty hack. It does mean a large extension of HTTP to be uses with non-documents. It does have efficiency problems. It is an architectural extension to the web architecture. HTTP has been successfully conflating names and addresses since 1989. That is COMPLETELY irrelevant. It is not a question of the web being fuzzy or ambiguous and getting away with it. It is a clean architecture where the concepts of name and address don't connect directly with those of people or files on a disk or IP hosts. There is a trillion web pages out there, all named with URIs. And even if just 0.1% of these pages are unambiguously about a single specific thing, that gives us a billion free identifiers for real-world entities, all already equipped with rich *human-readable* representations, and already linked and interconnected with *human-readable*, untyped, @href links. And these one billion URIs are plain old http:// URIs. They don't have a thing:// in the beginning, nor a tdb://, nor a #this or #that in the end, nor do they respond with 303 redirects or to MGET requests or whatever other nutty proposals we have come up with over the years to disambiguate between page and topic. They are plain old http:// URIs. A billion. Then add to that another huge number that already responds with JSON or XML descriptions of some interesting entity, like the one from Facebook that Kingsley mentioned today in a parallel thread. Again, no thing:// or tdb:// or #this or 303 or MGET on any of them. I want to use these URIs as identifiers in my data, and I have no intention of redirecting through an intermediate blank node just because the TAG fucked up some years ago. If you want to give yourself the luxury of being able to refer to the subject of a webpage, without having to add anthing to disambiguate it from the web page, then for the sake of your system, so you can use the billion web pages for your purposes, then you now stop other like me from using semantic web systems to refer to those web pages, or in fact to the other hundred million web pages either. Maybe you should an efficient way of doing what you want without destroying the system (which you as well have done so much to build) I want to tell the publishers of these web pages that they could join the web of data just by adding a few @rels to some as, and a few @properties to some spans, and a few @typeofs to some divs (or @itemtypes and @itemprops). And I don't want to explain to them that they should also change http:// to thing:// or tdb:// or add #this or #that or make their stuff respond with 303 or to MGET requests because you can't squeeze a dog through an HTTP connection. Well actually I really want them to put metadata about BOTH the document and its subject. There is masses of metadata already about documents. Now you want to make it ambiguous so I don't know whether it is about the document or its subject? I don't think something like about=#product is rocket science or unnatural. I really want people to be able to use RDF or microdata to say things about more than one thing in the same page And here you and Pat and Alan (and TimBL, for that matter) are preaching that we can't use this one billion of fantastic free URIs to identify things because it wouldn't make semantic sense. We are saying that
Re: Squaring the HTTP-range-14 circle
Hi Tim , documents per se (a la HTTP response 200 response) on the web are less and less relevant as opposed to the conceptual entities that are represented by this document and held e.g. as DB records inside CMS, social networks etc. e.g. a social network is about people those are the important entities. Then there might be 1000 different HTTP documents that you can get e.g.i f you're logged if you're not logged, if you have a cookie if you have another cookie, if you add format=print. Specific URLs are pretty irrelevant as they contain all sort of extra information. Layouts of CMS or web apps change all the time (and so do the HTML docs) but not the entities. that's why http response 200 level annotations are of such little ambiguity really you say you have so many annotations about documents, i honestly dont understand what you're referring to, are these HTTP retrievable documents? where are the annotations? are we talking about the http headers? about the meta tags in the head these are about the subject of the page too most of the time, not the page itself. and this is the idea behind schema.org (opengraph whatever) which sorry Tim you have to live with and we have to do the most with. When someone refers to a URL which embeds a opengraph or schema.organnotation then it is 99.+ (with the number of 9 augmenting as the web evolves to a rich app platform) certain that they refer to the entity described in it and not to the web document itself (which can and does change all the time and is of overall no conceptual relevance). With respect to schema.org, we (as semantic web community) have not been ignored: our work and proposals have been very well considered and then diregarded alltogether - and for several reasons : 12 years of work, not an agreement on ontology, not an easy way for people to publish data ( the 303 thing is a complete total utter insanity (as i had said in vain so many times) ). etc. So, think of how browsers work: they fix all the broken HTML markup doing what it takes to undertand more or less the intention behind the broken markup. The same will exactly happen with applications that work on semantic markup at web scale. they will do the specific cleanups and adaptations as they need. *the UPSIDE* of this is that RDF is a totally cool technology which can most of the time rule them all . Sindice is entirely RDF based, but then reads and processes microformats, RDF, RDFa, and next week schema.org too microdata. So long life to all really. Fights work fighting: having RDFa play well along schema.org so that schema.org tags can be written in RDFa and search engines will still read it. This will allow people to still use rich representations and vocabularies while not loosing compatibilities with the mainstream apps which will be developed for schema.org compatible pages. Gio On Thu, Jun 16, 2011 at 7:04 PM, Tim Berners-Lee ti...@w3.org wrote: I disagree with this post very strongly, and it is hard to know where to start, and I am surprised to see it. On 2011-06 -13, at 07:41, Richard Cyganiak wrote: On 13 Jun 2011, at 09:59, Christopher Gutteridge wrote: The real problem seems to me that making resolvable, HTTP URIs for real world things was a clever but dirty hack and does not make any semantic sense. Well, you worry about *real-world things*, but even people who just worry about *documents* have said for two decades that the web is broken because it conflates names and addresses. No, some people didn't get the architecture in that they had learned systems where there that there was a big distinction between names and address, and they had different properties, and then they came across URIs which had properties of both. And they keep proposing things like URNs and info: URIs and tag: URIs and XRIs and DOIs to fix that and to separate the naming concern from the address concern. And invariably, these things fizzle around in their little niche for a while and then mostly die, because this aspect that you call a “clever but dirty hack” is just SO INCREDIBLY USEFUL. And being useful trumps making semantic sense. I agree ... except that ther URI architectre being like names and like addresses isn't a clever but dirty hack. You then connect this with the idea of using HTTP URIs for real-world things, which is a separate queston. This again is a question of architecture. Of design of a system. We can make it work either way. We have to work out which is best. I don't think 303 is a quick and dirty hack. It does mean a large extension of HTTP to be uses with non-documents. It does have efficiency problems. It is an architectural extension to the web architecture. HTTP has been successfully conflating names and addresses since 1989. That is COMPLETELY irrelevant. It is not a question of the web being fuzzy or ambiguous and getting away with it. It is a clean architecture where the concepts of name and address don't
Re: Squaring the HTTP-range-14 circle
On 13 Jun 2011, at 13:41, Richard Cyganiak wrote: I want to use these URIs as identifiers in my data, and I have no intention of redirecting through an intermediate blank node just because the TAG fucked up some years ago. The TAG did not f.up as you say, and you can do what you want anyway. - http:// URLs with 200 responses do point to information resources. - you can use those the way you wish to indirectly refer to you. Just change your vocabulary if you do. [snip] http://richard.cyganiak.de/ a foaf:Document; dc:title Richard Cyganiak's homepage; a foaf:Person; foaf:name Richard Cyganiak; owl:sameAs http://twitter.com/cygri; . so instead of using foaf:knows to relate a person and a document, create a docfriend ontology and have it relate a document that describes you to a number of attributes about you http://richard.cyganiak.de/ a foaf:Document; docfriend:name Richard Cyganiak; dc:title: Richard Cyganiak's homepage; docfriend:knows http://bblfish.net/ . Then you just need rules such as { ?pg docfriend:name ?nm } = { ?pg foaf:primaryTopic ?p . ?p foaf:name ?nm } and you can convert between the two. You don't of course need to use the bnode inducing foaf ontology, but can stick to your docfriend ontology. You just will notice of course that there cannot be more than one person talked of per such page. A restriction you will find it difficult to convince some people to abide by. This discussion is really of no importance, and is just a great time waster on the semantic web. I would suggest people go and build apps that work, that are used and that are viral: where every person who uses it increases the value of the network, gets others to join and use the data. Henry There. If your knowledge representation formalism isn't smart enough to make sense of that, then it may just not be quite ready for the web, and you may have some work to do. Best, Richard Social Web Architect http://bblfish.net/
Re: Squaring the HTTP-range-14 circle
On 6/16/11 6:53 PM, Giovanni Tummarello wrote: Hi Tim , documents per se (a la HTTP response 200 response) on the web are less and less relevant as opposed to the conceptual entities that are represented by this document and held e.g. as DB records inside CMS, social networks etc. e.g. a social network is about people those are the important entities. Then there might be 1000 different HTTP documents that you can get e.g.i f you're logged if you're not logged, if you have a cookie if you have another cookie, if you add format=print. Specific URLs are pretty irrelevant as they contain all sort of extra information. Layouts of CMS or web apps change all the time (and so do the HTML docs) but not the entities. that's why http response 200 level annotations are of such little ambiguity really you say you have so many annotations about documents, i honestly dont understand what you're referring to, are these HTTP retrievable documents? Tim is saying, and pretty clearly: there are a lot of resources in HTML format on the Web. You access these via URLs (Addresses). Basically, you GET data from Addresses. where are the annotations? are we talking about the http headers? about the meta tags in the head these are about the subject of the page too most of the time, not the page itself. In these resources (projected as HTML documents) there is a lot of metadata. Dig a little deeper, there are also varying degrees of metadata in the HTTP responses. What doesn't exist is use of an abstraction whereby the Subject Matter items (what the HTML docs are about) are Identified by Names that resolve to their Representations which are best served via graph pictorials. and this is the idea behind schema.org http://schema.org (opengraph whatever) which sorry Tim you have to live with and we have to do the most with. Tim is not saying he has a problem with schema.org. He might imply that schema.org is deviating from the aspect of AWWW that delivers the abstraction necessary for schema.org to refer to entities (data objects) by Names that are distinct from the Addresses of their Representation(s). When someone refers to a URL which embeds a opengraph or schema.org http://schema.org annotation then it is 99.+ (with the number of 9 augmenting as the web evolves to a rich app platform) certain that they refer to the entity described in it and not to the web document itself (which can and does change all the time and is of overall no conceptual relevance). We are already dealing with the schema.org issues [1][2] the best way it can be handled until opportunity costs veer them towards upping the semantic fidelity of their contribution. We can live with schema.org, but lets not conflate that effort with some vital fundamentals re. Linked Data and best practices based on AWWW. In my eyes, schema.org is a massive vector re. structured data injected into the Web. Semantic fidelity was never their focus. Basically, as stated in an older post, Google, Microsoft, Yahoo!, Facebook and friends, all seek to contribute structured data from their respective data spaces, this already makes sense to them and is 100% compatible with their respective business models. Naturally, we would like them to do more, but you can't tell them to do more, all you can do is make opportunity costs palpable to them and eventually they will respond. With respect to schema.org http://schema.org, we (as semantic web community) have not been ignored: our work and proposals have been very well considered and then diregarded alltogether - and for several reasons : 12 years of work, not an agreement on ontology, not an easy way for people to publish data ( the 303 thing is a complete total utter insanity (as i had said in vain so many times) ). etc. 303 isn't insanity! It is basic computing re. data access by reference. de-reference (indirection) and address-of operations are fundamental elements of any kind of environment that allows access, movement, and manipulation of data. You always have Names and Addresses. In fact, you have them in the real world, but I don't want veer down a discussion on semiotics and philosophy. The Web of Documents works because Document Addresses (URLs) have become intuitive. Evolving the Web to a Linked Data Space is a little trickier with HTTP URIs because the Name operation unveils a powerful but unnatural abstraction due to the fact that an HTTP URI based Name looks and feels like an HTTP URI based Location Name (Address). HTTP 303 is just doing what programming languages do behind the scenes whenever you access data objects by reference. If anything, its a tribute to the flexibility of the HTTP protocol. Basically, we have the Web now pulling of the same data access and manipulation capabilities that host operating systems have delivered to systems developers since forever. [SNIP] Kingsley Gio On Thu, Jun 16, 2011 at 7:04 PM, Tim
Re: Squaring the HTTP-range-14 circle
On 6/16/11 9:16 PM, Kingsley Idehen wrote: We are already dealing with the schema.org issues [1][2] the best way it can be handled until opportunity costs veer them towards upping the semantic fidelity of their contribution. Links: 1. http://schema.rdfs.org 2. http://uriburner.com/describe/?url=http%3A%2F%2Fwww.openlinksw.com%2Fschemas%2Frdfs%2F -- our WIP effort that builds on both schema.rdfs.org and schema.org (more to come re. mapping to FOAF, SIOC, GoodRelations, and others) . -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Jun 15, 2011, at 7:36 PM, Danny Ayers wrote: On 15 June 2011 18:30, Pat Hayes pha...@ihmc.us wrote: Boy, that is a humdinger of a non-sequiteur. Given that HTTP has flexibility, it is OK to identify a description of a thing with the actual thing? To me that sounds like saying, given that movies are projected, it is OK to say that fish are bicycles. Not that I think I did a non-sequiteur, it is totally ok to say that fish are bicycles, if that's what you want to say. [snip] OK, thanks. Here is your argument, as far as I can understand it. 1. HTTP representations may be partial or incomplete. (Agreed.) 2. HTTP reps can have many different media types, and this is OK. (Agreed, though I cant see what relevance this has to anything.) 3. A description is a kind of representation. (Agreed, and there was no need to get into the 'isomorphism' trap. We in KRep have been calling descriptions representations for decades now.) 4. Therefore, a HTTP URI can simultaneously be understood as referring to a document and a car. Whaaat? How in Gods name can you derive this conclusion from those premises? my wording could be better, but I stand by it... a document describing the car, through HTTP, can be an equally valid representation of the named car resource as the car itself (as long as it's qualified by media type) Not only do I not follow your reasoning, I don't even know what it is you are saying. The document is a valid *representation* of the car, yes of course. But as valid as the car itself? So you think a car is a representation of itself? Or are you drawing a contrast between the 'named car resource' and the car itself? ??? Maybe it would be best if we just dropped this now. I gather that you were offering me a way to make semantic sense of something, but Im not getting any sense at all out of this discussion, I am afraid. Pat Cheers, Danny. -- http://danny.ayers.name IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Re: Squaring the HTTP-range-14 circle
Tim, On Thu, Jun 16, 2011 at 6:04 PM, Tim Berners-Lee ti...@w3.org wrote: I don't think 303 is a quick and dirty hack. It does mean a large extension of HTTP to be uses with non-documents. It does have efficiency problems. It is an architectural extension to the web architecture. We have had many years for this architectural extension to be adopted and many of us producing linked data have been diligent in supporting, promoting and educating people about it. Even I, with my many many attempts to get this decision reconsidered, have promoted the W3C consensus. Conversely, many more people have studied this extension and rejected it. Companies such as Google, Facebook, Microsoft and Yahoo, who are all W3C members and can influence these decisions through formal channels if they wish, have looked at the httpRange decsion and decided it doesn't work for them. Instead they have chosen different approaches that require more effort to consume but lower the conceptual barrier for publishers. However, they are convinced of the need for URIs to identify things that are not just web pages which is a huge positive. These companies collectively account for a very large proportion of web traffic and activity. I think just saying that they're wrong and should change their approach is simply being dogmatic. They are telling us that we are wrong. We should listen to them. If you want to give yourself the luxury of being able to refer to the subject of a webpage, without having to add anthing to disambiguate it from the web page, then for the sake of your system, so you can use the billion web pages for your purposes, then you now stop other like me from using semantic web systems to refer to those web pages, or in fact to the other hundred million web pages either. The problem here is that there are so few things that people want to say about web pages compared with the multitude of things they want to say about every other type of thing in existence. Yet the httpRange decision makes the web page a privileged component. I understand why that might have seemed a useful decision, after all this is the web we are talking about, but it has turned out not to be. The web page is only the medium for conveying information about the things we are really interested in. The analogy is metadata about a book. Very little of it is about the physical book, i.e. the medium. Perhaps you would want to record its dimensions, mass, colour, binding or construction. There are many many more things you would want to record about the book's content, themes, people and places mentioned, author etc. Maybe you should an efficient way of doing what you want without destroying the system (which you as well have done so much to build) I think this is unreasonably strong. Nothing is being destroyed. Nothing has broken. A few days after I wrote this post (http://blog.iandavis.com/2010/12/06/back-to-basics/) I changed one of the many linked datasets I maintain to stop using 303 redirects over a few million resources. No-one has noticed yet. Nothing has broken. Ian
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Jun 15, 2011, at 8:27 PM, Danny Ayers wrote: On 16 June 2011 02:26, Pat Hayes pha...@ihmc.us wrote: If you agree with Danny that a description can be a substitute for the thing it describes, then I am waiting to hear how one of you will re-write classical model theory to accommodate this classical use/mention error. You might want to start by reading Korzybski's 'General Semantics'. IANAL, but I have heard of the use/mention thing, quite often. I don't honestly know whether classical model theory needs a rewrite, but I'm sure it doesn't on the basis of this thread. I also don't know enough to know whether it's applicable - from your reaction, I suspect not. As a publisher of information on the Web, I'm pretty much free to say what I like (cf. Tim's Design Notes). Fish are bicycles. But that isn't very useful. But if I say Sasha is some kind of weird Collie-German Shepherd cross, that has direct relevance to Sasha herself. True. More, the arcs in my description between Sasha and her parents Sasha and her parents are not themselves in your description. I presume you mean, the arcs between the terms you use, in your description, to refer to Sasha and her parents. have direct correspondence with the arcs between Sasha and her parents. Sasha and her parents don't have arcs between them (unless you are indulging in some cruel treatment of animals.) I presume you mean to refer to certain relationships which hold between Sasha and her parents. In this simple case (explicitly named relationships, explicit referring names) there is a kind of structural correspondence between the description and the reality, indeed. But as soon as you make the descriptive language even slightly more expressive, this breaks down. (Try adding negation or disjunction of even blank nodes.) And as soon as you admit that reality is more complex than any description of it, it breaks down. So its not a very good foundation to build a semantic theory upon. There is information common to the reality and the description (at least in human terms). No. The reality is what it is; the information is held in the description (the one with the arcs and the names in it.) The information is ABOUT Sash and her parents (and the relationship of parenthood and various categories of doggitude, and so forth.) The description may, when you stand back, be very different in its nature to the reality, You betcha. but if you wish to make use of the information, such common aspects are valuable. What common aspects? If you mean to refer to the fact that a description with arcs and names can be TRUE OF some aspect of reality, you are talking about classical model-theoretic semantics, which is based on the idea of reference (AKA denotation) at its root; it is the interpretation mapping from names to the things they are interpreted to refer to (eg between Sasha and Sasha.) But the truth-in-an-interpretation relationship is not similarity or isomorphism, and it certainly does not warrant identifying the name with the thing named. Quite the contrary, it relies upon keeping this distinction clear. As Korzybski famously said, the map is not the territory. We've already established that HTTP doesn't deal with any kind of one true representation. Data about Sasha's parentage isn't Sasha, but it's closer than a non-committal 303 or rdfs:seeAlso. Closer? In what metric? I would say it is about as different as anything can get. There's nothing around HTTP that says it can't be given the same name, and it's a darn sight more useful than a wave-over-there redirect or a random fish/bike association. I can't see anything it breaks either. OF COURSE it breaks things. It might be true to say that Sasha is a Collie-German Shepherd cross, but Sasha's description or web page certainly isn't. It might be true to say that the description is written in RDF, but Sasha isn't. Pat Cheers, Danny. -- http://danny.ayers.name IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Jun 15, 2011, at 10:04 PM, Jason Borro wrote: Apologies if my keyboard sneered at you, though comparing an application problem to 1% of hr14 at web scale hardly trivializes it; certainly it does the opposite. Good luck preserving your mental model if you require webmasters to spell Korzybski. I'd prefer they actually read him, though I won't hold my breath. Sorry to bother you by using a very long foreign name. Pat On 6/15/2011 6:26 PM, Pat Hayes wrote: On Jun 15, 2011, at 1:35 PM, Jason Borro wrote: I agree with your sentiments Danny, fwiw. The current scheme is a burden on publishers for the sake of a handful of applications that wish to refer to these information resources themselves, making them unable to talk about Web pages using the Web description language RDF. What about minting a new URI at http://information.resourcifier.net/encodedURI; or similar for talking about such things? The service could even add value by tracking last update times, content types, encodings, etc. Jason p.s. Don't bother criticizing the half baked idea, I thought about it for 10 seconds. The point is 100 alternatives could have been hashed out in the time spent discussing and implementing http-range-14. I confess to finding this kind of sneering remark rather annoying. If you think it is this trivial to work out some 'alternative', why don't you come up with a few actual ideas and see what happens when they get a little peer review? Your idea, above, hardly makes first base, as Im sure you already realized when you added the p.s. So why not try inventing one that has a snowballs chance in hell of actually working? Im sure that the world would be delighted if you could solve this trivial problem in 5 ways, let alone a hundred. If you agree with Danny that a description can be a substitute for the thing it describes, then I am waiting to hear how one of you will re-write classical model theory to accommodate this classical use/mention error. You might want to start by reading Korzybski's 'General Semantics'. Pat IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Jun 16, 2011, at 4:38 AM, Richard Cyganiak wrote: On 16 Jun 2011, at 07:05, Alan Ruttenberg wrote: I think that we are beyond the point where that kind of extremely idealised account is useful for evaluating web technologies. We will agree to disagree then. Perhaps in another thread you will say what *will* be useful for evaluating web technologies. Adoption trends, ergonomics, fit with the existing technology ecosystem, existence of migration paths, marketability, potential of network effects. Does what the technology *accomplishes* fit in there somewhere? Web technologies are never about accomplishing anything new; they are about taking something that already works on a small and local scale, and making it work across the internet with its loosely coordinated actors. Looking at the above, one might conclude that a successful Ponzi scheme of some sort would score well. :-) If you want to look at it that way, standards, like anything that exhibits network effects, are a bit like a ponzi scheme: once you're inside, you benefit from getting others in your vicinity on board. The difference is that “late adopters” in a ponzi scheme are the suckers who lose their investment; while late adopters of a standard get the largest benefit at the smallest cost. LOL Pat IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Thu, 2011-06-16 at 16:38 -0500, Pat Hayes wrote: On Jun 15, 2011, at 8:27 PM, Danny Ayers wrote: [ . . . ] There's nothing around HTTP that says it can't be given the same name, and it's a darn sight more useful than a wave-over-there redirect or a random fish/bike association. I can't see anything it breaks either. OF COURSE it breaks things. It might be true to say that Sasha is a Collie-German Shepherd cross, but Sasha's description or web page certainly isn't. It might be true to say that the description is written in RDF, but Sasha isn't. Let's go further and clarify exactly what breaks: Using the same URI both for Sasha and Sasha's web page breaks *some* applications and not others. Applications that need to distinguish between dogs and web pages will find the URI ambiguous; applications that do not will be perfectly happy. This state of affairs is a universal fact of life that is true of *all* possible distinctions that may be made, regardless of whether the distinction is between web pages and dogs, or between different kinds of dogs, or between different kinds of proteins or anything else. Except in the absurdly reductionist sense that *every* URI is ambiguous (because finer distinctions can always be made), whether a URI is ambiguous or unambiguous is *not* a fundamental property of the URI: ambiguity is relative to the *application* that is using that URI. Given this fact of life, I maintain that permitting the same URI to denote both a web page and a dog does *not* break the architecture of the web. I agree with TimBL that this is a design choice about the architecture of the web, and a clean, extensible architecture is needed. I agree with TimBL that 303 (and hash URIs) are useful for those who *choose* to distinguish between the web page and something else. I agree with TimBL that the httpRange-14 rule is very useful, even if it was not ideally stated, and should *not* be abandoned. However, the major flaw lies not in the httpRange-14 rule itself, but in the associated assumption that a URI cannot sensibly denote both an information resource and a dog: http://www.w3.org/TR/webarch/#def-information-resource This assumption is fatally flawed because: (a) it attempts to make an IR/non-IR distinction that can never be nailed down precisely (as several people have pointed out); and (b) it unnecessarily elevates one particular axis of ambiguity over all others. It is analogous to a rule that says all URIs for dogs MUST distinguish between male dogs and female dogs: the only applications that break without this rule are the ones that *need* to distinguish between male dogs and female dogs. All other applications will continue to work just fine without it. And that is exactly the way it should be for *any* axis of ambiguity. I agree with TimBL that it is *good* to distinguish between web pages and dogs -- and we should encourage folks to do so -- because doing so *does* help applications that need this distinction. But the failure to make this distinction does *not* break the web architecture any more than a failure to distinguish between male dogs and female dogs. -- David Booth, Ph.D. http://dbooth.org/ Opinions expressed herein are those of the author and do not necessarily reflect those of his employer.
Re: Squaring the HTTP-range-14 circle
Ian, On 2011-06 -16, at 16:41, Ian Davis wrote: Tim, On Thu, Jun 16, 2011 at 6:04 PM, Tim Berners-Lee ti...@w3.org wrote: I don't think 303 is a quick and dirty hack. It does mean a large extension of HTTP to be uses with non-documents. It does have efficiency problems. It is an architectural extension to the web architecture. We have had many years for this architectural extension to be adopted and many of us producing linked data have been diligent in supporting, promoting and educating people about it. Even I, with my many many attempts to get this decision reconsidered, have promoted the W3C consensus. Conversely, many more people have studied this extension and rejected it. Companies such as Google, Facebook, Microsoft and Yahoo, who are all W3C members and can influence these decisions through formal channels if they wish, have looked at the httpRange decsion and decided it doesn't work for them. I haven't seen that them saying that. I have only seen the resulting RDFa. Instead they have chosen different approaches that require more effort to consume but lower the conceptual barrier for publishers. However, they are convinced of the need for URIs to identify things that are not just web pages which is a huge positive. Each of these players has said they nee to use the same URI for BOTH the document and the dog? It was perhaps just taking RDF/a as non-rdf by people who weren't using it in RDF systems and so who were not combining it with info about the web page. These companies collectively account for a very large proportion of web traffic and activity. I think just saying that they're wrong and should change their approach is simply being dogmatic. They are telling us that we are wrong. We should listen to them. I have not said that they are wrong in trying to make it very simple for people to say things about the subject of the page. Facebook used the standard in the simplest way could. For example, OGP uses consistently a set of properties which of the style: ogp:foo Whatever. where other might have written foaf:primarySubject #grapes. #grapes ex:foo Whatever. (Here ex: is a parallel namespace to ogp:) These triples are all consistent, in fact, so a rule which generates one for the other is easy. you can also do it in OWL by declaring org:foo to be a chain of primarySubject and ex:foo. Two ways to go to make this work, and many more can be done 1) Allow the parallel properties to exist and be related publically to the normal ones. 2) Fix the RDFa/microdata/whatever syntax to make it trivial and obvious to make statements about a single subject. We can clearly do both. If you want to give yourself the luxury of being able to refer to the subject of a webpage, without having to add anthing to disambiguate it from the web page, then for the sake of your system, so you can use the billion web pages for your purposes, then you now stop other like me from using semantic web systems to refer to those web pages, or in fact to the other hundred million web pages either. The problem here is that there are so few things that people want to say about web pages compared with the multitude of things they want to say about every other type of thing in existence. Well, that is a wonderful new thing. For a long while it was difficult to put data on the web, while there is quite a lot of metadata. Wonderful idea that the semantic web may be beating the document web hands down but that's not totally clear that we should trash the use of URIs for use to refer to documents as do in the document web. Yet the httpRange decision makes the web page a privileged component. With 200 yes, as you have to allow the exiting web, small though you say it is, to still function, when 200 is returned. I understand why that might have seemed a useful decision, after all this is the web we are talking about, but it has turned out not to be. The web page is only the medium for conveying information about the things we are really interested in. That may be true, but that doesn't mean that anyone should use the same URI for talking about both. The analogy is metadata about a book. Very little of it is about the physical book, i.e. the medium. Perhaps you would want to record its dimensions, mass, colour, binding or construction. There are many many more things you would want to record about the book's content, themes, people and places mentioned, author etc. Maybe you should an efficient way of doing what you want without destroying the system (which you as well have done so much to build) I think this is unreasonably strong. Nothing is being destroyed. Nothing has broken. If you use HTTP 200 for something different, then you break my ability to look at a page, review it, and then express my review in RDF, using the page's URI as the identifier. A few days
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 13 June 2011 07:52, Pat Hayes pha...@ihmc.us wrote: OK, I am now completely and utterly lost. I have no idea what you are saying or how any of it is relevant to the http-range-14 issue. Want to try running it past me again? Bear in mind that I do not accept your claim that a description of something is in any useful sense isomorphic to the thing it describes. As in, some RDF describing, say, the Eiffel tower is not in any way isomorphic to the actual tower. (I also do not understand why you think this claim matters, by the way.) Perhaps we are understanding the meaning of http-range-14 differently. My understanding of it is as follows: if an HTTP GET applied to a bare URI http:x returns a 200 response, then http:x is understood to refer to (to be a name for, to denote) the resource that emitted the response. Hence, it follows that if a URI is intended to refer to something else, it has to emit a different response, and a 303 redirect is appropriate. It also follows that in the 200 case, the thing denoted has to be the kind of thing that can possibly emit an HTTP response, thereby excluding a whole lot of things, such as dogs, from being the referent in such cases. Even with information resources there's a lot of flexibility in what HTTP can legitimately respond with, there needn't be bitwise identity across representations of an identified resource. Given this, I'm proposing a description can be considered a good-enough substitute for an identified thing. Bearing in mind it's entirely up to the publisher if they wish to conflate things, and up to the consumer to try and make sense of it. As a last attempt - this is a tar pit! - doing my best to take on board your (and other's) comments, I've wrapped up my claims in a blog post: http://dannyayers.com/2011/06/15/httpRange-14-Reflux Cheers, Danny. -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
I agree with your sentiments Danny, fwiw. The current scheme is a burden on publishers for the sake of a handful of applications that wish to refer to these information resources themselves, making them unable to talk about Web pages using the Web description language RDF. What about minting a new URI at http://information.resourcifier.net/encodedURI; or similar for talking about such things? The service could even add value by tracking last update times, content types, encodings, etc. Jason p.s. Don't bother criticizing the half baked idea, I thought about it for 10 seconds. The point is 100 alternatives could have been hashed out in the time spent discussing and implementing http-range-14. Kudos to google et al for ignoring it. On 6/15/2011 9:27 AM, Danny Ayers wrote: On 13 June 2011 07:52, Pat Hayespha...@ihmc.us wrote: OK, I am now completely and utterly lost. I have no idea what you are saying or how any of it is relevant to the http-range-14 issue. Want to try running it past me again? Bear in mind that I do not accept your claim that a description of something is in any useful sense isomorphic to the thing it describes. As in, some RDF describing, say, the Eiffel tower is not in any way isomorphic to the actual tower. (I also do not understand why you think this claim matters, by the way.) Perhaps we are understanding the meaning of http-range-14 differently. My understanding of it is as follows: if an HTTP GET applied to a bare URI http:x returns a 200 response, then http:x is understood to refer to (to be a name for, to denote) the resource that emitted the response. Hence, it follows that if a URI is intended to refer to something else, it has to emit a different response, and a 303 redirect is appropriate. It also follows that in the 200 case, the thing denoted has to be the kind of thing that can possibly emit an HTTP response, thereby excluding a whole lot of things, such as dogs, from being the referent in such cases. Even with information resources there's a lot of flexibility in what HTTP can legitimately respond with, there needn't be bitwise identity across representations of an identified resource. Given this, I'm proposing a description can be considered a good-enough substitute for an identified thing. Bearing in mind it's entirely up to the publisher if they wish to conflate things, and up to the consumer to try and make sense of it. As a last attempt - this is a tar pit! - doing my best to take on board your (and other's) comments, I've wrapped up my claims in a blog post: http://dannyayers.com/2011/06/15/httpRange-14-Reflux Cheers, Danny.
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
* [2011-06-14 08:55:09 -0700] Pat Hayes pha...@ihmc.us écrit: ] Well, you have got me confused. Are you saying here that it does ] in fact make sense to say that a description of the eiffel tower ] is 356M tall? I'm just saying that things like this will be published because the publisher is confused, or mistaken or doesn't think that making the distinction is important or convenient and consumers of the data have to deal with it. We should encourage the publishers to do a better job but some of them will balk and sometimes, like with the schema.org that started this thread, big, important publishers with a lot of influence will balk. If we're lucky we can convince them to fix it, otherwise writers of software that consumes the data and tries to reason with it have to work out a way to be robust in the face of this kind of ambiguity. That's all. -w -- William Waitesmailto:w...@styx.org http://river.styx.org/ww/sip:w...@styx.org F4B3 39BF E775 CF42 0BAB 3DF0 BE40 A6DF B06F FD45
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/15/11 4:27 PM, Danny Ayers wrote: On 13 June 2011 07:52, Pat Hayespha...@ihmc.us wrote: OK, I am now completely and utterly lost. I have no idea what you are saying or how any of it is relevant to the http-range-14 issue. Want to try running it past me again? Bear in mind that I do not accept your claim that a description of something is in any useful sense isomorphic to the thing it describes. As in, some RDF describing, say, the Eiffel tower is not in any way isomorphic to the actual tower. (I also do not understand why you think this claim matters, by the way.) Perhaps we are understanding the meaning of http-range-14 differently. My understanding of it is as follows: if an HTTP GET applied to a bare URI http:x returns a 200 response, then http:x is understood to refer to (to be a name for, to denote) the resource that emitted the response. Hence, it follows that if a URI is intended to refer to something else, it has to emit a different response, and a 303 redirect is appropriate. It also follows that in the 200 case, the thing denoted has to be the kind of thing that can possibly emit an HTTP response, thereby excluding a whole lot of things, such as dogs, from being the referent in such cases. Even with information resources there's a lot of flexibility in what HTTP can legitimately respond with, there needn't be bitwise identity across representations of an identified resource. Given this, I'm proposing a description can be considered a good-enough substitute for an identified thing. Bearing in mind it's entirely up to the publisher if they wish to conflate things, and up to the consumer to try and make sense of it. As a last attempt - this is a tar pit! - doing my best to take on board your (and other's) comments, I've wrapped up my claims in a blog post: http://dannyayers.com/2011/06/15/httpRange-14-Reflux Cheers, Danny. Danny, This is part of the problem: TBL's argument: the HTTP URIs (without #) should be understood as referring to documents, not cars. It assumes that the audience doesn't have a clue, so the description has to be so condescending albeit inadvertent. How about: TBL's argument: the HTTP URIs (without #) should be understood as referring to an Address. A Data Source Name. What data publisher provides to user agents for accessing specific data in a given format, courtesy of content negotiation or lack thereof etc.. The confusion is a self inflicted one courtesy of narrative style and tone, methinks. URIs abstract Names and Addresses. This whole thing isn't unlike DNS. Points of presence on TCP/IP networks have NIC addresses and cnames, courtesy of DNS. Spreadsheets have offered cell addresses and cell names since forever. Programmers have worked with de-reference (indirection) and address-of operators forever. Most of the time when they encounter the: ... is a document, not cars ... style narrative, its throws them for a loop! As you know, a Document == Data Container that's projected to users via user agents (typically browsers) using a specific presentation oriented metaphor. Using 303 to deliver indirection is an accurate reflection of the required heuristic for implementing de-reference (indirection) via HTTP URI based Names. Otherwise, use a # terminated URI and get similar (but ultimately limited) effects without an actual 303. Web users started off using Addresses as Names for Resources (Web Docs). Now we're introducing a new abstraction where Name and Address are Distinct (i.e., we have Named Objects and Object Representation Addresses, interwoven), thus we need to find a variety of ways to explain and demonstrate this new abstraction generally known as Linked Data. One size never fits all, and http-range-14 is certainly not going to be the narrative that breaks that age-old mold :-) -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle
Awesome rant Richard! I think this bit would work better live : I want to tell the publishers of these web pages that they could join the web of data just by adding a few @rels to some as, and a few @properties to some spans, and a few @typeofs to some divs (or @itemtypes and @itemprops). And I don't want to explain to them that they should also change http:// to thing:// or tdb:// or add #this or #that or make their stuff respond with 303 or to MGET requests because you can't squeeze a dog through an HTTP connection. for arguments with value, you may have hit the nail on the head here : Being useful trumps making semantic sense. The web succeeded *because* it conflates name and address. The web of data will succeed *because* it conflates a thing and a web page about the thing. Now tell me, why is it so easy to wind you up on these issues? Cheers, Danny. -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle
On 14 June 2011 10:49, Richard Cyganiak rich...@cyganiak.de wrote: On 13 Jun 2011, at 20:51, David Booth wrote: http://richard.cyganiak.de/ a foaf:Document; dc:title Richard Cyganiak's homepage; a foaf:Person; foaf:name Richard Cyganiak; owl:sameAs http://twitter.com/cygri; . That should be fine for applications that do not need to distinguish between foaf:Documents and foaf:Persons . . . which is a large class of applications. OTOH, there *are* applications that need to distinguish between foaf:Documents and foaf:Persons. *Those* applications will need to apply disambiguation techniques, and some of their owners will (wrongly) blame you for the perceived extra work it causes them -- extra only because they happen to be implementing a different class of application than your data best supports. Yes, good analysis. Not sure I'm comfortable with the notion of data being published with a predetermined class of consuming applications. The bottom lines are: publish what you want, interpret how you see fit. Somewhere between Postel and Aleister Crowley. My comments on httpRange-14 could not be any less relevant to the reality, I'd just rather things were kinda tidy rather than swept under the carpet (at home I have dog fur on the tiles). Yes, I do think if we can have some approximation of a consistent common model, that is better for communication. But it's pretty much a certainty that the best course of action is to live with whatever comes up and make the best of it. Build on what we can. Cue cliche if history has taught us anything... Cheers, Danny. -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Jun 15, 2011, at 1:35 PM, Jason Borro wrote: I agree with your sentiments Danny, fwiw. The current scheme is a burden on publishers for the sake of a handful of applications that wish to refer to these information resources themselves, making them unable to talk about Web pages using the Web description language RDF. What about minting a new URI at http://information.resourcifier.net/encodedURI; or similar for talking about such things? The service could even add value by tracking last update times, content types, encodings, etc. Jason p.s. Don't bother criticizing the half baked idea, I thought about it for 10 seconds. The point is 100 alternatives could have been hashed out in the time spent discussing and implementing http-range-14. I confess to finding this kind of sneering remark rather annoying. If you think it is this trivial to work out some 'alternative', why don't you come up with a few actual ideas and see what happens when they get a little peer review? Your idea, above, hardly makes first base, as Im sure you already realized when you added the p.s. So why not try inventing one that has a snowballs chance in hell of actually working? Im sure that the world would be delighted if you could solve this trivial problem in 5 ways, let alone a hundred. If you agree with Danny that a description can be a substitute for the thing it describes, then I am waiting to hear how one of you will re-write classical model theory to accommodate this classical use/mention error. You might want to start by reading Korzybski's 'General Semantics'. Pat Kudos to google et al for ignoring it. On 6/15/2011 9:27 AM, Danny Ayers wrote: On 13 June 2011 07:52, Pat Hayespha...@ihmc.us wrote: OK, I am now completely and utterly lost. I have no idea what you are saying or how any of it is relevant to the http-range-14 issue. Want to try running it past me again? Bear in mind that I do not accept your claim that a description of something is in any useful sense isomorphic to the thing it describes. As in, some RDF describing, say, the Eiffel tower is not in any way isomorphic to the actual tower. (I also do not understand why you think this claim matters, by the way.) Perhaps we are understanding the meaning of http-range-14 differently. My understanding of it is as follows: if an HTTP GET applied to a bare URI http:x returns a 200 response, then http:x is understood to refer to (to be a name for, to denote) the resource that emitted the response. Hence, it follows that if a URI is intended to refer to something else, it has to emit a different response, and a 303 redirect is appropriate. It also follows that in the 200 case, the thing denoted has to be the kind of thing that can possibly emit an HTTP response, thereby excluding a whole lot of things, such as dogs, from being the referent in such cases. Even with information resources there's a lot of flexibility in what HTTP can legitimately respond with, there needn't be bitwise identity across representations of an identified resource. Given this, I'm proposing a description can be considered a good-enough substitute for an identified thing. Bearing in mind it's entirely up to the publisher if they wish to conflate things, and up to the consumer to try and make sense of it. As a last attempt - this is a tar pit! - doing my best to take on board your (and other's) comments, I've wrapped up my claims in a blog post: http://dannyayers.com/2011/06/15/httpRange-14-Reflux Cheers, Danny. IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 15 June 2011 18:30, Pat Hayes pha...@ihmc.us wrote: Boy, that is a humdinger of a non-sequiteur. Given that HTTP has flexibility, it is OK to identify a description of a thing with the actual thing? To me that sounds like saying, given that movies are projected, it is OK to say that fish are bicycles. Not that I think I did a non-sequiteur, it is totally ok to say that fish are bicycles, if that's what you want to say. [snip] OK, thanks. Here is your argument, as far as I can understand it. 1. HTTP representations may be partial or incomplete. (Agreed.) 2. HTTP reps can have many different media types, and this is OK. (Agreed, though I cant see what relevance this has to anything.) 3. A description is a kind of representation. (Agreed, and there was no need to get into the 'isomorphism' trap. We in KRep have been calling descriptions representations for decades now.) 4. Therefore, a HTTP URI can simultaneously be understood as referring to a document and a car. Whaaat? How in Gods name can you derive this conclusion from those premises? my wording could be better, but I stand by it... a document describing the car, through HTTP, can be an equally valid representation of the named car resource as the car itself (as long as it's qualified by media type) Cheers, Danny. -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 16 June 2011 02:26, Pat Hayes pha...@ihmc.us wrote: If you agree with Danny that a description can be a substitute for the thing it describes, then I am waiting to hear how one of you will re-write classical model theory to accommodate this classical use/mention error. You might want to start by reading Korzybski's 'General Semantics'. IANAL, but I have heard of the use/mention thing, quite often. I don't honestly know whether classical model theory needs a rewrite, but I'm sure it doesn't on the basis of this thread. I also don't know enough to know whether it's applicable - from your reaction, I suspect not. As a publisher of information on the Web, I'm pretty much free to say what I like (cf. Tim's Design Notes). Fish are bicycles. But that isn't very useful. But if I say Sasha is some kind of weird Collie-German Shepherd cross, that has direct relevance to Sasha herself. More, the arcs in my description between Sasha and her parents have direct correspondence with the arcs between Sasha and her parents. There is information common to the reality and the description (at least in human terms). The description may, when you stand back, be very different in its nature to the reality, but if you wish to make use of the information, such common aspects are valuable. We've already established that HTTP doesn't deal with any kind of one true representation. Data about Sasha's parentage isn't Sasha, but it's closer than a non-committal 303 or rdfs:seeAlso. There's nothing around HTTP that says it can't be given the same name, and it's a darn sight more useful than a wave-over-there redirect or a random fish/bike association. I can't see anything it breaks either. Cheers, Danny. -- http://danny.ayers.name
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On Wed, Jun 15, 2011 at 11:04 PM, Jason Borro ja...@openguid.net wrote: Good luck preserving your mental model if you require webmasters to spell Korzybski. This is an odd comment. It's like saying good luck preserving your model of TCP if you require network developers to know where Postel worked. TCP has to work, whether or not webmasters know the intellectual history its development, and the same will be true of whatever eventually becomes what the semweb ideas are aiming at. Pat's knows something about the history of what's known to work and what isn't. You ignore that history at the peril of your ideas simply not working. -Alan
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
re On Mon, Jun 13, 2011 at 08:33:47PM -0700, Pat Hayes wrote: But if you are a semantic inference engine, and you get the dog and its picture muddled, will you likely generate a lot of nonsensical assertions? Answer, Yes, you will. Which is the key point at issue here. We should be able to present the user a lot of sensical assertions (and maybe some nonsensical ones) if we know he is concerned with information about dogs instead of information about pictures. Anyway - I think special purpose reasoners will play a much bigger role in the near future than general purpose reasoners because they perform better with big and messy data. And publishers will start to differenciate between dogs and pictures of dogs as soon as it provides them added value. Until that day, we will have to live with the situation and try to nudge people in the right direction (which includes httprange-14). But mass adoption means messy data in any case. Regards, Michael Brunnbauer -- ++ Michael Brunnbauer ++ netEstate GmbH ++ Geisenhausener Straße 11a ++ 81379 München ++ Tel +49 89 32 19 77 80 ++ Fax +49 89 32 19 77 89 ++ E-Mail bru...@netestate.de ++ http://www.netestate.de/ ++ ++ Sitz: München, HRB Nr.142452 (Handelsregister B München) ++ USt-IdNr. DE221033342 ++ Geschäftsführer: Michael Brunnbauer, Franz Brunnbauer ++ Prokurist: Dipl. Kfm. (Univ.) Markus Hendel
Re: Squaring the HTTP-range-14 circle
On 13 Jun 2011, at 20:51, David Booth wrote: http://richard.cyganiak.de/ a foaf:Document; dc:title Richard Cyganiak's homepage; a foaf:Person; foaf:name Richard Cyganiak; owl:sameAs http://twitter.com/cygri; . That should be fine for applications that do not need to distinguish between foaf:Documents and foaf:Persons . . . which is a large class of applications. OTOH, there *are* applications that need to distinguish between foaf:Documents and foaf:Persons. *Those* applications will need to apply disambiguation techniques, and some of their owners will (wrongly) blame you for the perceived extra work it causes them -- extra only because they happen to be implementing a different class of application than your data best supports. Yes, good analysis. Best, Richard
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/13/11 1:28 AM, Pat Hayes wrote: But I don't think all this is really germane to the http-range-14 issue. The point there is, does the URI refer to something like a representation (information resource, website, document, RDF graph, whatever) or something which definitely canNOT be sent over a wire? The Referent of a URI re., http-range-14 is the observation (or description) subject. In this context the subject may or may not be a real world object or entity. In the context of Linked Data, the observation (or description) subject URI resolves to a Representation of its Referent. Actual representation is accessible via an Address. Data representation formats are *optionally* negotiable e.g., via content negotiation, and ultimately varied i.e., many serialization formats for byte stream that actually transmits data from its source to its consumers. -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
On 6/13/11 6:52 AM, Pat Hayes wrote: OK, I am now completely and utterly lost. I have no idea what you are saying or how any of it is relevant to the http-range-14 issue. Want to try running it past me again? Bear in mind that I do not accept your claim that a description of something is in any useful sense isomorphic to the thing it describes. As in, some RDF describing, say, the Eiffel tower is not in any way isomorphic to the actual tower. (I also do not understand why you think this claim matters, by the way.) Perhaps we are understanding the meaning of http-range-14 differently. My understanding of it is as follows: if an HTTP GET applied to a bare URI http:x returns a 200 response, then http:x is understood to refer to (to be a name for, to denote) the resource that emitted the response. No, 200 OK means this URI is functionally an Address i.e., a place that's ready to transmit the byte stream associated with the Address. Hence, it follows that if a URI is intended to refer to something else, it has to emit a different response, and a 303 redirect is appropriate. When the functionality of the URI changes i.e., its a Name rather than an Address, courtesy of de-reference (indirection), there is a 303 redirect (an act of indirection). It also follows that in the 200 case, the thing denoted has to be the kind of thing that can possibly emit an HTTP response, Yes, a data server indicates to a client that a given Address is functional i.e., I'll transmit you a byte stream from this place which I crafted for this specific purpose. thereby excluding a whole lot of things, such as dogs, from being the referent in such cases. Yes, if the response is 200 OK since the URI is an Address. No if the response is a 303 since the URI is a Name. It still boils down to the URI abstraction which ingeniously caters for two vital data access by reference operations: Name (for de-reference and indirection) and Address (for Data Access). Kingsley Pat On Jun 12, 2011, at 6:46 PM, Danny Ayers wrote: On 13 June 2011 02:28, Pat Hayespha...@ihmc.us wrote: Next point: there can indeed be correspondences between the syntactic structure of a description and the aspects of reality it describes. That is what I was calling isomorphism (which I still don't think was inaccurate). But ok, say there are correspondences instead. I would suggest that those correspondences are enough to allow the description to take the place of a representation under HTTP definitions. But I don't think all this is really germane to the http-range-14 issue. The point there is, does the URI refer to something like a representation (information resource, website, document, RDF graph, whatever) or something which definitely canNOT be sent over a wire? I'm saying conceptually it doesn't matter if you can put it over the wire or not. But replace a novel written by a dog for dog in the above. Why should the concept of a document be fundamentally any different from the concept of a dog, hence representations of a document and representations of a dog? I dont follow your point here. If you mean, a document is just as real as a dog, I agree. So? But if you mean, there is no basic difference between a document and a dog, I disagree. And so does my cat. Difference sure, but not necessarily relevant. Ok, you can squeeze something over the wire that represents a novel written by a dog but you (probably) can't squeeze a dog over, but that's just a limitation of the protocol. So improved software engineering will enable us to teleport dogs over the internet? Come on, you don't actually believe this. It would save a lot of effort sometimes (walkies!) but all I'm suggesting is that if, hypothetically, you could teleport matter over the internet, all you'd be looking at as far as http-range-14 is concerned is another media type. Working back from there, and given correspondences as above, a descriptive document can be a valid representation of the identified resource even if it happens to be an actual thing, given that there isn't necessary any one true representation. We don't need the Information Resource distinction here (useful elsewhere maybe). Cheers, Danny. -- http://danny.ayers.name IHMC (850)434 8903 or (650)494 3973 40 South Alcaniz St. (850)202 4416 office Pensacola(850)202 4440 fax FL 32502 (850)291 0667 mobile phayesAT-SIGNihmc.us http://www.ihmc.us/users/phayes -- Regards, Kingsley Idehen President CEO OpenLink Software Web: http://www.openlinksw.com Weblog: http://www.openlinksw.com/blog/~kidehen Twitter/Identi.ca: kidehen
Re: Squaring the HTTP-range-14 circle [was Re: Schema.org in RDF ...]
Before I comment, I just want to summarise my understanding because http-range-14 is a weird term; I understand it as the range-14 issue that when you use 302 to redirect from a URI-A to a URL-B we have a convention that URL-B has some relationship to URI-A but it's not defined, we don't treat this as semantic information and tend to throw it away. (stated to make sure I've understood correctly) This bit a chap working with some of my data; * he loaded some data from URI-A using a library * URI-A did a nice content-negotiated 302 to URL-B (and RDF document) * URL-B had a description of URI-A * The problem was he also wanted to auto extract the license for this data, but the triples gave the license as a relation to URL-B, but the system treated the data as loaded from URI-A At the most simple level, we could add some triples when loading a graph via redirection... URI-A myprefix:http302redirect URL-B or something richer with dates, http options etc. You could do something even fussier with http headers stating an explicit relationship with the 302, but all of this is very nice but the main problem seems to be that it's hard and doesn't benefit someone who just wants to knock something up quickly. The real problem seems to me that making resolvable, HTTP URIs for real world things was a clever but dirty hack and does not make any semantic sense. We should use thing://data.totl.net/scooby to refer to the dog and have a convention that http://data.totl.net/scooby will refer to some content about my dog. This URL can of course then content negotiate as normal. You could also use this in reverse. *thing*://www.imdb.com/title/tt0910554/ is the primary topic of http://www.imdb.com/title/tt0910554/ Yes, you could end up with a whole bunch of URIs for the same thing; thing://data.totl.net/scooby thing://data.totl.net/scooby.html thing://data.totl.net/scooby.pdf thing://data.totl.net/scooby.csv all are the same thing, but big deal. The only tricky thing would be people may get confused about the thing URI related to a document. For example, given a document in pdf, word and html, you might need a separate thing:// URI to describe the abstract concept of the document, but that's not the primary topic of any of the documents. Such fiddling details are more the province of people with experience, so I'm not too worried. What we should be doing is making the common garden data really easy to produce. I've spent a lot of time trying to teach these concepts to people at hackdays barcamps, plus in a professional context. http:// URIs for real world things clearly make it harder to learn. The follow-you-nose gimick is cool, but we could do that with a change convention, and a trivial update to existing libraries (just resolve thing:// via http://) I expect the answer is it's too late to change now. To which I am tempted to say change or die. (again, another Monday morning ranty mail! but I feel like someone should be commenting on the emperors URI convention. If there's a cheat sheet I should read before continuing commenting on these subject, please point me to it.) Kingsley Idehen wrote: On 6/13/11 1:28 AM, Pat Hayes wrote: But I don't think all this is really germane to the http-range-14 issue. The point there is, does the URI refer to something like a representation (information resource, website, document, RDF graph, whatever) or something which definitely canNOT be sent over a wire? The Referent of a URI re., http-range-14 is the observation (or description) subject. In this context the subject may or may not be a real world object or entity. In the context of Linked Data, the observation (or description) subject URI resolves to a Representation of its Referent. Actual representation is accessible via an Address. Data representation formats are *optionally* negotiable e.g., via content negotiation, and ultimately varied i.e., many serialization formats for byte stream that actually transmits data from its source to its consumers. -- Christopher Gutteridge -- http://id.ecs.soton.ac.uk/person/1248 You should read the ECS Web Team blog: http://blogs.ecs.soton.ac.uk/webteam/
Re: Squaring the HTTP-range-14 circle
On 13 Jun 2011, at 09:59, Christopher Gutteridge wrote: The real problem seems to me that making resolvable, HTTP URIs for real world things was a clever but dirty hack and does not make any semantic sense. Well, you worry about *real-world things*, but even people who just worry about *documents* have said for two decades that the web is broken because it conflates names and addresses. And they keep proposing things like URNs and info: URIs and tag: URIs and XRIs and DOIs to fix that and to separate the naming concern from the address concern. And invariably, these things fizzle around in their little niche for a while and then mostly die, because this aspect that you call a “clever but dirty hack” is just SO INCREDIBLY USEFUL. And being useful trumps making semantic sense. HTTP has been successfully conflating names and addresses since 1989. There is a trillion web pages out there, all named with URIs. And even if just 0.1% of these pages are unambiguously about a single specific thing, that gives us a billion free identifiers for real-world entities, all already equipped with rich *human-readable* representations, and already linked and interconnected with *human-readable*, untyped, @href links. And these one billion URIs are plain old http:// URIs. They don't have a thing:// in the beginning, nor a tdb://, nor a #this or #that in the end, nor do they respond with 303 redirects or to MGET requests or whatever other nutty proposals we have come up with over the years to disambiguate between page and topic. They are plain old http:// URIs. A billion. Then add to that another huge number that already responds with JSON or XML descriptions of some interesting entity, like the one from Facebook that Kingsley mentioned today in a parallel thread. Again, no thing:// or tdb:// or #this or 303 or MGET on any of them. I want to use these URIs as identifiers in my data, and I have no intention of redirecting through an intermediate blank node just because the TAG fucked up some years ago. I want to tell the publishers of these web pages that they could join the web of data just by adding a few @rels to some as, and a few @properties to some spans, and a few @typeofs to some divs (or @itemtypes and @itemprops). And I don't want to explain to them that they should also change http:// to thing:// or tdb:// or add #this or #that or make their stuff respond with 303 or to MGET requests because you can't squeeze a dog through an HTTP connection. And here you and Pat and Alan (and TimBL, for that matter) are preaching that we can't use this one billion of fantastic free URIs to identify things because it wouldn't make semantic sense. Being useful trumps making semantic sense. The web succeeded *because* it conflates name and address. The web of data will succeed *because* it conflates a thing and a web page about the thing. http://richard.cyganiak.de/ a foaf:Document; dc:title Richard Cyganiak's homepage; a foaf:Person; foaf:name Richard Cyganiak; owl:sameAs http://twitter.com/cygri; . There. If your knowledge representation formalism isn't smart enough to make sense of that, then it may just not be quite ready for the web, and you may have some work to do. Best, Richard