Re: BobQL? Boxes of (related) boxes ...

David Huynh Sun, 31 May 2009 21:16:47 -0700

Dan Brickley wrote:

(I changed subject line again, sorry but to avoid misleaing further :)
OK, you goaded me into writing up what I was thinking about...

On 31/5/09 21:09, Andreas Harth wrote:
Hi,

Dan Brickley wrote:
Basic idea in a nutshell is that SPARQL is great for data access, but
there may be additional query-oriented data structures worth spec'ing
based around the set-oriented navigation very nicely articulated by
David Huynh in the Parallax screencast. And that if such a structure
could be exchanged between systems we could hope that the navigational
paradigm it supports could be found in various concrete UIs, and that
the results of exploring data this way could become useful and
standard artifacts in the public Web, rather than just bookmarks
within some specific system.
there's at least two issues with using standard SPARQL endpoints
in a faceted browsing system (as far as I can see from my experiements
with SWSE and VisiNav):

* A lot of end-user systems for RDF data navigation offer keyword
search, which is not in standard SPARQL. Emulating fulltext search
with SPARQL regex's seems suboptimal. Using endpoint-specific
FILTERs or magic predicates requires adaptations depending on
the RDF store used, and might be tricky when you want to mix
SPARQL endpoints from different vendors.

* Ranking is essential for systems offering navigation over web data.
LIMIT is ok to improve performance by keeping the result size small,
but query processors will then return arbitrary results that wouldn't
satisfy end users who expect relevant (i.e. ranked) results.
Oh, I quite agree. Actually I made a lot of these same points re whatto expect from SPARQL last friday at LIDA2009 -http://www.slideshare.net/danbri/understanding-the-standards-gap ......ie that SPARQL is good for what it's meant for, but peopleexpecting standards-supporting tools to do more will be in for adissapointment.
What I was getting at instead in the conversation here is that wecould directly serialize the "box of box of related things" conceptualmodel that we see in various SW toolsets. Am avoiding other containerwords like (rdf:)Bag, Set, Class etc although the concepts areobviously related. So for here it's "boxes".
So to take David's example,
Box 1: A journey exploring information about presidents, their kidsand their education...
box 1.1: All things that are US presidents
box 1.2: All things that are children of things in bag_1.1
box 1.3: All things that are educational institutions, attended bythings in bag 1.2bag1.4: All things that are places that are locations of things in bag1.3...
Box 2: A journey into info about hong kong skyscrapers, theirdesigners, and the buildings those designers have made
box 2.1: All things that are skyscrapers in Hong Kong
box 2.2: All things that are the architects of things in bag 2.1
box 2.3: All things that are buildings designed by things that are inbag 2.2 ...etc
Note: each sub-box is an RDFesque expression couched in terms of typesand relations, with a reference to the set of things handed along fromthe previously box. In theory each of these could also evaluated withdifferent "according to..." criteria, which could map into SPARQLGRAPH provenance, different databases, or various other ways ofindicating who-said-what. Also note that the class hierarchies belovedof OWL enthusiasts is potentially useful here: if a trail goes cold,eg. because you only find 3 or 4 things that are schools of kids ofpresidents, you could explore higher up the class hierarchy in one ofthese expression ("Not much found. Try Politician instead of USPresident? Try Educational Institution instead of School?")...
Each of the outer boxes captures a journey into some data. In theFreebase/Parallax articulation, the data is from the samemega-database. Obviously Semwebby people are fascinated by distributedsystems and standards, which is what I'm getting at here. Theconceptual model above - navigating by groups of things, is prettybasic and potentially universal. The restrictions in old style booleanquery against bibliographic databases, or the set machinery in OWL,are closely related. What I'd like to see is something in thisdirection that can be made separate of UI, separate from particulardatabase, and universal enough to be taught in schools.
The sketch above is playing with a spec that could ultimately becompiled down to SPARQL (though maybe not so simplistically, given theneed for counts etc), but which captures something a little closer toUI and user intent too, so that the query could be usefully mutated ifit didn't throw up many results...
Sorry this is so sketchy, but am I making any sense here?

This sounds pretty exciting! I'm glad you're thinking of doing somethingin this direction!

A while ago, I did think about formulating a query language to capturethe browsing paradigm of Parallax, but I was afraid that the querylanguage might restrict innovations, at least at that early stage of thegame. But over the long term, however, once we've understood theconceptual model, and all of the nuances that are required foruser-friendliness, then it's definitely worthwhile to have somestandard, perhaps in the form of a query language, that lets differentsystem work together, as you envision.

So I'd say that the best practical plan here is to develop a realworking and usable system alongside with the query language (or protocolor whatever kind of abstraction), but make the system the primaryartifact--fine-tuning it to make it usable--and keep the query languageup to date with the system. Of course the elegance of the query languageshould also inform the design and implementation of the system.

It would be really neat if you can even set this up as an open sourceproject that everyone here can work together on. (The worst casescenario is that it's just a paperware project that falls apart afterthe paper has been published.)


David

Re: BobQL? Boxes of (related) boxes ...

Reply via email to