Juan Sequeda wrote:
Does Sindice crawl this (or any other semantic web search engines)?
Juan,

Sponger is not about Sindice crawling our proxy URIs. The Web of Linked Data shouldn't be about mass crawling (search engine style) etc... Its really supposed to be about smarter data network traversals triggered by data access requests. Basically, make the pathway "on the fly", remember it for future reference, and know when its obsolete.

If you look at it the other way round, our Sponger has Meta Cartridges that will lookup Sindice (via their APIs) for specific data about a various entities. It won't seek a complete dump of Sindice etc.. The same applies to a plethora of Web 2.0 style services.


We can do smart database queries on the Web by simply meshing fundamental database principles with the inherent sophistication of HTTP :-)

Kingsley



Juan Sequeda, Ph.D Student
Dept. of Computer Sciences
The University of Texas at Austin
www.juansequeda.com <http://www.juansequeda.com>
www.semanticwebaustin.org <http://www.semanticwebaustin.org>


On Sat, Oct 17, 2009 at 4:24 AM, Martin Hepp (UniBW) <h...@ebusiness-unibw.org <mailto:h...@ebusiness-unibw.org>> wrote:

    Dear all:

    I just found out that the Virtuoso Sponger technology is even more
    powerful than I thought.

    Briefly: "Spongers" create rich GoodRelations (and other RDF)
    meta-data
    for existing Web pages on-the-fly. Other than traditional
    screen-scraping approaches, Spongers reuse public APIs and other
    techniques, so the data is of unprecedented degree of structure.

    Now, this can be directly used in arbitrary queries... by simply using
    the URI of the *existing* HTML Web page in the FROM clause of a SPARQL
    query.

    Example:


    
http://www.amazon.com/Semantic-Web-Real-World-Applications-Industry/dp/0387485309

    is a Web page in plain HTML offering a book. Amazon does not yet
    produce GoodRelations meta-data on their pages.

    If you go to

       http://uriburner.com/sparql

    and paste the URI in the "Default Graph URI " field and select
    "Retrieve
    remote RDF for all missing source graphs", then a query like

      "SELECT * WHERE {?s ?p ?o} LIMIT 50"

    returns a fully-fledged GoodRelations description for that page -
    as if
    Amazon was already supporting GoodRelations for each of its > 4
    million
    items!

    There are spongers for BestBuy, eBay, Zillow, and many other types of
    resources.

    Wow!

    Congrats to Kingsley and his team!

    Best wishes

    Martin Hepp

-- --------------------------------------------------------------
    martin hepp
    e-business & web science research group
    universitaet der bundeswehr muenchen

    e-mail:  h...@ebusiness-unibw.org <mailto:h...@ebusiness-unibw.org>
    phone:   +49-(0)89-6004-4217
    fax:     +49-(0)89-6004-4620
    www:     http://www.unibw.de/ebusiness/ (group)
            http://www.heppnetz.de/ (personal)
    skype:   mfhepp
    twitter: mfhepp

    Check out GoodRelations for E-Commerce on the Web of Linked Data!
    =================================================================

    Webcast:
    http://www.heppnetz.de/projects/goodrelations/webcast/

    Recipe for Yahoo SearchMonkey:
    http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey

    Talk at the Semantic Technology Conference 2009:
    "Semantic Web-based E-Commerce: The GoodRelations Ontology"
    
http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287

    Overview article on Semantic Universe:
    
http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html

    Project page:
    http://purl.org/goodrelations/

    Resources for developers:
    http://www.ebusiness-unibw.org/wiki/GoodRelations

    Tutorial materials:
    CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on
    Introduction to the GoodRelations Ontology, RDFa, and Yahoo!
    SearchMonkey
    
http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_IEEE_CEC%2709






--


Regards,

Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO OpenLink Software Web: http://www.openlinksw.com





Reply via email to