Re: Linked data in packaged content (ePub)

2010-05-14 Thread John Erickson
One aspect of this thread concerns reasoning on limited-compute (i.e.
mobile) platforms. I'd like to point out Pychinko, the Python-based
reasoner (c.f. http://bit.ly/bPglu1 ), which was designed to run on
mobile platforms such as Nokia phones where Java-based reasoners were
considered too compute-intensive.

Granted, given the current closed nature of Apple, we should probably
be looking for Cocoa/Objective-C based reasoners... :P

John

On Wed, Apr 28, 2010 at 2:39 PM, Stuart A. Yeates syea...@gmail.com wrote:
 On Wed, Apr 28, 2010 at 11:36 PM, John Erickson olyerick...@gmail.com wrote:
 Stuart, t's not clear to me what you're trying to accomplish...For
 whom are you trying to add value?

 We are funded to digitise teaching, learning and research materials
 for our staff and students. Value to anyone else is incidental, but
 indicative.

 Are you imagining creating some kind of meshup within the reading
 experience, perhaps meshing metadata and links bound to entities
 within the ePub'd document with external linked data?

 Ideally, I'd like a protocol such as Open URL (
 http://en.wikipedia.org/wiki/Open_URL ), linking books on the device
 up to the bibliographies of other books that also happen to be on the
 device. For low CPU devices the links might have to be pre-calculated
 when connected to a desktop PC. I understand that Open URL can't
 actaully do this because it assumes the web.

 cheers
 stuart




-- 
John S. Erickson, Ph.D.
http://bitwacker.wordpress.com
olyerick...@gmail.com
Twitter: @olyerickson



Re: Linked data in packaged content (ePub)

2010-04-29 Thread Stuart A. Yeates
On Wed, Apr 28, 2010 at 8:42 PM, Ed Summers e...@pobox.com wrote:
 On Tue, Apr 27, 2010 at 4:40 PM, Stuart A. Yeates syea...@gmail.com wrote:
 Does anyone know of any other attempts to put linked data into
 packages like this?

 While arguably not Linked Data per-se, you might be interested in work
 being done on the Open Publication Distribution System (OPDS) [1],
 which aims to use Atom for making metadata about ebooks available. A
 key part of an OPDS feed are opds:acquisition links between an
 atom:entry and a epub document identified with a URI and a media type,
 for example:

 link type=application/epub+zip
 href=http://www.feedbooks.com/book/4440.epub;
 rel=http://opds-spec.org/acquisition/

 Implementors include people at O'Reilly, Internet Archive, Ibis
 Reader, FeedBooks, to name a few. Much of the work is actually going
 on in open bi-weekly conference calls, and on a discussion list [3] if
 you are interested. On a recent call Hadrien Gardeur of Feedbooks was
 talking about embedding opds atom documents in the ebook
 serializations, so it might be worthwhile pinging him and/or the
 discussion list to see where things are at.

 //Ed

 [1] http://code.google.com/p/openpub/wiki/CatalogSpecDraft
 [2] http://ibisreader.com/
 [3] http://groups.google.com/group/openpub

Thank Ed.

I was unaware of some of those developments. The schema in the draft
looks very useful, because it's something solid to check against,
rather than sucking it and seeing.

cheers
stuart



Linked data in packaged content (ePub)

2010-04-28 Thread Stuart A. Yeates
I'm interested in putting linked data into eBooks published in the
(open standard) ePub format (http://www.openebook.org/ ). The format
is essentially a relocatable zip file of XHTML, associated media files
and a few metadata files.

The target platforms of this content impose some restrictions on what
is practical: e-ink devices (which are the only current eBook readers
with the battery life to last an entire novel) typically don't have an
internet connection (thus no resolving of links) and have very little
in the way of processing power (thus no full reasoning).

We already have some data-interlinking between our collection
(http://www.nzetc.org/ ) and librarything
(http://www.librarything.com/ ) at the FRBR work level
(http://vocab.org/frbr/core.html#Work ) and also some links to
wikipedia / dbpedia for named entities (principally authors and
places).  We believe we have quite good authority control over author
names, even those who published under multiple names (see, for example
http://www.nzetc.org/tm/scholarly/name-208662.html or
http://www.nzetc.org/tm/scholarly/name-208310.html ). We have ~1300
ePubs, the largest of which exceed the size limits of most ePub tools.

Does anyone know of any other attempts to put linked data into
packages like this?

There are two main issues I can see: (a) how to self-identify the
package (naive hashing doesn't work, as some eBook readers open the
package and add custom metadata) and (b) how to package the linked
data to get maximal use when a paucity of CPU precludes a full
reasoner.

The traditional identifier used in this field, the ISBN, is
essentially a print-run identifier, and not of a whole lot of obvious
use to us since: (a) most of our books' original publishing predates
ISBNs and (b) our digital republishing of them doesn't qualify for an
ISBN according to our local ISBN issuer (the National Library of New
Zealand).

cheers
stuart




Re: Linked data in packaged content (ePub)

2010-04-28 Thread Christoph LANGE
2010-04-27 22:40 Stuart A. Yeates syea...@gmail.com:
 I'm interested in putting linked data into eBooks published in the
 (open standard) ePub format (http://www.openebook.org/ ). The format
 is essentially a relocatable zip file of XHTML, associated media files
 and a few metadata files.
 
 ...
 
 Does anyone know of any other attempts to put linked data into
 packages like this?

The mere embedding is a current topic of interest of the RDFa WG (see
http://lists.w3.org/Archives/Public/public-rdfa-wg/2010Mar/0200.html), and I
suppose they will be quite interested in the further implications you
mentioned.

Cheers,

Christoph

-- 
Christoph Lange, Jacobs Univ. Bremen, http://kwarc.info/clange, Skype duke4701



Re: Linked data in packaged content (ePub)

2010-04-28 Thread Ed Summers
On Tue, Apr 27, 2010 at 4:40 PM, Stuart A. Yeates syea...@gmail.com wrote:
 Does anyone know of any other attempts to put linked data into
 packages like this?

While arguably not Linked Data per-se, you might be interested in work
being done on the Open Publication Distribution System (OPDS) [1],
which aims to use Atom for making metadata about ebooks available. A
key part of an OPDS feed are opds:acquisition links between an
atom:entry and a epub document identified with a URI and a media type,
for example:

link type=application/epub+zip
href=http://www.feedbooks.com/book/4440.epub;
rel=http://opds-spec.org/acquisition/

Implementors include people at O'Reilly, Internet Archive, Ibis
Reader, FeedBooks, to name a few. Much of the work is actually going
on in open bi-weekly conference calls, and on a discussion list [3] if
you are interested. On a recent call Hadrien Gardeur of Feedbooks was
talking about embedding opds atom documents in the ebook
serializations, so it might be worthwhile pinging him and/or the
discussion list to see where things are at.

//Ed

[1] http://code.google.com/p/openpub/wiki/CatalogSpecDraft
[2] http://ibisreader.com/
[3] http://groups.google.com/group/openpub



Re: Linked data in packaged content (ePub)

2010-04-28 Thread John Erickson
Stuart, t's not clear to me what you're trying to accomplish...For
whom are you trying to add value?

Since ePub is XHTML, it makes sense to embed metadata as RDFa. But
why? Is the purpose to enhance the reading experience? Or perhaps the
local collection management experience?

Publishers should be, and most likely are, obtaining DOIs
http://doi.org for their materials through a registration agency
such as CrossRef http://crossref.org. As we have discussed elsewhere
(and for more than a decade...) the DOI enables multiple stakeholders
to manage and publish metadata about the item; linked data best
practices are a promising approach , but RDFa on an RA's landing page
for the item is also a possibility. The DOI is not about embedding
metadata in individual instances, however, which is what you seem to
be asking about.

Are you imagining creating some kind of meshup within the reading
experience, perhaps meshing metadata and links bound to entities
within the ePub'd document with external linked data?

John

On Tue, Apr 27, 2010 at 4:40 PM, Stuart A. Yeates syea...@gmail.com wrote:
 I'm interested in putting linked data into eBooks published in the (open 
 standard) ePub format (http://www.openebook.org/ ). The format is essentially 
 a relocatable zip file of XHTML, associated media files and a few metadata 
 files.

 The target platforms of this content impose some restrictions on what is 
 practical: e-ink devices (which are the only current eBook readers with the 
 battery life to last an entire novel) typically don't have an internet 
 connection (thus no resolving of links) and have very little in the way of 
 processing power (thus no full reasoning).

 We already have some data-interlinking between our collection 
 (http://www.nzetc.org/ ) and librarything (http://www.librarything.com/ ) at 
 the FRBR work level (http://vocab.org/frbr/core.html#Work ) and also some 
 links to wikipedia / dbpedia for named entities (principally authors and 
 places).  We believe we have quite good authority control over author names, 
 even those who published under multiple names (see, for example 
 http://www.nzetc.org/tm/scholarly/name-208662.html or 
 http://www.nzetc.org/tm/scholarly/name-208310.html ). We have ~1300 ePubs, 
 the largest of which exceed the size limits of most ePub tools.

 Does anyone know of any other attempts to put linked data into packages like 
 this?

 There are two main issues I can see: (a) how to self-identify the package 
 (naive hashing doesn't work, as some eBook readers open the package and add 
 custom metadata) and (b) how to package the linked data to get maximal use 
 when a paucity of CPU precludes a full reasoner.

 The traditional identifier used in this field, the ISBN, is essentially a 
 print-run identifier, and not of a whole lot of obvious use to us since: (a) 
 most of our books' original publishing predates ISBNs and (b) our digital 
 republishing of them doesn't qualify for an ISBN according to our local ISBN 
 issuer (the National Library of New Zealand).

 cheers
 stuart






-- 
John S. Erickson, Ph.D.
http://bitwacker.wordpress.com
olyerick...@gmail.com
Twitter: @olyerickson



Re: Linked data in packaged content (ePub)

2010-04-28 Thread Stuart A. Yeates
On Wed, Apr 28, 2010 at 11:36 PM, John Erickson olyerick...@gmail.com wrote:
 Stuart, t's not clear to me what you're trying to accomplish...For
 whom are you trying to add value?

We are funded to digitise teaching, learning and research materials
for our staff and students. Value to anyone else is incidental, but
indicative.

 Are you imagining creating some kind of meshup within the reading
 experience, perhaps meshing metadata and links bound to entities
 within the ePub'd document with external linked data?

Ideally, I'd like a protocol such as Open URL (
http://en.wikipedia.org/wiki/Open_URL ), linking books on the device
up to the bibliographies of other books that also happen to be on the
device. For low CPU devices the links might have to be pre-calculated
when connected to a desktop PC. I understand that Open URL can't
actaully do this because it assumes the web.

cheers
stuart



Re: Linked data in packaged content (ePub)

2010-04-28 Thread John Erickson
On Wed, Apr 28, 2010 at 2:39 PM, Stuart A. Yeates syea...@gmail.com wrote:
 Ideally, I'd like a protocol such as Open URL 
 (http://en.wikipedia.org/wiki/Open_URL ), linking books on the device up to 
 the bibliographies of other books that also happen to be on the device. For 
 low CPU devices the links might have to be pre-calculated when connected to a 
 desktop PC. I understand that Open URL can't actually do this because it 
 assumes the web.

So the zeroth answer is to only provide such behavior under only
certain conditions, for example when explicit DOIs for the items in
the bibliographic record are provided and the other items on the
device have DOIs...this would allow for an easy registration when
books were put on the device. The browser of course would have to know
to intervene, however.

To actually do what OpenURL enables --- embed citation data ala COinS
http://ocoins.info/, use that as the basis for constructing OpenURL
references, and resolve those on the device --- would be a stretch on
a disconnected device, but not impossible. The citation metadata for
each installed ePub would have to be indexed, and then a proxy on the
device would have to check this before attempting to resolve
off-device. This sort of resolution doesn't require any advanced
reasoning, although it still might be intense.

Is this what you were thinking?

-- 
John S. Erickson, Ph.D.
http://bitwacker.wordpress.com
olyerick...@gmail.com
Twitter: @olyerickson