from:"Richard Wallis"

Re: [CODE4LIB] Structured Data Markup on library web sites

2016-03-31 Thread Richard Wallis

BTW the OpenLink Structured Data Sniffer (OSDS) 
is a great little browser plugin to see what structured data is lurking in
the html.

I believe that it supersedes the OpenLink Data Explorer Extension.

~Richard

On 31 March 2016 at 16:15, Brian Kennison  wrote:

>
> On Mar 31, 2016, at 10:24 AM, Kevin Ford > wrote:
>
> p.s.  Curl command I used:
>
>  curl -L -H 'Application/rdf+xml'
> http://bmcr.brynmawr.edu/2014/2014-02-18.html | grep schema
>
> I tried a few variations, such as removing the .html from the end of the
> URL etc.  Nada.
>
>
> Kevin,
>
> I can’t make the curl work either (I’ll have to investigate this further)
> but I do get it when I use wget. I also have the OpenLink Data Explorer
> Extension  installed (like Karen) and it also
> downloads the rdf.
>
> The caveat here seems to be what Richard said that the search engines are
> not doing content negotiation.
>
> —Brian
>
>

Re: [CODE4LIB] Structured Data Markup on library web sites

2016-03-31 Thread Richard Wallis

As a FYI, as far as I am aware the search engines do not access pages using
content negotiation (e.g.. asking for Application/rdf+xml) when looking for
structured data such as schema.org in their crawl process.

They expect to find it embedded in the HTML as Microdata, RDFa, or
increasingly JSON-LD in a script tag.

~Richard


On 31 March 2016 at 15:24, Kevin Ford  wrote:

> Hi Brian,
>
> I've tried the wget command and curl and in both cases I just get the HTML
> version of the document.  I don't think any meaningful content negotiation
> is happening.  It's probably as Karen suspected: they didn't return and
> embed schema in older reviews.  Are you getting something else?
>
> I think the tool Karen is using takes the URL as the identifier (logical)
> and converts the ' (which seems fair).  That's how the tools comes up with the little bit it
> does for this item.
>
> Yours,
> Kevin
>
> p.s.  Curl command I used:
>
>  curl -L -H 'Application/rdf+xml'
> http://bmcr.brynmawr.edu/2014/2014-02-18.html | grep schema
>
> I tried a few variations, such as removing the .html from the end of the
> URL etc.  Nada.
>
>
>
>
> On 03/31/2016 08:39 AM, Brian Kennison wrote:
>
>>
>> On Mar 29, 2016, at 12:46 PM, Kevin Ford  k...@3windmills.com>> wrote:
>>
>> FWIW, I'm looking at the HTML itself.  You may be using a tool that is
>> generating a little but of schema.  Is that accurate?
>>
>> Kevin,
>>
>> I was perplexed by this also but I realized that there was “content
>> negotiation” going on. I set the header to accept rdf and indeed there is
>> data for this document.
>>
>> —Brian
>>
>> wget --header "Accept: application/rdf+xml"
>> http://bmcr.brynmawr.edu/2014/2014-02-18.html
>>
>>

Re: [CODE4LIB] Deduping linked data in search - was RE: [CODE4LIB] Structured Data Markup on library web sites

2016-03-29 Thread Richard Wallis

It is always good sport trying to second guess what Google will do with
data……  ;-)

‘Localness’ is not always a matter of geographical nearness.  IP address
ranges for example are rumoured to have some influence in this area.

However it is as much about the relationship between well described
resources - in Schema.org terms having exampleOfWork relationships with
well described CreativeWork instances - and the Organization instances that
hold them.   A poorly described schema:library with little geolocation
information that is not linked to other references to the library, in
WikiData or Wikipedia for example, will hold back the relevance of the
resources held as much as poor description of the resources themselves.

Equally if the properties marked up in schema.org are all strings and not
canonical URIs for authors, subjects, publishers, etc. a search engine may
well be able to ingest the data, but linking it to other relevant entities
in their knowledge graph will be a challenge without them.

As to the major search engines tracking and consuming Bibframe.  As Kevin
implies, if they find it useful they will.   With 12+ Million sites across
many sectors publishing Schema.org data, I would imagine a few libraries
publishing Bibframe would be a bit of an edge case at the moment.

Kevin is also spot on to say “For a variety of reasons, organizations may
choose to use Bibframe as a more library-specific vocabulary but publish
the data for general consumption (read: for Google and others) in the
schema.org vocabulary.”  In simple terms choose your vocabulary based on
your audience. If you want to get data to non-library folks, go for the
most widely used and understood vocablary.

~Richard

Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Linkedin: http://www.linkedin.com/in/richardwallis
Twitter: @rjw

On 29 March 2016 at 16:09, Harper, Cynthia <char...@vts.edu> wrote:

> Hopefully Google will have a means to let libraries/patrons
> select/deselect areas where they will advertise their resources. We're a
> private institution in Alexandria VA. Our resources are pertinent to other
> people on our single IP domain, but less so to others in Alexandria VA.
> Maybe they'd use the same libraries you choose for Google Scholar link
> resolvers.
>
> Cindy Harper
>
> -Original Message-
> From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
> Kevin Ford
> Sent: Tuesday, March 29, 2016 10:45 AM
> To: CODE4LIB@LISTSERV.ND.EDU
> Subject: Re: [CODE4LIB] Deduping linked data in search - was RE:
> [CODE4LIB] Structured Data Markup on library web sites
>
> It's probably not safe to say that "all search is local" but there is most
> certainly a strong local component considered for every search.
> For me, every hit on the first page of Google's results for a search for
> "ice cream parlor" is related to Chicago, which is where I executed the
> search.  A search for a book (I chose a current bestseller as a test),
> however, does not return a local hit in the first two pages.  That's not to
> say it can't happen.  It might simply (hah! 'simple') be that Google does
> not know enough about local inventory (books available from a local library
> or in stock at a local bookstore) to offer that type of
> assistance/precision.  While this may seem like a theory only, Zepheira's
> libhub initiative has been trying to make this a reality by publishing
> individual libraries' structured data so that Google can make sense of it.
> And, at this point, if anyone from Libhub is on this list, I'll let you
> take it from here...
>
> Yours,
> Kevin
>
>
> On 03/29/2016 08:52 AM, Ruth Tillman wrote:
> > An off-the-cuff response: I've heard it suggested in talks about
> > Bibframe that just as Google tailors your results based on location
> > (i.e. if I put in "pizza," I'll get pizza places in South Bend, as
> > well as pizza recipes and whatnot), they'd tailor your library results
> > based on location. So if I were in downtown DC, and Googled a book, I
> > would see the DCPL holdings but not Indiana, and vice-versa.
> >
> > There are maybe 5 or 10 assumptions happening there that other people
> > can spell out better, but it would be a reasonable solution for
> > deduping assuming the metadata pretty much matches.
> >
> > On Tue, Mar 29, 2016 at 9:40 AM, Harper, Cynthia <char...@vts.edu>
> wrote:
> >
> >> Forgive me if I'm confusing schema.org and Bibframe, but I wonder how
> >> Google is going to dedupe all the sources of a given
> >> document/material when many libraries have their holdings in
> >> bibframe?  These sample searches made me wonder about that again.  has
> this been discussed?
> >>
> >> C

Re: [CODE4LIB] OCLC shutting down xISBN and xID (was Re: [CODE4LIB] Matching print and electronic editions of the same book)

2015-12-14 Thread Richard Wallis

@Sara:
You can lookup using OCLCNumber based and ISBN based URIs.  Note I am not
using the word ’search’.  You are not searching an index for references to
a number string, you asking for a description of the thing with identifier ‘
http://worldcat.org/oclc/##’.  May seem a little pedantic, but it is
fundamental to linked data.

As far as I am aware there is no direct access to lccn via this route. That
*would* be more of a search operation: ‘return me the Thing(s), and their
unique URI(s), that are sameAs the thing this identified by this’ lccn.

@Cindy:
All this data is openly available, as Linked Data under an Open Data
Commons Attribution License <http://www.oclc.org/data/attribution.en.html>,
in the way I described so you do not need an account key, or an API to
access it.

OCLC have APIs that also makes some of this data available via other
routes.  I will leave it to folks at OCLC to comment on those and any
limits, or not, associated with the Linked Data.

~Richard.

Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Linkedin: http://www.linkedin.com/in/richardwallis
Twitter: @rjw

On 14 December 2015 at 15:11, Harper, Cynthia <char...@vts.edu> wrote:

> My question - is there a limit of number of requests, or request rate?  Is
> this done with an account key?
>
> Thanks,
> Cindy Harper
>
> -Original Message-
> From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
> sara amato
> Sent: Monday, December 14, 2015 9:57 AM
> To: CODE4LIB@LISTSERV.ND.EDU
> Subject: Re: [CODE4LIB] OCLC shutting down xISBN and xID (was Re:
> [CODE4LIB] Matching print and electronic editions of the same book)
>
> One last question on this - I see that I can search
> worldcat.org/oclc/##  and worldcat.org/isbn/#- is there any
> way to search the lccn?
>
>
>
> > On Dec 12, 2015, at 3:52 PM, Richard Wallis <
> richard.wal...@dataliberate.com <mailto:richard.wal...@dataliberate.com>>
> wrote:
> >
> > Sara,
> >
> > The canonical URI you are looking for is
> http://worldcat.org/oclc/8410511 <http://worldcat.org/oclc/8410511> <
> http://worldcat.org/oclc/8410511 <http://worldcat.org/oclc/8410511>>
> which silently redirects [via a http 303] to where the data is currently
> stored (experiment.worldcat.org <http://experiment.worldcat.org/> <
> http://experiment.worldcat.org/ <http://experiment.worldcat.org/>>).
> This approach enables the canonical WorldCat identifiers to be maintained
> over time in a fixed namespace, whilst providing flexibility as to where
> the actual data is stored.
> >
> > You can use http content-negotiation to get the serialisation [html,
> rdfxml, triples, turtle] that you require, or as an option you can suffix
> the url with .jsonld etc.  - See my blogpost <
> http://dataliberate.com/2013/06/content-negotiation-for-worldcat/ <
> http://dataliberate.com/2013/06/content-negotiation-for-worldcat/>> for a
> longer explanation.
> >
> > The same pattern occurs when you follow the exampleOfWork triple to get
> the work id.  The data references the canonical
> http://worldcat.org/entity/work/id/3357516 <
> http://worldcat.org/entity/work/id/3357516> <
> http://worldcat.org/entity/work/id/3357516 <
> http://worldcat.org/entity/work/id/3357516>> URI.  Accessing that
> redirects, via a http 303, to the description of that resource at
> http://experiment.worldcat.org/entity/work/data/3357516 <
> http://experiment.worldcat.org/entity/work/data/3357516> <
> http://experiment.worldcat.org/entity/work/data/3357516 <
> http://experiment.worldcat.org/entity/work/data/3357516>>.
> >
> > The moral of this is always use the canonical URIs to look things up.
> >
> > You ask if ‘experiment.worldcat.org <http://experiment.worldcat.org/> <
> http://experiment.worldcat.org/ <http://experiment.worldcat.org/>>’ is
> going to be around for a while.  From the above you can infer that it is
> the wrong question to ask.WorldCat itself maintains the canonical URIs,
> for Works, OCLCNUMS, and other entities.  By using those in your code, you
> will be protected against any architectural and or system changes behind
> the scenes.
> >
> > Then, as Terry suggests, following the relationships in the Linked Data
> is the way to achieve many of the same ends as using xID.
> >
> > ie. Using an OCLCNUM, generate the URI of the associated WorldCat entity
> ‘http://worldcat.org/oclc/x <http://worldcat.org/oclc/x> <
> http://worldcat.org/oclc/x <http://worldcat.org/oclc/x>>'.  From
> that entity description extract the schema:exampleOfWork triple. Use the
> URI

Re: [CODE4LIB] OCLC shutting down xISBN and xID (was Re: [CODE4LIB] Matching print and electronic editions of the same book)

2015-12-12 Thread Richard Wallis

Sara,

The canonical URI you are looking for is http://worldcat.org/oclc/8410511
which silently redirects [via a http 303] to where the data is currently
stored (experiment.worldcat.org).  This approach enables the canonical
WorldCat identifiers to be maintained over time in a fixed namespace,
whilst providing flexibility as to where the actual data is stored.

You can use http content-negotiation to get the serialisation [html,
rdfxml, triples, turtle] that you require, or as an option you can suffix
the url with .jsonld etc.  - See my blogpost
<http://dataliberate.com/2013/06/content-negotiation-for-worldcat/> for a
longer explanation.

The same pattern occurs when you follow the exampleOfWork triple to get the
work id.  The data references the canonical
http://worldcat.org/entity/work/id/3357516 URI. Accessing that redirects,
via a http 303, to the description of that resource at
http://experiment.worldcat.org/entity/work/data/3357516.

The moral of this is always *use the canonical URIs to look things up*.

You ask if ‘experiment.worldcat.org’ is going to be around for a while.
>From the above you can infer that it is the wrong question to ask. WorldCat
itself maintains the canonical URIs, for Works, OCLCNUMS, and other
entities. By using those in your code, you will be protected against any
architectural and or system changes behind the scenes.

Then, as Terry suggests, following the relationships in the Linked Data is
the way to achieve many of the same ends as using xID.

ie. Using an OCLCNUM, generate the URI of the associated WorldCat entity ‘
http://worldcat.org/oclc/x'.  From that entity description extract the
schema:exampleOfWork triple. Use the URI from that to obtain the Work URI.
Obtain the description of the Work from that URI and the extract the values
of the contained schema:workExample triples.

~Richard.

Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Linkedin: http://www.linkedin.com/in/richardwallis
Twitter: @rjw

On 12 December 2015 at 22:47, Sara Amato <sam...@willamette.edu> wrote:

> In thinking about using worldcat.org as a bridge to a works record, I'm
> wondering about using 'experiment.worldcat.org', e.g.
> http://experiment.worldcat.org/oclc/841051199.jsonld as an easy way to get
> the work id. I'm having trouble finding out any info about exactly what
> 'experiment.worldcat.org' is.   Is it likely to be around for a while?  Is
> it up to date and just reformatting the www.worldcat.org data???  Anybody
> know?
>
>
>
>
> On Fri, Dec 11, 2015 at 2:17 PM, Terry Reese <ree...@gmail.com> wrote:
>
> > I think the best replacement at this point as a single API is to look at
> > Librarythings api though I'm not sure if it would work in all cases --
> > otherwise, I think using worldcat.org as a bridge to their works records
> > probably is your best bet.
> >
> > --tr
> >
> > -Original Message-
> > From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
> > Brian Riley
> > Sent: Friday, December 11, 2015 5:14 PM
> > To: CODE4LIB@LISTSERV.ND.EDU
> > Subject: Re: [CODE4LIB] OCLC shutting down xISBN and xID (was Re:
> > [CODE4LIB]
> > Matching print and electronic editions of the same book)
> >
> > Does anyone know if OCLC is recommending an alternate solution that will
> > provide the same or at least similar functionality?
> >
> > I had played around with the Worldcat Discovery API when it was in beta
> but
> > am not sure of its present status or if its the most logical replacement
> > for
> > xID.
> >
> >Brian
> >
> > 
> > From: Code for Libraries <CODE4LIB@LISTSERV.ND.EDU> on behalf of Eric
> > Hellman <e...@hellman.net>
> > Sent: Friday, December 11, 2015 1:31 PM
> > To: CODE4LIB@LISTSERV.ND.EDU
> > Subject: Re: [CODE4LIB] OCLC shutting down xISBN and xID (was Re:
> > [CODE4LIB]
> > Matching print and electronic editions of the same book)
> >
> > Users of xID services would be wise to check the termination clauses of
> > their usage agreements to see whether they are permitted to keep and
> reuse
> > the data they have cached.
> >
> > Think about it. The world outside of 43017 has invented all sorts of new
> > techniques to update and maintain metadata cooperatively.
> >
> > Eric
> >
> >
> >
> > Eric Hellman
> > President, Free Ebook Foundation
> > Founder, Unglue.it https://unglue.it/
> > https://go-to-hellman.blogspot.com/
> > twitter: @gluejar
> >
>

[CODE4LIB] W3C Community Group Launched for Archives and Schema.org

2015-07-23 Thread Richard Wallis

*Apologies for cross posing*

Following interest expressed at the LODLAM Summit 2015
http://summit2015.lodlam.net/ a W3C Community Group has been created to
discuss potential extension of the Schema.org vocabulary to describe
archives and their contents.

The group named *Schema Architypes* can be accessed via the Architypes W3C
Community Group Site https://www.w3.org/community/architypes/:

The mission of the group is to discuss and prepare proposal(s) for
extending Schema.org schema for the improved representation of digital and
physical archives and their contents. The goal being focused upon the
creation and future maintenance of an archive.schema.org extension.


All who are interested are welcome to join the group and the discussion.

Note: In order to join the group
https://www.w3.org/community/architypes/join, you will need a W3C account
https://www.w3.org/accounts/request. Please note, however, that W3C
Membership
https://www.w3.org/community/about/faq/#is-w3c-membership-required-to-participate-in-a-community-or-business-group
is
*not* required to join a Community Group.

Regards,
Richard Wallis

Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Linkedin: http://www.linkedin.com/in/richardwallis
Twitter: @rjw

Re: [CODE4LIB] Library Hours

2015-05-06 Thread Richard Wallis

 doesn't have to be shown.
 Who knew fines and library/student-IDs were a thing of the past?

 The only data sets I can find where they got the 17,000 number is for
 public libraries:
 http://www.imls.gov/research/pls_data_files.aspx
 Maybe I missed something.
 There is an hours field on one of the CSVs I downloaded, etc for 2012

 data

 (the most recent I could find).

 Asking 10k for something targeted for completion in June and without a
 grasp on what types of libraries there are and how volatile the hours
 information is (especially in crisis) ...
 Sounds naive at best, sketchy at worst.

 The flexible funding button says this campaign will receive all

 funds

 raised even if it does not reach its goals.

 The value of these places for youth cannot be underestimated.
 So is the value of a quick buck ...

 On Tue, May 5, 2015 at 4:53 PM, McCanna, Terran 
 tmcca...@georgialibraries.org wrote:

   I'm not at all surprised that this doesn't already exist, and even
 if

 OCLC's was available, I'd be willing to bet it was out of date.

 Public library hours, especially in underfunded areas, may fluctuate
 depending on funding cycles, seasons (whether school is in or out),

 etc.,

 not to mention closing/reopening/moving because of old buildings that
 need
 to be updated. We have around 280 locations in our consortium and we

 have

 to rely on self-reporting to find out if their hours change. We

 certainly

 don't have staff time to check every one of their web sites on

 regular

 basis, I can't imagine keeping track of 17,000!


 Terran McCanna
 PINES Program Manager
 Georgia Public Library Service
 1800 Century Place, Suite 150
 Atlanta, GA 30345
 404-235-7138
 tmcca...@georgialibraries.org


 - Original Message -
 From: Peter Murray jes...@dltj.org
 To: CODE4LIB@LISTSERV.ND.EDU
 Sent: Tuesday, May 5, 2015 4:36:56 PM
 Subject: Re: [CODE4LIB] Library Hours

 OCLC has an institutional registry [1], which had (in part) library
 hours,
 addresses, and so forth.  It seems to be unavailable, though [2].

 That

 is
 the only systematic collection of library hours data that I know

 about.


 Peter

 [1] https://www.oclc.org/worldcat-registry.en.html
 [2] https://www.worldcat.org/registry/institution/

   On May 5, 2015, at 4:16 PM, Bigwood, David dbigw...@hou.usra.edu
 wrote:

  This looks like a decent group, but I find this statement hard to

  believe.

  Your tax-deductible donation supports adding the names, address and

 the

 hours of operation of all libraries to Range. The Institute of Museum

 and

 Library Services publishes an open data catalog which is the source

 we'll

 use for the names and the addresses of the nation's libraries.

 However,

 there isn't a listing of the days and hours of operation for all
 libraries
 in the US. We are going to track down the hours of operation for all
 17,000
 libraries and make that information available -- in Range and for

 other

 developers who may want to use it.


 https://www.indiegogo.com/projects/range-food-and-safe-places-for-youth

 Are the hours of public libraries really not available?

 Sincerely,
 David Bigwood
 dbigw...@gmail.commailto:dbigw...@gmail.com
 Lunar and Planetary Institute
 @LPI_Library
 https://www.flickr.com/photos/lunarandplanetaryinstitute/


  --
 Karen Coyle
 kco...@kcoyle.net http://kcoyle.net
 m: +1-510-435-8234
 skype: kcoylenet/+1-510-984-3600




 --
 Karen Coyle
 kco...@kcoyle.net http://kcoyle.net
 m: +1-510-435-8234
 skype: kcoylenet/+1-510-984-3600




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Richard Wallis

 is Wikipedia which includes the ISNI
 for Catherine Sefton in the Wikipeda page for Martin Waddell (
 http://en.wikipedia.org/wiki/Martin_Waddell) (although oddly not the
 ISNI for Martin Waddell under his own name).

 Owen

 Owen Stephens
 Owen Stephens Consulting
 Web: http://www.ostephens.com
 Email: o...@ostephens.com
 Telephone: 0121 288 6936

 On 18 Jun 2014, at 23:28, Stuart Yeates stuart.yea...@vuw.ac.nz wrote:

  My reading of that suggests that http://isni-url.oclc.nl/isni/
 000122816316 shouldn't have both Bell, Currer and Brontë,
 Charlotte, which it clearly does...

 Is this is a case of one of our sources of truth doesn't distinguish
 betweens identities and entities and we're allowing it to pollute our data?

 If that source of truth is wikipedia, we can fix that.

 cheers
 stuart

 On 06/19/2014 12:11 AM, Richard Wallis wrote:

 Hi all,

 Seeing this thread I checked with the ISNI team and got the following
 answer from Janifer Gatenby who asked me to post it on her behalf:

 SNI identifies “public identities”.The scope as stated in the
 standard
 is



 “This International Standard specifies the International Standard name
 identif*i*er (ISNI) for the identification of public identities of
 parties;
 that is, the identities used publicly by parties involved throughout the
 media content industries in the creation, production, management, and
 content distribution chains.”



 The relevant definitions are:



 *3.1*

 *party*

 natural person or legal person, whether or not incorporated, or a group
 of
 either

 *3.3*

 *public identity*

 Identity of a *party *(3.1) or a fictional character that is or was
 presented to the public

 *3.4*

 *name*

 character string by which a *public identity *(3.3) is or was commonly
 referenced



 A party may have multiple public identities and a public identity may
 have
 multiple names (e.g. pseudonyms)



 ISNI data is available as linked data.  There are currently 8 million
 ISNIs
 assigned and 16 million links.



 Example:



 [image: image001.png]

 ~Richard.


 On 16 June 2014 10:54, Ben Companjen ben.compan...@dans.knaw.nl
 wrote:

  Hi Stuart,

 I don't have a copy of the official standard, but from the documents on
 the ISNI website I remember that there are name variations and 'public
 identities' (as the lemma on Wikipedia also uses). I'm not sure where
 the
 borderline is or who decides when different names are different
 identities.

 If it were up to me: pseudonyms are definitely different public
 identities, name changes after marriage probably not, name change after
 gender change could mean a different public identity. Different public
 identities get different ISNIs; the ISNI organisation says the ISNI
 system
 can keep track of connected public identities.

 Discussions about name variations or aliases are not new, of course. I
 remember the discussions about 'aliases' vs 'Artist Name Variations'
 that
 are/were happening on Discogs.com, e.g. 'is J Dilla an alias or a ANV
 of
 Jay Dee?' It appears the users on Discogs finally went with aliases,
 but
 VIAF put the names/identities together: http://viaf.org/viaf/32244000
 -
 and there is no ISNI (yet).

 It gets more confusing when you look at Washington Irving who had
 several
 pseudonyms: they are just listed under one ISNI. Maybe because he is
 dead,
 or because all other databases already know and connected the
 pseudonyms
 to the birth name? (I just sent a comment asking about the record at
 http://isni.org/isni/000121370797 )


 [Here goes the reference list…]

 Hope this helps :)

 Groeten van Ben

 On 15-06-14 23:11, Stuart Yeates stuart.yea...@vuw.ac.nz wrote:

  Could someone with access to the official text of ISO 27729:2012 tell
 me
 whether an ISNI is a name identifier or an entity identifier? That is,
 if someone changes their name (adopts a pseudonym, changes their name
 by
 to marriage, transitions gender, etc), should they be assigned a new
 identifier?

 If the answer is 'No' why is this called a 'name identifier'?

 Ideally someone with access to the official text would update the
 article at
 https://en.wikipedia.org/wiki/International_Standard_Name_Identifier
 With a brief quote referenced to the standard with a page number.

 [The context of this is ORCID, which is being touted as an entity
 identifier, while not being clear on whether it's a name or entity
 identifier.]

 cheers
 stuart









-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Richard Wallis

Hi Eric,

What distinguishes one from another?

The communities behind them, the [often overlapping] communities they
are intended to serve, and the technical implementation.

As a librarian, why should I care?

I would, as a non-librarian, suggest that once you are happy with
the ‘authority’ of them, you shouldn’t have to care. Ideally, we are not
there yet, systems should be flexible and accommodating enough to link to
any appropriate authority.

I will probably get flamed for over generalisation here but - VIAF is
an aggregation of National Libraries Authority files.  - ISNI is a more
publisher focused but similar effort.  - OCID comes from and and tries to
serve individual academic institutions, their researchers and falsity
authors.


  authority control |simple identifier |Linked Data capability
 +-+--+--+
  VIAF   |X|X |  X   |
 +-+--+--+
  ORCID  | | X|  |
 +-+--+--+
   ISNI  |X| X|X |
 +-+--+--+

~Richard


On 20 June 2014 15:42, Eric Lease Morgan emor...@nd.edu wrote:

 On Jun 20, 2014, at 10:31 AM, Richard Wallis 
 richard.wal...@dataliberate.com wrote:

  ISNI has a suite of programs that detects pseudonyms coded as name
 variants
  and changes them into related name and generates related identity
 records.
  It is a while since it was run and will be re-run in the next few weeks.
  This should change Currer Bell into a related name of Charlotte Brontë .


 Please humor me as I ask this question again. What is the difference
 between ISNI and other identifiers systems (like ORCID, etc.)? What
 distinguishes one from another? As a librarian, why should I care? Was as a
 faculty member/scholar, why should I care? Under what context is one
 identifier expected to be used instead of another? Maybe a picture/graph is
 in order:

   authority control simple pointer
  +-+--+
   VIAF   |X|  |
  +-+--+
   ORCID  | | X|
  +-+--+
ISNI  | |  |
  +-+--+

 —
 Eric Lease Morgan




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Richard Wallis

In what ways does ISNI support linked data?

See: http://www.isni.org/how-isni-works#HowItWorks_LinkedData

~Richard


On 20 June 2014 18:57, Eric Lease Morgan emor...@nd.edu wrote:

 On Jun 20, 2014, at 10:56 AM, Richard Wallis 
 richard.wal...@dataliberate.com wrote:

   authority control|simple identifier |Linked Data capability
  +-+--+--+
   VIAF   |X|X |  X   |
  +-+--+--+
   ORCID  | |X |  |
  +-+--+--+
ISNI  |X|X |  X   |
  +-+--+--+


 Increasingly I like linked data, and consequently, here is clarification
 and a question. ORCID does support RDF, but only barely. It can output
 FOAF-like data, but not bibliographic. Moreover, it is experimental, at
 best:

   curl -L -H 'accept: application/rdf+xml'
 http://orcid.org/-0002-9952-7800

 In what ways does ISNI support linked data?

 —
 Eric Morgan




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-19 Thread Richard Wallis

More from Janifer….

Thanks Richard for forwarding.  I don’t know how long my answer is allowed
to be.  Here are some extracts from a recent article (in publication):



Meryl Streep has only one public identity that she uses for her creative
works.  VIAF includes Streep, Meryl (Mary Louise) as the name form on her
birth certificate is Streep, Mary Louise.  Her married name Gummer, Mary
Louise, is mentioned in Wikipedia and could also be considered as a name
variant.  Changes in the preferred forenames and change of name due to
marriage are not considered as constituting a change of identity as there
is not necessarily a wish by the person to separate his or her creative
output corresponding with a name change.  Researchers, for example will
mention the ensemble of their publications in their curriculum vitae where
they have been published over a lifetime involving name changes.

Alternative name forms that do not usually indicate a change in identity
can take variant forms including names in different scripts or
transliterated with different schemes, changes due to change of sex,
marriage and deed poll.



Kingsley Amis in the figure above is an example of an author writing under
3 different identities, writing as himself and writing under two different
pseudonyms.  His works attributed to each of the three different identities
are clearly separated.  In this case, the association of the pseudonyms
with the real person is public but that is not always the case.  For
example, J.K. Rowling chose to create a pseudonym Robert Galbraith (35) so
that she could write in a different genre and have her works in the new
genre judged without any influence of the successful Harry Potter
series.When the relationship between a pseudonym and a real person is
or becomes public, the works tend to become associated with the real person
as well as the pseudonym; for example they may be republished or released
in collected works.  One example is John Wyndham, the main pen name of John
Wyndham Parkes Lucas Beynon Harris (10 July 1903 – 11 March 1969).  Under
an earlier pen name, John Beynon, he wrote*Secret People *but this is now
published under John Wyndham.   As a general rule, separate authority
records should be made for each identity used as a public identity.  This
allows the history of works to be accurately traced, enabling catalogs to
respond to questions such as “under which identity was the work conceived
and first published?”  The authority records related to each of the public
identities should be linked where the relationship is public.
Encyclopedias such as Wikipedia tend to have only one entry for a person,
grouping all public identities.  This entails the need for one-to-many
relationships in linking especially from library authority records to
encyclopedias.

Best regards,



Janifer


On 19 June 2014 09:53, Owen Stephens o...@ostephens.com wrote:

 An aside but interesting to see how some of this identity stuff seems to
 be playing out in the wild now. Google for Catherine Sefton:

 https://www.google.co.uk/search?q=catherine+sefton

 The Knowledge Graph displays information about Martin Waddell. Catherine
 Sefton is a pseudonym of Martin Waddell. It is impossible to know, but the
 most likely source of this knowledge is Wikipedia which includes the ISNI
 for Catherine Sefton in the Wikipeda page for Martin Waddell (
 http://en.wikipedia.org/wiki/Martin_Waddell) (although oddly not the ISNI
 for Martin Waddell under his own name).

 Owen

 Owen Stephens
 Owen Stephens Consulting
 Web: http://www.ostephens.com
 Email: o...@ostephens.com
 Telephone: 0121 288 6936

 On 18 Jun 2014, at 23:28, Stuart Yeates stuart.yea...@vuw.ac.nz wrote:

  My reading of that suggests that
 http://isni-url.oclc.nl/isni/000122816316 shouldn't have both Bell,
 Currer and Brontë, Charlotte, which it clearly does...
 
  Is this is a case of one of our sources of truth doesn't distinguish
 betweens identities and entities and we're allowing it to pollute our data?
 
  If that source of truth is wikipedia, we can fix that.
 
  cheers
  stuart
 
  On 06/19/2014 12:11 AM, Richard Wallis wrote:
  Hi all,
 
  Seeing this thread I checked with the ISNI team and got the following
  answer from Janifer Gatenby who asked me to post it on her behalf:
 
  SNI identifies “public identities”.The scope as stated in the
 standard
  is
 
 
 
  “This International Standard specifies the International Standard name
  identif*i*er (ISNI) for the identification of public identities of
 parties;
  that is, the identities used publicly by parties involved throughout the
  media content industries in the creation, production, management, and
  content distribution chains.”
 
 
 
  The relevant definitions are:
 
 
 
  *3.1*
 
  *party*
 
  natural person or legal person, whether or not incorporated, or a group
 of
  either
 
  *3.3*
 
  *public identity*
 
  Identity of a *party *(3.1) or a fictional character that is or was
  presented

Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-18 Thread Richard Wallis

Hi all,

Seeing this thread I checked with the ISNI team and got the following
answer from Janifer Gatenby who asked me to post it on her behalf:

SNI identifies “public identities”.The scope as stated in the standard
is



“This International Standard specifies the International Standard name
identif*i*er (ISNI) for the identification of public identities of parties;
that is, the identities used publicly by parties involved throughout the
media content industries in the creation, production, management, and
content distribution chains.”



The relevant definitions are:



*3.1*

*party*

natural person or legal person, whether or not incorporated, or a group of
either

*3.3*

*public identity*

Identity of a *party *(3.1) or a fictional character that is or was
presented to the public

*3.4*

*name*

character string by which a *public identity *(3.3) is or was commonly
referenced



A party may have multiple public identities and a public identity may have
multiple names (e.g. pseudonyms)



ISNI data is available as linked data.  There are currently 8 million ISNIs
assigned and 16 million links.



Example:



[image: image001.png]

~Richard.


On 16 June 2014 10:54, Ben Companjen ben.compan...@dans.knaw.nl wrote:

 Hi Stuart,

 I don't have a copy of the official standard, but from the documents on
 the ISNI website I remember that there are name variations and 'public
 identities' (as the lemma on Wikipedia also uses). I'm not sure where the
 borderline is or who decides when different names are different identities.

 If it were up to me: pseudonyms are definitely different public
 identities, name changes after marriage probably not, name change after
 gender change could mean a different public identity. Different public
 identities get different ISNIs; the ISNI organisation says the ISNI system
 can keep track of connected public identities.

 Discussions about name variations or aliases are not new, of course. I
 remember the discussions about 'aliases' vs 'Artist Name Variations' that
 are/were happening on Discogs.com, e.g. 'is J Dilla an alias or a ANV of
 Jay Dee?' It appears the users on Discogs finally went with aliases, but
 VIAF put the names/identities together: http://viaf.org/viaf/32244000 -
 and there is no ISNI (yet).

 It gets more confusing when you look at Washington Irving who had several
 pseudonyms: they are just listed under one ISNI. Maybe because he is dead,
 or because all other databases already know and connected the pseudonyms
 to the birth name? (I just sent a comment asking about the record at
 http://isni.org/isni/000121370797 )


 [Here goes the reference list…]

 Hope this helps :)

 Groeten van Ben

 On 15-06-14 23:11, Stuart Yeates stuart.yea...@vuw.ac.nz wrote:

 Could someone with access to the official text of ISO 27729:2012 tell me
 whether an ISNI is a name identifier or an entity identifier? That is,
 if someone changes their name (adopts a pseudonym, changes their name by
 to marriage, transitions gender, etc), should they be assigned a new
 identifier?
 
 If the answer is 'No' why is this called a 'name identifier'?
 
 Ideally someone with access to the official text would update the
 article at
 https://en.wikipedia.org/wiki/International_Standard_Name_Identifier
 With a brief quote referenced to the standard with a page number.
 
 [The context of this is ORCID, which is being touted as an entity
 identifier, while not being clear on whether it's a name or entity
 identifier.]
 
 cheers
 stuart




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] barriers to open metadata?

2014-04-30 Thread Richard Wallis

To unpack the several questions lurking in Karen’s question.

As to being able to use the WorldCat Works data/identifiers there is no
difference between a or b - it is ODC-BY licensed data.

Getting a Work URI may be easier for a) as they should be able to identify
the OCLC Number and hence use the linked data from it’s URI 
http://worldcat.org/oclc/{ocn} to pick up the link to it’s work.

Tools such as xISBN http://xisbn.worldcat.org/xisbnadmin/doc/api.htm can
step you towards identifier lookups and are openly available for low volume
usage.

Citation lookup is more a bib lookup feature, that you could get an OCLC
Number from. One of colleagues may be helpful on the particulars of this.

Apologies for being WorldCat specific, but Karen did ask.

~Richard.


On 30 April 2014 17:15, Karen Coyle li...@kcoyle.net wrote:

 My question has to do with discoverability. Let's say that I have a
 bibliographic database and I want to add the OCLC work identifiers to it.
 Obviously I don't want to do it by hand. I might have ISBNs, but in some
 cases I will have a regular author/title-type citation.

 and let's say that I am asking this for two different institutions:
 a) is an OCLC member institution
 b) is not

 Thanks,
 kc




 On 4/30/14, 8:47 AM, Dan Scott wrote:

 On Tue, Apr 29, 2014 at 11:37 PM, Roy Tennant roytenn...@gmail.com
 wrote:

 This has now instead become a reasonable recommendation
 concerning ODC-BY licensing [3] but the confusion and uncertainty
 about which records an OCLC member may redistribute remains.

 [3] http://www.oclc.org/news/releases/2012/201248.en.html

 Allow me to try to put this confusion and uncertainty to rest once and
 for
 all:

 ALL THE THINGS. ALL.

 At least as far as we are concerned. I think it's well past time to put
 the
 past in the past.

 That's great, Roy. That's a *lot* simpler than parsing the
 recommendations, WCRR, community norms, and such at [A, B] :)

  Meanwhile, we have just put nearly 200 million works records up as linked
 open data. [1], [2], [3]. If that doesn't rock the library open linked
 data
 world, then no one is paying attention.
 Roy

 [1] http://oclc.org/en-US/news/releases/2014/201414dublin.html
 [2]
 http://dataliberate.com/2014/04/worldcat-works-197-million-
 nuggets-of-linked-data/
 [3] http://hangingtogether.org/?p=3811

 Yes, that is really awesome. But Laura was asking about barriers to
 open metadata, so damn you for going off-topic with PR around a lack
 of barriers to some metadata (which, for those who have not looked
 yet, have a nice ODC-BY licensing statement at the bottom of a given
 Works page) :)

 A. http://oclc.org/worldcat/community/record-use.en.html
 B. http://oclc.org/worldcat/community/record-use/data-
 licensing/questions.en.html


 --
 Karen Coyle
 kco...@kcoyle.net http://kcoyle.net
 m: 1-510-435-8234
 skype: kcoylenet




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] viaf

2014-04-07 Thread Richard Wallis

Hi Eric,

Is this what you are looking for: http://viaf.org/viaf/data/

~Richard


On 7 April 2014 22:15, Eric Lease Morgan emor...@nd.edu wrote:

 Can somebody here point me to an RDF file describing the VIAF ontology?
 Does an authoritative list of VIAF classes and properties exist, and if so,
 where can I get a copy? --Eric Lease Morgan




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] viaf

2014-04-07 Thread Richard Wallis

You are correct that in the xmlns namespace definitions in the xml 
http://viaf.org/viaf/231063554/rdf.xml references a namespace that is no
longer used in the output.  There is a single commented out reference to '
viaf:NameAuthorityCluster' in the output.

This post covers the change that deprecated the use of this:
http://outgoing.typepad.com/outgoing/2011/04/changes-to-viafs-rdf.html

From the other xmlns definitions it can be seen that use is made of void,
owl, foaf, skis-xl, skis, RDA ElementsGR2, FRBRentitiesRDA to describe VIAF
resources.

I'll pass this on to some of the team behind VIAF to see if they have
further comment

~Richard.




On 7 April 2014 23:04, Eric Lease Morgan emor...@nd.edu wrote:

 On Apr 7, 2014, at 10:52 PM, Richard Wallis 
 richard.wal...@dataliberate.com wrote:

  Is this what you are looking for: http://viaf.org/viaf/data/

 Alas, no, not really. I'm looking for an RDF file listing the classes and
 properties used by VIAF. VIAF can return RDF for specific entities, as in
 curl:

   curl -L -H accept: application/rdf+xml http://viaf.org/viaf/231063554/

 Upon what RDF schema is this output based? The header of the output
 alludes to the following URI, but it returns an error:

   http://viaf.org/ontology/1.1/#

 --
 Eric Morgan




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] The lie of the API

2013-12-01 Thread Richard Wallis

 Accept headers it will do string equality tests against.

 Rob



 On Fri, Nov 29, 2013 at 6:24 AM, Seth van Hooland svhoo...@ulb.ac.be
 wrote:
 
  Dear all,
 
  I guess some of you will be interested in the blogpost of my colleague
 and co-author Ruben regarding the misunderstandings on the use and abuse of
 APIs in a digital libraries context, including a description of both good
 and bad practices from Europeana, DPLA and the Cooper Hewitt museum:
 
  http://ruben.verborgh.org/blog/2013/11/29/the-lie-of-the-api/
 
  Kind regards,
 
  Seth van Hooland
  Président du Master en Sciences et Technologies de l'Information et de la
 Communication (MaSTIC)
  Université Libre de Bruxelles
  Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
  http://homepages.ulb.ac.be/~svhoolan/
  http://twitter.com/#!/sethvanhooland
  http://mastic.ulb.ac.be
  0032 2 650 4765
  Office: DC11.102




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

Re: [CODE4LIB] What do you want to learn about linked data?

2013-09-04 Thread Richard Wallis

 about? Some obvious
  examples are: how to do SPARQL queries; how to use triples in
  databases; maybe how to use Protege (free software) [1] to create an
  ontology. Those are just a quick shot across the bow, and from my
  basically non-techie point of view. Please add your own.
 
  If you can't say it in terms of technology, it would be as good (if
  not maybe better) to say it in terms of what you'd like to be able to
  do (do searches, create data... )
 
  This is very unscientific, but I think it's a worthwhile conversation
  to have, and maybe can help get some ideas for training.
 
  kc
  [1] http://protege.stanford.edu/
 
 
  --
  Karen Coyle
  kco...@kcoyle.net http://kcoyle.net
  ph: 1-510-540-7596
  m: 1-510-435-8234
  skype: kcoylenet
 
  
 
  This e-mail message (including any attachments) is for the sole use of
  the intended recipient(s) and may contain confidential and privileged
  information. If the reader of this message is not the intended
  recipient, you are hereby notified that any dissemination, distribution
  or copying of this message (including any attachments) is strictly
  prohibited.
 
  If you have received this message in error, please contact
  the sender by reply e-mail message and destroy all copies of the
  original message (including attachments).




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

[CODE4LIB] WorldCat Implements Content-Negotiation for Linked Data

2013-06-03 Thread Richard Wallis

The Linked Data for the millions of resources in WorldCat.org is now
available as RDF/XML, JSON-LD, Turtle, and Triples via content-negotiation.

Details:
   http://dataliberate.com/2013/06/content-negotiation-for-worldcat/

~Richard.

Re: [CODE4LIB] Deadline for Elag 2013 proposals extended to 15 February

2013-02-08 Thread Richard Wallis

I know conferences like proposals in early, but 18 years before is pushing
it a bit!   ;-)


On 8 February 2013 10:31, Boheemen, Peter van peter.vanbohee...@wur.nlwrote:

 We have extended the deadline for your ELAG 2031 proposals ! Send them in
 before 15 february

 Peter van Boheemen
 Elag Chair




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw

[CODE4LIB] Schema.org and its extension for bib data (was Coins)

2012-11-21 Thread Richard Wallis

On 21 November 2012 07:37, Dave Caroline dave.thearchiv...@gmail.comwrote:

But... it is no good choosing a random extension if the Search engine
is or will be blind to that particular method.
As someone who likes to leverage SEO the right way so one does not
get penalised, some standardisation is needed.

This is exactly what is behind the Schema.org initiative - 'the search
engines' (Google, Bing, Yahoo and Yandex) have agreed to recognise
structured markup using the Schema.org vocabulary and will expand that
recognition over time as the vocabulary evolves.

There is a process, under the wing of the W3C, to propose extensions to the
vocabulary to improve it's descriptive capabilities for particular domains.
As Jeff mentioned, this has already occurred in the areas of news,
commerce, jobs. Coming mostly from a groups organisations in those
domains, these proposals were successful as they came with some authority
to the [by definition] broadly focused group behind schema.org.

It is for that reason, I formed the W3C Group Schema Bib Extend
http://www.w3.org/community/schemabibex/ to create such a consensus in the
community concerned with publishing bibliographic data on the web. All are
welcome to join this group, membership of the W3C is not a requirement.
Elements of this COinS conversation are obviously relevant to such
proposals.

*Bit of Background for those new to this:*

* Schema.org http://schema.org/ introduced in mid 2011 by Google, Yahoo,
Bing, and Yandex.
* A generic vocabulary for describing most things in structured data on the
web that the search engines will recognise
* By June 2012, Google Bing report that 7%-10% of crawled pages contain
schema.org markup
http://dataliberate.com/2012/06/schema-org-consensus-at-semtechbiz/
* Schema Bib Extend W3C Group formed Sept 2012 as a short lived group to
propose bibliographic (in the widest sense) extensions to Schema.
* SchemaBibEx not just focusing on library needs, includes publishers etc.
- anyone wanting to publish bibliographic structured data on the web.
* Schema.org, due to its broad generic nature will only complement, not
replace, other detailed library standards.
* By publishing bib metadata in this way we have at last a way to tell the
world, not only the search engines, about our resources using markup that
will be broadly understood.
* Using markup 'the world will understand, and can use' underpins the
release of WorldCat linked data earlier this year
http://dataliberate.com/2012/06/oclc-worldcat-linked-data-release-significant-in-many-ways/

~Richard.
--
Richard Wallis
Founder, Data Liberate
http://dataliberate.com

Technology Evangelist, OCLC

Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] CODE4LIB equivalent in UK?

2012-10-08 Thread Richard Wallis

The Mashed Library folks might be fertile ground for gaining interest in a
code4libuk.
 http://www.mashedlibrary.com

~Richard.

On 7 October 2012 16:28, Tim Hill th...@astreetpress.com wrote:

 Here's another lurking UK code4libber! I work for a UK/US company, but
 I spend the bulk of my time in the UK (and never enough in the US to
 coincide with a code4lib meetup). I'd certainly be interested in
 getting the/a community more active in the UK.

 Tim Hill

 On Tue, Oct 2, 2012 at 9:12 AM, Simeon Warner simeon.war...@cornell.edu
 wrote:
  Have a look at http://devcsi.ukoln.ac.uk/ . This is mainly focused on
  repositories but seems somewhat similar from an outside view.
 
  Cheers,
  Simeon (lurking expat Brit)
 
 
  On 10/2/12 4:11 AM, Michael Hopwood wrote:
 
  Yes - my question was implicitly aimed at lurking UKavians.
 
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
  Dave Caroline
  Sent: 02 October 2012 09:08
  To: CODE4LIB@LISTSERV.ND.EDU
  Subject: Re: [CODE4LIB] CODE4LIB equivalent in UK?
 
  On Tue, Oct 2, 2012 at 8:55 AM, Michael Hopwood mich...@editeur.org
  wrote:
 
  I know that CODE4LIB isn't per se in the USA but it seems like a
 large
  number of its active users are.
 
  Is there an equivalent list that you folks know of?
 
 
  I dont know of an equivalent British list but there are a few of us
 brits
  about lurking in #cod4lib too (archivist)
 
  Dave Caroline
 
 




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Get Yourself a Linked Data Piece of WorldCat to Play With

2012-08-22 Thread Richard Wallis

Hi Karen,

I was not ignoring you previous question about where, in Marc terms, data
was coming from. I need to talk with someone who was in the core of the
processing that produces the data. Unfortunately I am currently being
thwarted by vacations.

In the meantime, can you let me have a few examples of where you are seeing
discrepancies between the download triples and the RDFa embedded in
WorldCat.org pages.

~Richard.

On 22 August 2012 19:08, Karen Coyle li...@kcoyle.net wrote:

Richard, I've run into yet another area where documentation would be
helpful. There are differences between the schema.org/RDFa that is
embedded in WorldCat data and the exported WorldCat triples in the file.
One of those differences happens to be the source of the place of
publication, if I am reading it right. So, again, a request for
documentation on the fields included and their MARC source.

Thanks,

On 8/17/12 8:38 AM, Richard Wallis wrote:

In case you missed the press release earlier this week.

You can now download a significant number of RDF triples describing the
most highly held 1.2 million resources in WorldCat. Licensed under
ODC-BY.

I've posted more details on my blog:
http://dataliberate.com/2012/**08/get-yourself-a-linked-data-**
piece-of-worldcat-to-play-**with/http://dataliberate.com/2012/08/get-yourself-a-linked-data-piece-of-worldcat-to-play-with/

~Richard.

--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet

--
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

[CODE4LIB] Get Yourself a Linked Data Piece of WorldCat to Play With

2012-08-17 Thread Richard Wallis

In case you missed the press release earlier this week.

You can now download a significant number of RDF triples describing the
most highly held 1.2 million resources in WorldCat.  Licensed under ODC-BY.

I've posted more details on my blog:
http://dataliberate.com/2012/08/get-yourself-a-linked-data-piece-of-worldcat-to-play-with/

~Richard.

Re: [CODE4LIB] Worldcat schema.org search API

2012-07-10 Thread Richard Wallis

Hi Karen

At this stage there is no specific api as such to get at the embedded RDFa
data in WorldCat - you can use the normal UI of WorldCat itself or one of
the WorldCat Search API options such as
OpenSearchhttp://oclc.org/developer/documentation/worldcat-search-api/opensearch.
  This experimental first step at exposing WorldCat data as linked data
will evolve. As more development and discussion guides us, more data and
ways to get at it will appear.

You can get at the raw RDF from the embedded RDFa it in a couple of ways.
 The W3C RDFa 1.1 Distiller http://www.w3.org/2012/pyRdfa/ is one.
 Another is using the ARC2 PHP
Libraryhttps://github.com/semsol/arc2/wiki/Getting-started-with-ARC2
for
those that want to write some simple code.  Bruce Washburn has published a
posthttp://www.oclc.org/developer/news/linked-data-now-worldcat-facebook-app
sharing
how he used ARC2 in the enhanced WorldCat Facebook App to extract the RDF
from WorldCat and process it to link on and use the same technique on Viaf
and FAST.  He includes code snippets and a link to the full source for
those that are interested.

A minor point on licensing - the linked data is licensed under
ODC-BYhttp://opendatacommons.org/licenses/by/,
not CC-BY.  ODC-BY is a data oriented license, as against CC which is more
creative work oriented.

Sorry my email was hard to find - it is richard.wal...@oclc.org.  Also if
you have questions or comments about OCLC linked data formatting or
publishing you can drop an email to d...@oclc.org.

~Richard.

On 10 July 2012 19:42, Karen Coyle li...@kcoyle.net wrote:

 I have demonstrated the schema.org/RDFa microdata in the WC database to
 various folks and the question always is: how do I get access to this? (The
 only source I have is the Facebook API, me being a user rather than a
 maker.) The microdata is CC-BY once you get a Worldcat URI, but is there
 an open search to get one to the desired records in the first place? I'm
 poorly-versed in WC APIs so I'm hoping others have a better grasp.

 @rjw: the OCLC website does a thorough job of hiding email addresses or I
 would have asked this directly. Then again, a discussion here could have
 added value.

 Thanks,
 kc

 --
 Karen Coyle
 kco...@kcoyle.net http://kcoyle.net
 ph: 1-510-540-7596
 m: 1-510-435-8234
 skype: kcoylenet




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Worldcat schema.org search API

2012-07-10 Thread Richard Wallis

Karen,

RDFa and the basic schema.org vocabulary, plus the intention of the
proposed library extension, are not OCLC specific - they are generic tools
and techniques applicable across many domains.

I would therefore avoid library focussed tool sites, which would run the
risk of not keeping up with wider developments. RDFa.info seems to be
shaping up as a good resource.

Schema.org itself also is a good resource.

On the point of how to gain the best from linked data, many especially in
the library community, immediately look towards search as the default *
paradise* for dealing with data. Many of the benefits of linked data
emerge not from search, but from identifying relationships and following
links. I heard this described the other day as 'facets on steroids' - not
entirely accurate, but it conjures up the right kind of image ;-)

I am not saying ignore search, far from it, just suggesting that innovation
with linked data often comes from what you can do once you have found (often
by traditional methods) a thing .

~Richard.

On 10 July 2012 21:34, Karen Coyle li...@kcoyle.net wrote:

Thanks, Kevin! And Richard!

I'm thinking we need a good web site with links to tools. I had already
been introduced to

http://www.w3.org/2012/pyRdfa/

where you can past a URI and get ttl or rdf/xml. These are all good
resources. But what about someone who wants to do this programmatically,
not through a web site? Richard's message indicates that this isn't yet
available, so perhaps we should be gathering use cases to support the need?
And have a place to post various solutions, even ones that are not
OCLC-specific? (Because I am hoping that the use of microformats will
increase in general.)

On 7/10/12 12:12 PM, Kevin Ford wrote:

is there an open search to get one to the desired records in the first
place?
-- I'm not certain this will fully address your question, but try these
two sites:

Website:
http://www.google.com/**webmasters/tools/richsnippetshttp://www.google.com/webmasters/tools/richsnippets
Example: http://tinyurl.com/dx3h5bg

Website:
http://linter.structured-data.**org/http://linter.structured-data.org/
Example: http://tinyurl.com/bmm8bbc

These sites will extract the data, but I don't think you get your choice
of serialization. The data are extracted and displayed on the resulting
page in the HTML, but at least you can *see* the data.

Additionally, there are a number of tools to help with microdata
extraction here:

http://schema.rdfs.org/tools.**html http://schema.rdfs.org/tools.html

Some of these will allow you to output specific (RDF) serializations.

HTH,

Kevin

On 07/10/2012 02:42 PM, Karen Coyle wrote:

I have demonstrated the schema.org/RDFa microdata in the WC database to
various folks and the question always is: how do I get access to this?
(The only source I have is the Facebook API, me being a user rather
than a maker.) The microdata is CC-BY once you get a Worldcat URI, but
is there an open search to get one to the desired records in the first
place? I'm poorly-versed in WC APIs so I'm hoping others have a better
grasp.

@rjw: the OCLC website does a thorough job of hiding email addresses or
I would have asked this directly. Then again, a discussion here could
have added value.

Thanks,
kc

--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet

--
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Worldcat schema.org search API

2012-07-10 Thread Richard Wallis

 a web site? Richard's message indicates that this isn't
 yet
 available, so perhaps we should be gathering use cases to support the
 need? And have a place to post various solutions, even ones that are
 not
 OCLC-specific? (Because I am hoping that the use of microformats will
 increase in general.)

 kc


 On 7/10/12 12:12 PM, Kevin Ford wrote:

 is there an open search to get one to the desired records in the

 first

 place?

 -- I'm not certain this will fully address your question, but try
 these two sites:

 Website: 
 http://www.google.com/**webmasters/tools/richsnippetshttp://www.google.com/webmasters/tools/richsnippets
 Example: http://tinyurl.com/dx3h5bg

 Website: 
 http://linter.structured-data.**org/http://linter.structured-data.org/
 Example: http://tinyurl.com/bmm8bbc

 These sites will extract the data, but I don't think you get your
 choice of serialization.  The data are extracted and displayed on the
 resulting page in the HTML, but at least you can *see* the data.

 Additionally, there are a number of tools to help with microdata
 extraction here:

 http://schema.rdfs.org/tools.**htmlhttp://schema.rdfs.org/tools.html

 Some of these will allow you to output specific (RDF) serializations.


 HTH,

 Kevin


 On 07/10/2012 02:42 PM, Karen Coyle wrote:

 I have demonstrated the schema.org/RDFa microdata in the WC
 database to
 various folks and the question always is: how do I get access to
 this?
 (The only source I have is the Facebook API, me being a user
 rather
 than a maker.) The microdata is CC-BY once you get a Worldcat
 URI, but
 is there an open search to get one to the desired records in the
 first
 place? I'm poorly-versed in WC APIs so I'm hoping others have a
 better
 grasp.

 @rjw: the OCLC website does a thorough job of hiding email
 addresses or
 I would have asked this directly. Then again, a discussion here
 could
 have added value.

 Thanks,
 kc


 --
 Karen Coyle
 kco...@kcoyle.net http://kcoyle.net
 ph: 1-510-540-7596
 m: 1-510-435-8234
 skype: kcoylenet




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Transcription/dictation software?

2012-02-27 Thread Richard Wallis

Hi,

Not sure of Mechanical Turk.

However, have used CastingWords [http://castingwords.com/] successfully for
transcribing podcasts - their budget rate is $1/minute.

~Richard

On 27 February 2012 20:22, Nathan Tallman ntall...@gmail.com wrote:

 Is Amazon Mechanical Turk expensive? Anyone know an average cost for an
 hour of audio?

 Nathan

 On Mon, Feb 27, 2012 at 1:56 PM, Sean Hannan shan...@jhu.edu wrote:

  Mechanical Turk it.




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] RDF advice

2012-02-13 Thread Richard Wallis

Hi Ethan,

I will defer to those with greater insight, into what has been discussed
earlier in this thread, than myself as to some of the semantics you are
trying to crystallise here.

What I can offer instead is a bit of advice as to lubricating the process.

Firstly, stay as far away from XML as possible whilst trying to shape your
model/ontologies - it a) introduces hierarchical thinking/visualisation in
to what may well not be a problem of hierarchy, b) is difficult to read, c)
in the world of RDF, best reserved for machine to machine communication.

Secondly, put away the computer and get out the white/blackboard and pen.
Start drawing some ellipses, rectangles, and arrows. When you have a
model that looks something like the real world you are trying to represent
(not the traditional metadata records you previously held), transform that
in to a form of RDF that a computer will understand.

This is an approximation of the process the British Library used to work
their way towards their data
modelhttp://dataliberate.com//wp-content/uploads/2012/01/British-Library-Data-Model-v1.01.pdf
for
the British National Bibliography.

Oh, and the XML? - Let a tool like Raptor produce it for you from the more
human friendly turtle you come up with.

~Richard.

On 13 February 2012 21:43, Ethan Gruber ewg4x...@gmail.com wrote:

Hi Patrick,

Thanks. That does make sense. Hopefully others will weigh in with
agreement (or disagreement). Sometimes these semantic languages are so
flexible that it's unsettling. There are a million ways to do something
with only de facto standards rather than restricted schemas. For what it's
worth, the metadata files describe coin-types, an intellectual concept in
numismatics succinctly described at
http://coins.about.com/od/coinsglossary/g/coin_type.htm, not physical
objects in a collection.

Ethan
--

Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-13 Thread Richard Wallis

Peter,

On 12 December 2011 22:11, Peter Noerr pno...@museglobal.com wrote:

 Trying to synthesize what Karen, Richard and Simon have bombarded us with
 here, leads me to conclude that linking to existing (or to be created)
 external data (ontologies and representations) is a matter of: being sure
 what you’re the system's current user's context is, and being able to
 modify the external data brought into the users virtual EMU(see below ***
 before reading further).


Sorry for the bombarding ;-)

being sure what you’re the system's current user's context is - sounds
like a nice idea, but when you are publishing data you have little control,
and even less knowledge, of the consuming 'user' and their context.

Taking things to the next level, by building services and applications for
users, you hopefully will have some understanding of the virtual and actual
users' contexts and can take [what I like to call editorial] decisions
about how much data in what format to deliver to them, and which links to
follow to enrich your service.

So, back down at the data level, model your domain to include all the
information you are aware of for the entities you are describing, plus link
them to other domains that can enrich those descriptions.   Leave it to the
consumers of your data to decide what is best for them in their context.


 I think Simon is right that records will increasingly become virtual in
 that they are composed as needed by this user for this purpose at this
 time.


Yes - you could envisage, for some domains,  a minimalistic description of
their resource could be sufficient in the form of a single triple:
http://mylib.org/resource/12345 owl:sameAs 
http://bnb.data.bl.uk/id/resource/008740700 .


 I think Simon (maybe Richard, maybe all of you) was working towards a
 single unique EMU for the entity which holds all unique information about
 it for a number of different uses/scenarios/facets/formats. Of course
 deciding on what is unique and what is obtained from some more granular
 breakdown is another issue. (Some experience with this onion skin
 modeling lies deep in my past, and may need dredging up.)


I am suggesting that you in your domain/catalog/library would probably
assign a unique identifier, in your domain, for each of the things you
describe:
 http://mylib.org/resource/12345
 http://mylib.org/person/CarpenterEdward1910-1998

Describe those things:
http://mylib.org/resource/008740700 rdf:type bibo:Book .
 http://mylib.org/person/CarpenterEdward1910-1998 foaf:name Edward
Carpenter .

Describe the relationships between those things:
 http://mylib.org/resource/008740700 dct:creator 
http://mylib.org/person/CarpenterEdward1910-1998 .

Then link them to external descriptions of the same concepts:
 http://mylib.org/resource/12345 owl:sameAs 
http://bnb.data.bl.uk/id/resource/008740700 .
 http://mylib.org/person/CarpenterEdward1910-1998 owl:sameAs 
http://viaf.org/viaf/53127337 .

That way you end up with internal identifiers that you can link to, from
things like comments, circulation records, physical location information,
etc.  These are then linked out to distributed descriptions which you, or
consumers of your data, can then merge with your data to provide richer
information.   I know the above examples are a bit simplistic, but
nevertheless it could be near good-enough for some use cases.


*** I suggest (and use above) the Entity Metadata Unit = EMU. This contains
 the totality of unique information stored about this entity in this single
 logical location.


In my current location, and the current economic climate, I am wary of an
acronym the same as European Monetary Union.  ;-)

However, I think you are thinking in the right direction - I am resigning
myself to just using the word 'description'.

~Richard.


-- 
Richard Wallis
Technology Evangelist, Talis
http://consulting.talis.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-13 Thread Richard Wallis

Simon,

You wrote:

 Q: In your definition, can *descriptions *be put* * into 1:1 correspondence
 with records (where a record is a atomic asserted set of propositions about
 a resource)?


I do not believe so, especially when referencing back to where we started -
the Marc Record.

A Marc record more often than not, contains propositions about many things:
 * The book itself (lets assume that's what the record is about) - isbn,
number of pages, cost, format, shelf location
 * The author - name, birth/death date
 * The publisher - name, location
 * Publication event - date, publisher, location
 * Subject(s)

In my view this record contains information to populate 5 or more separate
descriptions, plus the related links between them.


 On Tue, Dec 13, 2011 at 3:22 PM, Karen Coyle li...@kcoyle.net wrote:


  Yes, I realize that you were asking Richard, but I'm a bit forward, as we
  know.


Karen, thanks for diving in ;-)

I do NOT see a description as atomic in the sense that a record is
  atomic. A record has rigid walls, a description has permeable ones. A
  description always has the POTENTIAL to have a bit of unexpected data
  added; a record cuts off that possibility.


Yes.  Take the author thing from above. It may have it's basic, Marc record
derived information, enhanced, by merging with external resources, to add
an author's website or image.


 
  That said, I am curious about the permeability of the edges of a named
  graph. I don't know their degree of rigidity in terms of properties
 allowed.
 

 Named graphs were supposed to be invariant under the original proposal;
  there is a lot of mess around the semantics right now. Dan Brickley wrote
 a very nice example : http://danbri.org/words/2011/11/03/753 .


As per the comments on Dan's blog, it is dangerous to jump on named graphs
as the solution to perceived problems.  If I wanted to load RDF from three
separate libraries in to a triple store I would  assign them to three named
graphs, but then probably query the default global graph giving a merged
view.

Using named graphs to try to recreate our original source record seems to
defeat the [opening up] purpose of moving to Linked Data modeling in the
first place.  I also think it would add in a layer of complexity without an
obvious justifying data consumer use case.

~Richard



-- 
Richard Wallis
Technology Evangelist, Talis
http://consulting.talis.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-13 Thread Richard Wallis

On 13 December 2011 22:17, Peter Noerr pno...@museglobal.com wrote:

 I agree with Karen below that a record seems more bounded and static,
 whereas a description varies according to need. And that is the distinction
 I was trying to get at: that the item stored in some database is everything
 unique about that entity - and is static, until some data actually changes,
 whereas the description is built at run time for the user and may contain
 some data from the item record, and some aggregated from other, linked,
 item records. The records all have long term existence in databases and the
 like, whereas the description is a view of all that stored data appropriate
 for the moment. It will only be stored as a processing intermediate result
 (as a record, since its contents are now fixed), and not long term, since
 it would be broken up to bits of entity data and stored in a distributed
 linked fashion (much like, as I understand it, the BL did when reading MARC
 records and storing them as entity updates.)


Yes.  However those descriptions have the potential to be as permanent as
the records that they were derived from.  As in the BL's case where the RDF
is stored, published and queried in [Talis] Kasabi.com:
http://kasabi.com/dataset/british-national-bibliography-bnb



 Having said all that, I don't like the term description as it carries a
 lot of baggage, as do all the other terms. But I'm stuck for another one.


Me too.  I'm still searching searching for a budget airline term - no
baggage!

~Richard.

-- 
Richard Wallis
Technology Evangelist, Talis
http://consulting.talis.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-12 Thread Richard Wallis

On 11 December 2011 23:47, Karen Coyle li...@kcoyle.net wrote:

 Quoting Richard Wallis richard.wal...@talis.com:


  You get the impression that the BL chose a subset of their current
 bibliographic data to expose as LD - it was kind of the other way around.
 Having modeled the 'things' in the British National Bibliography domain
 (plus those in related domain vocabularis such as VIAF, LCSH, Geonames,
 Bio, etc.), they then looked at the information held in their [Marc] bib
 records to identify what could be extracted to populate it.


 Richard, I've been thinking of something along these lines myself,
 especially as I see the number of translating X to RDF projects go on. I
 begin to wonder what there is in library data that is *unique*, and my
 conclusion is: not much. Books, people, places, topics: they all exist
 independently of libraries, and libraries cannot take the credit for
 creating any of them. So we should be able to say quite a bit about the
 resources in libraries using shared data points -- and by that I mean, data
 points that are also used by others. So once you decide on a model (as BL
 did), then it is a matter of looking *outward* for the data to re-use.


Yes!




 I maintain, however, as per my LITA Forum talk [1] that the subject
 headings (without talking about quality thereof) and classification
 designations that libraries provide are an added value, and we should do
 more to make them useful for discovery.


The wider world is always looking for good ways to categorise things.  The
library community should make it easy for others to utilise their rich
heritage of such things. LCSH is an obvious candidate, so is VIAF amongst
others.  The easier we make it, the more uptake there will be and the more
inbound links in to library resources we will get.  By easier, I am
suggesting that efforts to map these library concepts (where they fit) to
their wider world equivalents found in places like Dbpeadia, New York
Times, and Geonames, will greatly enhance the use and visibility of library
resources.




 I know it is only semantics (no pun intended), but we need to stop using
 the word 'record' when talking about the future description of 'things' or
 entities that are then linked together.   That word has so many built in
 assumptions, especially in the library world.


 I'll let you battle that one out with Simon :-), but I am often at a loss
 for a better term to describe the unit of metadata that libraries may
 create in the future to describe their resources. Suggestions highly
 welcome.


Your are not the only one who is looking for a better term for what is
being created - maybe we should hold a competition to come up with one.



-- 
Richard Wallis
Technology Evangelist, Talis
http://consulting.talis.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-12 Thread Richard Wallis

On 12 December 2011 11:16, Alexander Johannesen 
alexander.johanne...@gmail.com wrote:

 Richard Wallis richard.wal...@talis.com wrote:
  Your are not the only one who is looking for a better term for what is
  being created - maybe we should hold a competition to come up with one.

 A named graph gets thrown around a lot, and even though this is
 technically correct, it's neither nice nor sexy.


It also carries lots of baggage from the Linked Data/Triple store
communities that would get in the way.


 In my past a bucket was much used, as you can easily thrown things in or
 take it out (as opposed to the more terminal record being set), however
 people have a problem with the conceptual size of said bucket, which more
 or less summarizes why this term is so hard to pin down.


Yes, most would assume that a bucket would be the place to put their [think
of a better word than] records.



 I have, however, seen some revert the old RDBMS world of rows, as they
 talk about properties on the same line, just thinking the line to be more
 flexible than what it used to be, but we'll see if it sticks around.


Collection of triples?


 Personally I think the problem is that people *like* the idea of a closed
 little silo that is perfectly contained, no matter if it is technically
 true or not, and therefore futile. This is also why, I think, it's been so
 hard to explain to more traditional developers the amazing advantages you
 get through true semantic modelling; people find it hard to let go of a
 pattern that has helped them so in the past.


A classic example of only being able to describe/understand the future in
the terms of your past experience.


 Breaking the meta data out of the wonderful constraints of a MARC record?
 FRBR/RDA will never fly, at least not until they all realize that the
 constraints are real and that they truly and utterly constrain not just the
 meta data but the future field of librarying ... :)


:-)

~Richard.
-- 
Richard Wallis
Technology Evangelist, Talis
http://consulting.talis.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-11 Thread Richard Wallis

On 10 December 2011 13:14, Karen Coyle li...@kcoyle.net wrote:

I don't believe that anyone is saying that we have a goal of having a
 re-serialization of ISO 2709 in RDF so that we can begin to use that as our
 data format. We *do* have millions of records in 2709 with cataloging based
 on AACR or ISBD or other rules. The move to any future format will have to
 include some kind of transformation of that data. The result will be
 something ugly, at least at first: AACR in RDF is not going to be good
 linked data.


I agree with your sentiment here but, from what you imply at
http://futurelib.pbworks.com/w/page/29114548/MARC%20elements,
transformation in to something that would be recognisable by the
originators of the source Marc will be difficult - and yes ugly.

The refreshing thing about the work done by the BL is that they stepped
away from the 'record', modeled the things that make up the BnB domain.
Then they implemented processes to extract rich data from the source Marc,
enrich it with external links, and load it to an RDF representation of the
model.

On the way, embedded in the extraction/transformation/enrichment processes
there was much ugly data, but that was not exposed beyond the process.  An
approach I applaud, unlike muddying the waters by attempting to publish
vocabularies for every Marc tag you can think of.


I believe that you and I share a concern: that current library data is
 based on such a different model than that of the Semantic Web that by
 looking at our past data we will fail to understand or take advantage of
 linked data as it should be.


Concern shared.   I would however lower my sights slightly by setting the
current objective to be 'Publishing bibliographic information as Linked
Data to become a valuable and useful part of a Web of Data'.   Using the
Semantic Web as a goal introduces even more vagueness and baggage.  I
firmly believe that establishing a linked web of data will eventually
underpin a Semantic Web, but  there is still a few steps to go before we
get anywhere near that.


  Unfortunately, the library cataloging world has no proposal for linked
 data cataloging. I'm not sure where we could begin.


This is not surprising and I believe, at this stage, it is not a problem.
Lets eat the elephant one bite at a time - I envisage a lengthy interim
phase where publishing linked bibliographic data derived from traditional
Marc records (using processes championed by a community such as CODE4LIB),
is the norm.  Cataloging processes and systems that use a Linked Data model
at the core should then emerge, to satisfy a then established need.

~Richard

-- 
Richard Wallis
Technology Evangelist, Talis
http://consulting.talis.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-11 Thread Richard Wallis

Karen,

On 11 December 2011 15:18, Karen Coyle li...@kcoyle.net wrote:

Quoting Richard Wallis richard.wal...@talis.com:

I agree with your sentiment here but, from what you imply at
http://futurelib.pbworks.com/**w/page/29114548/MARC%**20elementshttp://futurelib.pbworks.com/w/page/29114548/MARC%20elements
,
transformation in to something that would be recognisable by the
originators of the source Marc will be difficult - and yes ugly.

The refreshing thing about the work done by the BL is that they stepped
away from the 'record', modeled the things that make up the BnB domain.
Then they implemented processes to extract rich data from the source Marc,
enrich it with external links, and load it to an RDF representation of the
model.

Richard, this is an interesting statement about the BL data. Are you
saying that they chose a subset of their current bibliographic data to
expose as LD? (I haven't found anything yet that describes the process
used, so if there is a document I missed, please send link!)

There is no document I am aware of, but I can point you at the blog post by
Tim Hodson [
http://consulting.talis.com/2011/07/british-library-data-model-overview/]
who helped the BL get to grips with and start thinking Linked Data.
Another by the BL's Neil Wilson [
http://consulting.talis.com/2011/10/establishing-the-connection/] filling
in the background around his recent presentations about their work.

You get the impression that the BL chose a subset of their current
bibliographic data to expose as LD - it was kind of the other way around.
Having modeled the 'things' in the British National Bibliography domain
(plus those in related domain vocabularis such as VIAF, LCSH, Geonames,
Bio, etc.), they then looked at the information held in their [Marc] bib
records to identify what could be extracted to populate it.

This almost sounds like the FRBR process, BTW - modeling the domain, which
is also step one of the Singapore Framework/Dublin Core Application Profile
process, then selecting data elements for the domain. [1] FRBR,
unfortunately, has perceived problems as model (which I am attempting to
gather up here [2] but may move to the LLD community wiki space to give it
more visibility).

The BL will tell you that their model is designed to add to the
conversation around how to progress the modelling bibliographic information
as Linked Data. There is still a way to go. They are currently looking at
how to model multi-part works in the current model and hope to enhance it
to bring in other concepts such as FRBR.

The work that I'm doing is not based on the assumption that all of MARC
will be carried forward. The reason I began my work is that I don't think
we know what is in the MARC record -- there is similar data scattered all
over, some data that changes meaning as indicators are applied, etc. There
is no implication that a future record would have all of those data
elements, ...

I know it is only semantics (no pun intended), but we need to stop using
the word 'record' when talking about the future description of 'things' or
entities that are then linked together. That word has so many built in
assumptions, especially in the library world.

Concern shared. I would however lower my sights slightly by setting the
current objective to be 'Publishing bibliographic information as Linked
Data to become a valuable and useful part of a Web of Data'. Using the
Semantic Web as a goal introduces even more vagueness and baggage. I
firmly believe that establishing a linked web of data will eventually
underpin a Semantic Web, but there is still a few steps to go before we
get anywhere near that.

My concern is the creation of LD silos. BL data uses some known namespaces
(BIBO, FOAF, BIO), which in fact is a way to join the web of data that
many others are participating in, because your foaf:Person can interact
with anyone else's foaf:Person. But there are a great number of efforts
that are modeling current records (FRBRer, ISBD, MODS, RDA) and are
entirely silo'd - there is nothing that would connect the data to anyone
else's data (and the ones mentioned would not even connect to each other).
So I don't know what you mean by part of a Web of data but to me using
non-silo'd properties is enough to meet that criterion. Another possibility
is to create links from your properties to properties outside of your silo,
e.g. from RDA:Person to foaf:Person, for sharing and discoverability.

There a couple of ways that your domain can link in to the wider web of
data. Firstly, as you identify, by sharing vocabularies. There is a small
example in the middle of the BL model, where a Resource is both a
dct:BiblographicResource and also (when appropriate) a bibo:Book.

In Linked Data there is nothing wrong in mixing ontologies within one
domain. If the thing you are modelling is identified as being a
foaf:person, there is no reason why it can not also be defined

Re: [CODE4LIB] Namespace management, was Models of MARC in RDF

2011-12-08 Thread Richard Wallis

On 7 December 2011 16:29, Karen Coyle li...@kcoyle.net wrote:

 (As an aside, there is some concern that the use of FRBR will make linking
 from library bibliographic data to non-library bibliographic data
 difficult, if not impossible. Having had some contact with members of the
 FRBR review group, they seem impervious to that concern.)

 kc


I somehow missed out on this thread and it's predecessor, until a major
fail in the British rail system resulted in an unexpected coffee with Owen
yesterday - I hope he got home OK.However the benefit of being late to
a conversation is that you can see where the points of friction are.  So a
few thoughts on those:

Why bother?
Transforming Marc in to RDF is an interesting and challenging exercise, but
there is little point in doing it without having some potential benefits in
mind beyond the it would be great to have our stuff in a new format


RDF is a means to an end
We shouldn't loose sight of the RDF TLA, Resource Description Framework -
it is a framework for describing [our] resources.   It is the, de facto,
standard for publishing Linked Data.   Publishing descriptions of our
resources as Linked Data does fall in to the potential benefits arena -
reuse, mixing, merging, lowering barriers to use of data across, and from
outside of, the library community.


If it waddles and quacks, it is probably still a duck
Transforming a Marc record to XMLMarc just created the same record in in a
different wrapper.  Apart from the technical benefit (of being able to use
generic tools to work with it), it did not move us much further forward
towards opening up our data to wider use. Transforming Marc, of any flavor,
into an RDF representation of a record still leaves us with a record per
item - a digital card catalogue equivalent.


A record is a silo within a silo
A record within a catalogue duplicates the publisher/author/subject/etc.
information stored in adjacent records describing items by the same
author/publisher/etc.  This community spends much of it's effort on the
best ways to index and represent this duplication to make records
accessible.   Ideally an author, for instance, should be described
[preferably only once] and then related to all the items they produced


Linked Data should be the goal
At the event mentioned by Mike, Linked Data and Libraries[1], the British
Library launched their initial data model for the British National
Bibliography[2].  One of the key concepts of Linked Data is to represent
data as a set of interlinked things. These things are referred to as
objects of interest, they are things about which we can make statements.
In this model you get statements about things (eg. books, authors,
publishers, publishing events, subjects, places, etc.) and the links
between them - not a record per item.


Storing Marc in an RDF triple, or link to it?
The question I would ask is, which consumer of your data would this be
useful for?  Secondly, whatever your answer, it does not make sense to say
that this item, or author, or publisher 'thing' was derived from a
particular Marc record - you could perhaps at data set, or graph, level
(using the provenance vocabulary) define that it was transformed from a
particular source, at a time, using a method, by a person/process.


Who's Ontology
Do we only use library domain ontologies/vocabularies or do we employ dc,
foaf, bibo, etc. ?  Do we use dc:creator which most of the [non-library]
world will understand, or some esoteric [to them] rda properties to
describe corporate and many other nuance of authorship?   If you want to
enable general application developers/data consumers to use your data, you
need to apply the well known [if possibly course-grained or lossy] terms.
If you want to preserve the rich detail extracted from the source Marc, you
need to delve deeper in to bibliographically oriented properties.   Can you
do both? Yes.  Should you do both? Probably.

~Richard.

I think I better stop now and contemplate a blog post to further these
thoughts.


[1]
http://consulting.talis.com/resources/presentations-from-linked-data-and-libraries-2011/
[2]http://consulting.talis.com/2011/07/british-library-data-model-overview/



-- 
Richard Wallis
Technology Evangelist, Talis
http://consulting.talis.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

Re: [CODE4LIB] Apps to reduce large file on the fly when it's requested

2011-08-04 Thread Richard Wallis

Why not let someone else, such as the Google, do the heavy lifting for you:
https://docs.google.com/viewer

~Richard.

On 4 August 2011 07:39, Dave Caroline dave.thearchiv...@gmail.com wrote:

 One method is to dispense with PDF and just view the scanned pages online
 as
 images or OCR'd text or point the user to a directory with the scans
 for the document.
 He then only needs an image viewer using a lot less of his machines memory.

 Large PDF's also cause problems in the viewing computer. I was
 reviewing someones
 25mb PDF the other day and it peaked at 3.3 gig memory use, which on a
 2.5gig
 memory box meant it went into swap and slowed to a crawl.
 The viewer used there was evince.

 I scan to jpg and only produce a PDF if nagged

 http://www.collection.archivist.info/archive/manuals/IS44_Tektronix_602_display_unit/

 As I serve from home and the upload is on the slow side individual
 pages helps there too.
 And when in a good mood I finish off a document thus
 http://www.collection.archivist.info/searchv13.php?searchstr=lucas+tp1
 where all pages are web viewable. Been too lazy to write a page to
 page link on the page
 view so far (need a round tuit).

 Dave Caroline




-- 
Richard Wallis
Technology Evangelist, Talis
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.com

[CODE4LIB] Last few places available for Talis Open Day: Linked Data and Libraries - British Library London 21st July

2010-07-08 Thread Richard Wallis

Apologies for cross posting if you receive this more than once.

If you have not already registered for this popular event, there are only a few 
places remaining to join representatives from several National Libraries and 
others interested in the potential for, and application of, Linked data at this 
free Talis Open Day at the British Library Conference Centre, London on 21st 
July.  Register now to avoid 
disappointmenthttp://www.talis.com/platform/events/.

It is a day for those interested in the potential for, and application of, 
Linked data by International and National libraries.  Sessions will cover an 
introduction to Linked Data, a short RDF and SPARQL tutorial (pitched at a 
level which will engage the technical and inform the non-technical), and 
descriptions of National and International Library Linked Data projects and 
initiatives.

Speakers confirmed include:

 *   Romain Wenz, Bibliothèque nationale de France — The BnF Pivot project to 
to build a Linked Data RDF catalogue linked in to the web of data.
 *   Antoine Issacs, Scientific Coordinator, Europeana — W3C Library Linked 
Data Incubator Group
 *   Rob Styles, Talis — Finding Semantic Relationships in Marc


Linked Data is being adopted by many significant organisations across the web.  
Data.gov.ukhttp://ata.gov.uk/ and the BBC are just two organisations that are 
working with Talis, applying Linked Data Semantic Web techniques and 
technologies.   As can be seen from the agenda below, this day will (in 
addition to addressing general Linked Data issues) be covering leading library 
specific initiatives in this area.

AGENDA

 *   Introduction to Linked Data
 *   Overview of the Talis Platform
 *   The Bnf Pivot project
 *   W3C Library Linked Data Incubator Group
 *   Linked Data, RDF and SPARQL tutorial
 *

Linked Bibliographic Data – Semantic relationships in Marc, Bibo ontology

 *   Linked Data in action
 *   Lightning Talks

This is an ideal free day for those wanting an insight in to the potential and 
the practicalities of applying Linked Data to library data.

More information and registration:  
http://blogs.talis.com/nodalities/2010/06/talis-open-day-linked-data-and-libraries.php

Richard Wallis
Technology Evangelist, Talis
Tel: +44 (0)870 400 5422 (Direct)
Tel: +44 (0)870 400 5000 (Switchboard)
Tel: +44 (0)7767 886 005 (Mobile)
Fax: +44 (0)870 400 5001

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.commailto:rjw3...@hotmail.com






Please consider the environment before printing this email.

Find out more about Talis at http://www.talis.com/
shared innovation™

Any views or personal opinions expressed within this email may not be those of 
Talis Information Ltd or its employees. The content of this email message and 
any files that may be attached are confidential, and for the usage of the 
intended recipient only. If you are not the intended recipient, then please 
return this message to the sender and delete it. Any use of this e-mail by an 
unauthorised recipient is prohibited.

Talis Information Ltd is a member of the Talis Group of companies and is 
registered in England No 3638278 with its registered office at Knights Court, 
Solihull Parkway, Birmingham Business Park, B37 7YB.

[CODE4LIB] Talis Open Day: Linked Data and Libraries - British Library London 21st July

2010-06-24 Thread Richard Wallis

Apologies for cross posting if you receive this more than once.

The agenda for this free Talis Open Day at the British Library Conference 
Centre, London on 21st July, focused upon Linked Data and Libraries is now 
taking shape.

It is a day for those interested in the potential for, and application of, 
Linked data by International and National libraries.  Sessions will cover an 
introduction to Linked Data, a short RDF and SPARQL tutorial (pitched at a 
level which will engage the technical and inform the non-technical), and 
descriptions of National and International Library Linked Data projects and 
initiatives.

Speakers confirmed include:

 *   Romain Wenz, Bibliothèque nationale de France — The BnF Pivot project to 
to build a Linked Data RDF catalogue linked in to the web of data.
 *   Antoine Issacs, Scientific Coordinator, Europeana — W3C Library Linked 
Data Incubator Group
 *   Rob Styles, Talis — Finding Semantic Relationships in Marc


Linked Data is being adopted by many significant organisations across the web.  
data.gov.ukhttp://data.gov.uk/ and the BBC are just two that are working with 
Talis applying Linked Data Semantic Web techniques and technologies.   As can 
be seen from the agenda below, this day will (in addition to addressing general 
Linked Data issues) be covering leading library specific initiatives in this 
area.

AGENDA

 *   Introduction to Linked Data
 *   Overview of the Talis Platform
 *   The Bnf Pivot project
 *   W3C Library Linked Data Incubator Group
 *   RDF/SPARQL tutorial
 *   Bibo – The Bibliographic Ontology
 *   Finding Semantic Relationships in MARC
 *   Linked Data in action

This is an ideal free day for those wanting an insight in to the potential and 
the practicalities of applying Linked Data to library data.

More information and registration:  
http://blogs.talis.com/nodalities/2010/06/talis-open-day-linked-data-and-libraries.php

Richard Wallis
Technology Evangelist, Talis
Tel: +44 (0)870 400 5422 (Direct)
Tel: +44 (0)870 400 5000 (Switchboard)
Tel: +44 (0)7767 886 005 (Mobile)
Fax: +44 (0)870 400 5001

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.commailto:rjw3...@hotmail.com






Please consider the environment before printing this email.

Find out more about Talis at http://www.talis.com/
shared innovation™

Any views or personal opinions expressed within this email may not be those of 
Talis Information Ltd or its employees. The content of this email message and 
any files that may be attached are confidential, and for the usage of the 
intended recipient only. If you are not the intended recipient, then please 
return this message to the sender and delete it. Any use of this e-mail by an 
unauthorised recipient is prohibited.

Talis Information Ltd is a member of the Talis Group of companies and is 
registered in England No 3638278 with its registered office at Knights Court, 
Solihull Parkway, Birmingham Business Park, B37 7YB.

[CODE4LIB] Talis Open Day: Linked Data and Libraries - British Library London 21st July

2010-06-02 Thread Richard Wallis

Hi all,

Apologies for cross posting.

The latest in the series of free Linked Data focused Platform Open Days hosted 
by Talis may be of interest.

This one is specify for anyone interested in understanding and applying Linked 
Data in the world of National, International, Cooperative, and other large 
libraries.

These Open Days are designed to introduce you to the principles, practice and 
potential of Linked Data.  Included is a short tutorial on RDF and the SPARQL 
query language, pitched at a level which will engage the technical and inform 
the non-technical attendees.

Linked Data is being adopted by many significant organisations across the web.  
data.gov.ukhttp://data.gov.uk and the BBC are just two that are working with 
Talis on applying Linked Data Semantic Web techniques and technologies.   As 
can be seen from the provisional agenda below, this day will (in addition to 
addressing general Linked Data issues) be covering leading library specific 
initiatives in this area.

AGENDA

 *   Introduction to Linked Data
 *   Overview of the Talis Platform
 *   The Bnf Pivot project – Emmanuelle Bermes, Bibliothèque nationale de France
 *   W3C Library Linked Data Incubator Group
 *   RDF/SPARQL tutorial
 *   Bibo – The Bibliographic Ontology
 *   Finding Semantic Relationships in MARC
 *   Linked Data in action

This is an ideal free day for those wanting an insight in to the potential and 
the practicalities of applying Linked Data to library data.

More information and registration:  
http://blogs.talis.com/nodalities/2010/06/talis-open-day-linked-data-and-libraries.php


Richard Wallis
Technology Evangelist, Talis
Tel: +44 (0)870 400 5422 (Direct)
Tel: +44 (0)870 400 5000 (Switchboard)
Tel: +44 (0)7767 886 005 (Mobile)
Fax: +44 (0)870 400 5001

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw
IM: rjw3...@hotmail.commailto:rjw3...@hotmail.com






Please consider the environment before printing this email.

Find out more about Talis at http://www.talis.com/
shared innovation™

Any views or personal opinions expressed within this email may not be those of 
Talis Information Ltd or its employees. The content of this email message and 
any files that may be attached are confidential, and for the usage of the 
intended recipient only. If you are not the intended recipient, then please 
return this message to the sender and delete it. Any use of this e-mail by an 
unauthorised recipient is prohibited.

Talis Information Ltd is a member of the Talis Group of companies and is 
registered in England No 3638278 with its registered office at Knights Court, 
Solihull Parkway, Birmingham Business Park, B37 7YB.

Re: [CODE4LIB] Library Linked Data

2009-10-29 Thread Richard Wallis


On 29 Oct 2009, at 11:02, O.Stephens wrote:


Could you elaborate a bit? In my mind, the only semantic web
technology of any note is linked data. How that fits into
library search is anyone's guess, and I'm wondering what,
specifically, you're referring to when you say that Talis is
active in this area.


I'm sure there are some Talis people out there who could give a  
fuller run down of what the company are doing in this area, but as  
examples:


Developing and supporting RDF Triple store technology (aka 'The  
Platform')

Exposing reading list information as RDF via their Aspire product
Contributing to relevant Ontologies - including discussions of/ 
contributions to BIBO, resourcelist and AIISO ontologies
The http://semanticlibrary.org/ demonstrator of use of RDF/Semantic  
tech/Linked data for library data




Well done Owen you saved me a few keystrokes there.

I would add to your list that the Software-as-a-Service semantic  
Platform [http://www.talis.com/platform] underpins the Talis Prism  
OPAC, Talis Aspire resource list system, and Talis Engage community  
information system.  All three providing live services for large  
public and academic libraries, demonstrating that these technologies  
can be robust, scalable and reliable.


Although not library focussed uses of the Platform, it is also being  
used to drive innovation by the BBC and data.gov.uk, in providing  
SPARQL query end-points on to publicly available data in RDF form.


There has been a recent thread on the NGC4LIB list, stimulated by an  
interview with Tim Berners-Lee [http://fora.tv/2009/10/08/Next_Decade_Technologies_Changing_the_World-Tim-Berners-Lee 
], that covers much around semantic web and libraries.  As is often  
the case on that list, the thread wanders far and wide (with the  
almost obligatory stop off in FRBR-land).  I have put together some of  
my thoughts from the issues raised by it, in a recent blog post Will  
Linked Data mean an early end for Marc  RDA [http://tinyurl.com/ykrnjph 
]


~Richard

Please consider the environment before printing this email.

Find out more about Talis at www.talis.com 


shared innovationTM

Any views or personal opinions expressed within this email may not be those of 
Talis Information Ltd or its employees. The content of this email message and 
any files that may be attached are confidential, and for the usage of the 
intended recipient only. If you are not the intended recipient, then please 
return this message to the sender and delete it. Any use of this e-mail by an 
unauthorised recipient is prohibited.

Talis Information Ltd is a member of the Talis Group of companies and is 
registered in England No 3638278 with its registered office at Knights Court, 
Solihull Parkway, Birmingham Business Park, B37 7YB.

[CODE4LIB] Code4lib mugs?

2008-11-03 Thread Richard Wallis

We at Talis will be, as previously, sponsoring Code4lib next year - we  
are just finalising the details with our bean counters at the moment.


In addition to that I would like to make a suggestion as to maybe  
adding to the memories of the event beyond the ritual wearing of the t- 
shirt upon returning to our home establishments.


The consumption of hot beverages I suspect is an almost universal  
trait of the code4liber as we try to get the latest bit of perl,  
python, ruby, html, or javascript to behave as we intended it.   
Wouldn't it be nice in those occasional dark and testing times to have  
a warm and comforting physical reminder that we are part of a global  
community of similar folks, focused on and passionate about technology  
 libraries.


So...

How about a Code4lib mug or similar beverage container.  Subject to  
the featuring of a simple Talis logo, we would be prepared to explore  
the possibility of funding the production of a Code4lib 2009 mug for  
every attendee, which could feature graphics such as the winning t- 
shirt design a Code4lib message, or logo if we have one in time.


Just a suggestion - what do folks think?

Richard Wallis, Talis.

Re: [CODE4LIB] Video encoding done - Mashup idea request

2007-03-16 Thread Richard Wallis

In principle that would be a better option.  In practice you need to be
really really sure that you can simply mix the two formats in to a final
output.

Mixing PC-video and camera-video successfully together has a reputation
for being difficult.  All to do with frame rates and interlacing modes
of which I have zero understanding.  The other issue being that modern
video projectors are very good at coping with having a multitude of PCs
 Macs plugged in to them,  working out the resolution etc. and
displaying a quality image.  I am sceptical of finding some video
editing software that is so forgiving.

If a mix of video and screen capture could be achieved it would be
great, but certainty of a successful result AND a fallback if it doesn't
work should be part of the plan.

RJW

 -Original Message-
 From: Code for Libraries [mailto:[EMAIL PROTECTED] On
 Behalf Of Smith,Devon
 Sent: 16 March 2007 20:39
 To: CODE4LIB@listserv.nd.edu
 Subject: Re: [CODE4LIB] Video encoding done - Mashup idea request

  2. Two camera's so the person at the podium can be composed inside
 their slides / demos.

 If I understand this, you're thinking that one camera would
 capture the speaker, and the other would capture the slide
 presentation.

 If it's possible, wouldn't it be better to put the slides
 directly into the video, rather than showing video of the
 projected slides?
 Screen capture could give the same option for demos, right?

 Or is that really tough?
 What I don't know about video capture/edit could just about
 fill the grand canyon.
 Given that, thanks to all the people who've pulled the video together.
 Community Effort++

 /dev



 -Original Message-
 From: Code for Libraries [mailto:[EMAIL PROTECTED] On
 Behalf Of Noel Peden
 Sent: Friday, March 16, 2007 12:02 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Video encoding done - Mashup idea request

 You are all certainly welcome.  I'm glad to be able to do the video.
 Thanks are due to Dan Scott and Karen Schneider for bringing
 their cameras and providing tapes.  Karen shot the first 1.5
 days of video too.  Ryan Eby has done pretty much all the
 work getting the files on Google and posting / embedding on
 code4lib.org.

 I expect to be there next year, and perhaps we can raise the
 bar a bit more (but not too much.)  Here are some ideas
 (suggestions are welcome):
 1. Wired microphone for consistent sound 2. Two camera's so
 the person at the podium can be composed inside their slides / demos.

 We'll wait for the year after to try out 'bullet time' Matrix
 style shots.  :)  Any other suggestions?

 Regards,
 Noel


The very latest from Talis
read the latest news at www.talis.com/news
listen to our podcasts www.talis.com/podcasts
see us at these events www.talis.com/events
join the discussion here www.talis.com/forums
join our developer community www.talis.com/tdn
and read our blogs www.talis.com/blogs


Any views or personal opinions expressed within this email may not be those of 
Talis Information Ltd. The content of this email message and any files that may 
be attached are confidential, and for the usage of the intended recipient only. 
If you are not the intended recipient, then please return this message to the 
sender and delete it. Any use of this e-mail by an unauthorised recipient is 
prohibited.


Talis Information Ltd is a member of the Talis Group of companies and is 
registered in England No 3638278 with its registered office at Knights Court, 
Solihull Parkway, Birmingham Business Park, B37 7YB.

43 matches

Mail list logo