[CODE4LIB] Code4Lib 2014 pre-conferences: please indicate interest on wiki

2013-12-09 Thread Emily Lynema
Hi folks,

We have [possibly?] a record number of pre-conference proposals up on the
wiki -- 19 by my count. Yay for the awesomeness of you all!

We haven't crunched the numbers on morning vs. afternoon scheduling and the
number of rooms available quite yet, but it would be really, super duper
helpful if folks who think they might want to attend a pre-conference could
take a quick look in the next couple of days and indicate interest. This is
by no means a binding signup (that will happen at time of registration),
but it will help us make sure that all pre-conferences with interest find a
room somewhere.

Sign up here!
http://wiki.code4lib.org/index.php/2014_preconference_proposals

thanks!
-emily

-- 
Emily Lynema
Associate Department Head
Information Technology, NCSU Libraries
919-513-8031
emily_lyn...@ncsu.edu


[CODE4LIB] Reminder - Code4Lib 2014 Conference Diversity Scholarship Application Due Dec 14

2013-12-09 Thread Bohyun Kim
*** Apologies for the cross-posting***

For the Code4Lib 2014 Conference, 9 scholarships have been sponsored
to promote diversity. http://code4lib.org/conference/2013/scholarship

CLIR/DLF has sponsored 5 scholarships, EBSCO has sponsored 2
scholarships, ProQuest has sponsored 1 full scholarship, and Sumana
Harihareswara has sponsored half a scholarship which was matched by
ProQuest. All sponsors have left it up to the discretion of the
Code4Lib 2014 Scholarship Committee for how to award these diversity
scholarships.

The Code4Lib Scholarship Committee will award 9 diversity scholarships
based on merit and need. Each scholarship will provide up to $1,000 to
cover travel costs and conference fees for a qualified attendee to
attend the 2014 Code4Lib Conference, which will be held in Raleigh,
North Carolina, from March 24 - 27, 2014.

CONFERENCE INFO

For more information on the Code4Lib Conference, please see the
conference website:
http://code4lib.org/conference/2014

You can see write-ups of previous Code4Lib Conferences:
http://journal.code4lib.org/articles/6848
http://journal.code4lib.org/articles/2717
http://journal.code4lib.org/articles/998
http://journal.code4lib.org/articles/72

CODE4LIB 2014 DIVERSITY SCHOLARSHIPS ELIGIBILITY, CRITERIA, AND REQUIREMENTS

To qualify for a scholarship, an applicant must be interested in
actively contributing to the mission and goals of the Code4Lib
Conference.

- Four scholarships will be awarded to any woman or transgendered person.
- Four scholarships will be awarded to any person of Hispanic or
Latino, Black or African-American, Asian, Native Hawaiian or Pacific
Islander, or American Indian or Alaskan Native descent.
- One scholarship will be awarded to the best remaining candidate who
meets any of the previously mentioned eligibility requirements.

Eligible applicants may apply based on multiple criteria, but no
applicant will receive more than one scholarship. Past winners of any
Code4Lib scholarship are not eligible for a scholarship.

The scholarship recipients will be selected based upon their merit and
financial needs.

Scholarship recipients are required to write and submit a brief trip
report to the Code4Lib 2014 Scholarships Committee by April 1, 2014 to
be posted to the Code4Lib wiki. The report should address: (a) what
kind of experience they had at the conference, (b) what they have
learned, (c) what suggestions they have for future attendees and
conference organizers.

All reimbursement forms and receipts must be received by May 26, 2014.

HOW TO APPLY

To apply, please send an email to Jason Ronallo (jrona...@gmail.com)
with the subject heading “Code4Lib 2014 Diversity Scholarship
Application” containing the following (combined into a single attached
PDF, if possible):

1. A brief letter of interest, which:
- Identifies your eligibility for a diversity scholarship
- Describes your interest in the conference and how you intend to
participate
- Discusses your merit and needs for the scholarship
2. A résumé or CV
3. Contact information for two professional or academic references

The application deadline is Dec. 13, 2013, 5pm EST. The scholarship
committee will notify successful candidates the week of Jan. 6, 2013.

SPONSORS

We would like to thank our sponsors for supporting the Code4Lib 2014
Diversity Scholarships.

Council on Library and Information Resources http://www.clir.org/
Digital Library Federation http://www.diglib.org/
EBSCO http://www.ebsco.com/
ProQuest http://www.proquest.com
Sumana Harihareswara http://www.harihareswara.net/


Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
Challenge accepted.

http://library.ucmo.edu/dev/metadata-generator.php

Obviously in the prototype phase, but it works. Only MODS is available for
now, and you can only select top-level elements (all child elements of the
top-level selections will be auto-generated). I will try to expand it to
more than just MODS. Admittedly, I know very little about METS, so I will
need some assistance if I am going to make one of those.

I'll eventually host this somewhere else once it's done, so don't bookmark
it.

Josh Welker
Information Technology Librarian
James C. Kirkpatrick Library
University of Central Missouri
Warrensburg, MO 64093
JCKL 2260
660.543.8022

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Kevin S. Clarke
Sent: Sunday, December 08, 2013 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

When I first read this, I was imagining not having to give it your metadata
but native support for most of our commonly used metadata records... so the
interface is: Give me 100 MODS records and it spits that out... You could
get fancy and say, Give me X number of METS records that wrap TIFFs and
JPGs and that uses MODS, etc.  That's not as trivial as hooking into an
lorem ipsum machine, but it'd be pretty cool, imho.

Kevin


On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I asked this on Google Plus earlier today, but I figured I'd
 better take this question here: my brain is trying to tell me that
 there's a service or app that makes fake metadata, kind of like
 Lorem Ipsum but you feed it your fields and it gives you nonsense
 metadata back. But, it looks right enough for testing. Yesterday, I
 had to make up about 50 rows of fake metadata to test some code that
 handles paging in a UI, and I had to make it all up by hand. This
 hurts my soul. Someone please tell me such a service exists, and link
 me to it, so I never have to do this again. Or else, I may just make
 such a service, to save us all. But I don't want to go coding some new
 service if it already exists, because that sort of thing is for chumps.


 --
 HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
 Library Systems http://lso.umsystem.edu/~pottingerhj/
 https://MOspace.umsystem.edu/
 Making things that are beautiful is real fun. --Lou Reed



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Richard Sarvas
I've had good luck using both the Data::Faker and Text::Lorem Perl modules to 
generate large amounts (30k+ rows) of Archivists Toolkit test data. Other ports 
of Data::Faker would probably work just as well, though it needs a bit more 
code to more than generate more than name, address and contact info. At the 
time I was mostly just generating person data for the AT names table but I had 
considered one day extending the code so that more detailed person data could 
be created but I never got around to it. It never occurred to me that there 
might actually be a need for something along the lines of a scholarly NPC 
generator. 


Rick

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Sean 
Hannan
Sent: Sunday, December 08, 2013 7:00 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

In ruby, there's the ffaker gem (https://github.com/EmmanuelOga/ffaker), which 
itself is a port of Perl's Data::Faker. 

-Sean

From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] on behalf of Pottinger, 
Hardy J. [pottinge...@missouri.edu]
Sent: Saturday, December 07, 2013 11:51 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Hi, I asked this on Google Plus earlier today, but I figured I'd better take 
this question here: my brain is trying to tell me that there's a service or app 
that makes fake metadata, kind of like Lorem Ipsum but you feed it your 
fields and it gives you nonsense metadata back. But, it looks right enough for 
testing. Yesterday, I had to make up about 50 rows of fake metadata to test 
some code that handles paging in a UI, and I had to make it all up by hand. 
This hurts my soul. Someone please tell me such a service exists, and link me 
to it, so I never have to do this again. Or else, I may just make such a 
service, to save us all. But I don't want to go coding some new service if it 
already exists, because that sort of thing is for chumps.


--
HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library 
Systems http://lso.umsystem.edu/~pottingerhj/
https://MOspace.umsystem.edu/
Making things that are beautiful is real fun. --Lou Reed


Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Kevin S. Clarke
Nice!  Are you going to put the code on GitHub (or some such place)?  I'd
be interested in tracking...

Kevin


On Mon, Dec 9, 2013 at 11:27 AM, Joshua Welker wel...@ucmo.edu wrote:

 Challenge accepted.

 http://library.ucmo.edu/dev/metadata-generator.php

 Obviously in the prototype phase, but it works. Only MODS is available for
 now, and you can only select top-level elements (all child elements of the
 top-level selections will be auto-generated). I will try to expand it to
 more than just MODS. Admittedly, I know very little about METS, so I will
 need some assistance if I am going to make one of those.

 I'll eventually host this somewhere else once it's done, so don't bookmark
 it.

 Josh Welker
 Information Technology Librarian
 James C. Kirkpatrick Library
 University of Central Missouri
 Warrensburg, MO 64093
 JCKL 2260
 660.543.8022

 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
 Kevin S. Clarke
 Sent: Sunday, December 08, 2013 12:26 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

 When I first read this, I was imagining not having to give it your metadata
 but native support for most of our commonly used metadata records... so the
 interface is: Give me 100 MODS records and it spits that out... You could
 get fancy and say, Give me X number of METS records that wrap TIFFs and
 JPGs and that uses MODS, etc.  That's not as trivial as hooking into an
 lorem ipsum machine, but it'd be pretty cool, imho.

 Kevin


 On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
 pottinge...@missouri.edu wrote:

  Hi, I asked this on Google Plus earlier today, but I figured I'd
  better take this question here: my brain is trying to tell me that
  there's a service or app that makes fake metadata, kind of like
  Lorem Ipsum but you feed it your fields and it gives you nonsense
  metadata back. But, it looks right enough for testing. Yesterday, I
  had to make up about 50 rows of fake metadata to test some code that
  handles paging in a UI, and I had to make it all up by hand. This
  hurts my soul. Someone please tell me such a service exists, and link
  me to it, so I never have to do this again. Or else, I may just make
  such a service, to save us all. But I don't want to go coding some new
  service if it already exists, because that sort of thing is for chumps.
 
 
  --
  HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
  Library Systems http://lso.umsystem.edu/~pottingerhj/
  https://MOspace.umsystem.edu/
  Making things that are beautiful is real fun. --Lou Reed
 



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
Sure. It's not a fancy reusable class or anything though--just a simple PHP
script. I will try to put it up sometime today or tomorrow and will share a
link.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Kevin S. Clarke
Sent: Monday, December 09, 2013 10:34 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Nice!  Are you going to put the code on GitHub (or some such place)?  I'd be
interested in tracking...

Kevin


On Mon, Dec 9, 2013 at 11:27 AM, Joshua Welker wel...@ucmo.edu wrote:

 Challenge accepted.

 http://library.ucmo.edu/dev/metadata-generator.php

 Obviously in the prototype phase, but it works. Only MODS is available
 for now, and you can only select top-level elements (all child
 elements of the top-level selections will be auto-generated). I will
 try to expand it to more than just MODS. Admittedly, I know very
 little about METS, so I will need some assistance if I am going to make
 one of those.

 I'll eventually host this somewhere else once it's done, so don't
 bookmark it.

 Josh Welker
 Information Technology Librarian
 James C. Kirkpatrick Library
 University of Central Missouri
 Warrensburg, MO 64093
 JCKL 2260
 660.543.8022

 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of Kevin S. Clarke
 Sent: Sunday, December 08, 2013 12:26 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

 When I first read this, I was imagining not having to give it your
 metadata but native support for most of our commonly used metadata
 records... so the interface is: Give me 100 MODS records and it
 spits that out... You could get fancy and say, Give me X number of
 METS records that wrap TIFFs and JPGs and that uses MODS, etc.
 That's not as trivial as hooking into an lorem ipsum machine, but it'd be
 pretty cool, imho.

 Kevin


 On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
 pottinge...@missouri.edu wrote:

  Hi, I asked this on Google Plus earlier today, but I figured I'd
  better take this question here: my brain is trying to tell me that
  there's a service or app that makes fake metadata, kind of like
  Lorem Ipsum but you feed it your fields and it gives you nonsense
  metadata back. But, it looks right enough for testing. Yesterday, I
  had to make up about 50 rows of fake metadata to test some code that
  handles paging in a UI, and I had to make it all up by hand. This
  hurts my soul. Someone please tell me such a service exists, and
  link me to it, so I never have to do this again. Or else, I may just
  make such a service, to save us all. But I don't want to go coding
  some new service if it already exists, because that sort of thing is for
  chumps.
 
 
  --
  HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
  Library Systems http://lso.umsystem.edu/~pottingerhj/
  https://MOspace.umsystem.edu/
  Making things that are beautiful is real fun. --Lou Reed
 



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Kevin S. Clarke
Cool!

Thanks, Kevin


On Mon, Dec 9, 2013 at 11:37 AM, Joshua Welker wel...@ucmo.edu wrote:

 Sure. It's not a fancy reusable class or anything though--just a simple PHP
 script. I will try to put it up sometime today or tomorrow and will share a
 link.

 Josh Welker


 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
 Kevin S. Clarke
 Sent: Monday, December 09, 2013 10:34 AM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

 Nice!  Are you going to put the code on GitHub (or some such place)?  I'd
 be
 interested in tracking...

 Kevin


 On Mon, Dec 9, 2013 at 11:27 AM, Joshua Welker wel...@ucmo.edu wrote:

  Challenge accepted.
 
  http://library.ucmo.edu/dev/metadata-generator.php
 
  Obviously in the prototype phase, but it works. Only MODS is available
  for now, and you can only select top-level elements (all child
  elements of the top-level selections will be auto-generated). I will
  try to expand it to more than just MODS. Admittedly, I know very
  little about METS, so I will need some assistance if I am going to make
  one of those.
 
  I'll eventually host this somewhere else once it's done, so don't
  bookmark it.
 
  Josh Welker
  Information Technology Librarian
  James C. Kirkpatrick Library
  University of Central Missouri
  Warrensburg, MO 64093
  JCKL 2260
  660.543.8022
 
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
  Of Kevin S. Clarke
  Sent: Sunday, December 08, 2013 12:26 PM
  To: CODE4LIB@LISTSERV.ND.EDU
  Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
 
  When I first read this, I was imagining not having to give it your
  metadata but native support for most of our commonly used metadata
  records... so the interface is: Give me 100 MODS records and it
  spits that out... You could get fancy and say, Give me X number of
  METS records that wrap TIFFs and JPGs and that uses MODS, etc.
  That's not as trivial as hooking into an lorem ipsum machine, but it'd be
  pretty cool, imho.
 
  Kevin
 
 
  On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
  pottinge...@missouri.edu wrote:
 
   Hi, I asked this on Google Plus earlier today, but I figured I'd
   better take this question here: my brain is trying to tell me that
   there's a service or app that makes fake metadata, kind of like
   Lorem Ipsum but you feed it your fields and it gives you nonsense
   metadata back. But, it looks right enough for testing. Yesterday, I
   had to make up about 50 rows of fake metadata to test some code that
   handles paging in a UI, and I had to make it all up by hand. This
   hurts my soul. Someone please tell me such a service exists, and
   link me to it, so I never have to do this again. Or else, I may just
   make such a service, to save us all. But I don't want to go coding
   some new service if it already exists, because that sort of thing is
 for
   chumps.
  
  
   --
   HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
   Library Systems http://lso.umsystem.edu/~pottingerhj/
   https://MOspace.umsystem.edu/
   Making things that are beautiful is real fun. --Lou Reed
  
 



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Ben Companjen
Cool!
My first thought on this topic was: give the program an XML schema, and
generate possible documents with the correct datatypes etc. (Something
like that must exist somewhere, right?) Does it happen to work anything
like that, or is it hardcoded to generate these specific elements?

Ben

On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:

Challenge accepted.

http://library.ucmo.edu/dev/metadata-generator.php

Obviously in the prototype phase, but it works. Only MODS is available for
now, and you can only select top-level elements (all child elements of the
top-level selections will be auto-generated). I will try to expand it to
more than just MODS. Admittedly, I know very little about METS, so I will
need some assistance if I am going to make one of those.

I'll eventually host this somewhere else once it's done, so don't bookmark
it.

Josh Welker
Information Technology Librarian
James C. Kirkpatrick Library
University of Central Missouri
Warrensburg, MO 64093
JCKL 2260
660.543.8022

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Kevin S. Clarke
Sent: Sunday, December 08, 2013 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

When I first read this, I was imagining not having to give it your
metadata
but native support for most of our commonly used metadata records... so
the
interface is: Give me 100 MODS records and it spits that out... You
could
get fancy and say, Give me X number of METS records that wrap TIFFs and
JPGs and that uses MODS, etc.  That's not as trivial as hooking into an
lorem ipsum machine, but it'd be pretty cool, imho.

Kevin


On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I asked this on Google Plus earlier today, but I figured I'd
 better take this question here: my brain is trying to tell me that
 there's a service or app that makes fake metadata, kind of like
 Lorem Ipsum but you feed it your fields and it gives you nonsense
 metadata back. But, it looks right enough for testing. Yesterday, I
 had to make up about 50 rows of fake metadata to test some code that
 handles paging in a UI, and I had to make it all up by hand. This
 hurts my soul. Someone please tell me such a service exists, and link
 me to it, so I never have to do this again. Or else, I may just make
 such a service, to save us all. But I don't want to go coding some new
 service if it already exists, because that sort of thing is for chumps.


 --
 HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
 Library Systems http://lso.umsystem.edu/~pottingerhj/
 https://MOspace.umsystem.edu/
 Making things that are beautiful is real fun. --Lou Reed



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
It's hard-coded to generate the specific elements. But your way sounds a lot
cleaner, so I might try to do that instead :) It will be more difficult
initially but much easier once I start implementing other metadata formats.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben
Companjen
Sent: Monday, December 09, 2013 10:52 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Cool!
My first thought on this topic was: give the program an XML schema, and
generate possible documents with the correct datatypes etc. (Something like
that must exist somewhere, right?) Does it happen to work anything like
that, or is it hardcoded to generate these specific elements?

Ben

On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:

Challenge accepted.

http://library.ucmo.edu/dev/metadata-generator.php

Obviously in the prototype phase, but it works. Only MODS is available
for now, and you can only select top-level elements (all child elements
of the top-level selections will be auto-generated). I will try to
expand it to more than just MODS. Admittedly, I know very little about
METS, so I will need some assistance if I am going to make one of those.

I'll eventually host this somewhere else once it's done, so don't
bookmark it.

Josh Welker
Information Technology Librarian
James C. Kirkpatrick Library
University of Central Missouri
Warrensburg, MO 64093
JCKL 2260
660.543.8022

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Kevin S. Clarke
Sent: Sunday, December 08, 2013 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

When I first read this, I was imagining not having to give it your
metadata but native support for most of our commonly used metadata
records... so the interface is: Give me 100 MODS records and it spits
that out... You could get fancy and say, Give me X number of METS
records that wrap TIFFs and JPGs and that uses MODS, etc.  That's not
as trivial as hooking into an lorem ipsum machine, but it'd be pretty
cool, imho.

Kevin


On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I asked this on Google Plus earlier today, but I figured I'd
 better take this question here: my brain is trying to tell me that
 there's a service or app that makes fake metadata, kind of like
 Lorem Ipsum but you feed it your fields and it gives you nonsense
 metadata back. But, it looks right enough for testing. Yesterday, I
 had to make up about 50 rows of fake metadata to test some code that
 handles paging in a UI, and I had to make it all up by hand. This
 hurts my soul. Someone please tell me such a service exists, and link
 me to it, so I never have to do this again. Or else, I may just make
 such a service, to save us all. But I don't want to go coding some
 new service if it already exists, because that sort of thing is for
 chumps.


 --
 HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
 Library Systems http://lso.umsystem.edu/~pottingerhj/
 https://MOspace.umsystem.edu/
 Making things that are beautiful is real fun. --Lou Reed



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Ben Companjen
Hi Josh,

Before you start coding:
http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an
DTD or XSD file. First try with the EAC XSD shows I need to try other
options, but it's promising.

(It's still an interesting problem to try to tackle yourself, of course.)

Ben

On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote:

It's hard-coded to generate the specific elements. But your way sounds a
lot
cleaner, so I might try to do that instead :) It will be more difficult
initially but much easier once I start implementing other metadata
formats.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Ben
Companjen
Sent: Monday, December 09, 2013 10:52 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Cool!
My first thought on this topic was: give the program an XML schema, and
generate possible documents with the correct datatypes etc. (Something
like
that must exist somewhere, right?) Does it happen to work anything like
that, or is it hardcoded to generate these specific elements?

Ben

On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:

Challenge accepted.

http://library.ucmo.edu/dev/metadata-generator.php

Obviously in the prototype phase, but it works. Only MODS is available
for now, and you can only select top-level elements (all child elements
of the top-level selections will be auto-generated). I will try to
expand it to more than just MODS. Admittedly, I know very little about
METS, so I will need some assistance if I am going to make one of those.

I'll eventually host this somewhere else once it's done, so don't
bookmark it.

Josh Welker
Information Technology Librarian
James C. Kirkpatrick Library
University of Central Missouri
Warrensburg, MO 64093
JCKL 2260
660.543.8022

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Kevin S. Clarke
Sent: Sunday, December 08, 2013 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

When I first read this, I was imagining not having to give it your
metadata but native support for most of our commonly used metadata
records... so the interface is: Give me 100 MODS records and it spits
that out... You could get fancy and say, Give me X number of METS
records that wrap TIFFs and JPGs and that uses MODS, etc.  That's not
as trivial as hooking into an lorem ipsum machine, but it'd be pretty
cool, imho.

Kevin


On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I asked this on Google Plus earlier today, but I figured I'd
 better take this question here: my brain is trying to tell me that
 there's a service or app that makes fake metadata, kind of like
 Lorem Ipsum but you feed it your fields and it gives you nonsense
 metadata back. But, it looks right enough for testing. Yesterday, I
 had to make up about 50 rows of fake metadata to test some code that
 handles paging in a UI, and I had to make it all up by hand. This
 hurts my soul. Someone please tell me such a service exists, and link
 me to it, so I never have to do this again. Or else, I may just make
 such a service, to save us all. But I don't want to go coding some
 new service if it already exists, because that sort of thing is for
 chumps.


 --
 HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
 Library Systems http://lso.umsystem.edu/~pottingerhj/
 https://MOspace.umsystem.edu/
 Making things that are beautiful is real fun. --Lou Reed



Re: [CODE4LIB] Reminder - Code4Lib 2014 Conference Diversity Scholarship Application Due Dec 14

2013-12-09 Thread Bohyun Kim
*** Correcting the typos ***

Please see this page: http://code4lib.org/node/491
(This was for the past 2013 conference. 
http://code4lib.org/conference/2013/scholarship.  )

The application deadline is Dec. 13, 2013, 5pm EST. The scholarship
committee will notify successful candidates the week of Jan. 6, 2014.




From: Bohyun Kim
Sent: Monday, December 09, 2013 10:20 AM
To: Code for Libraries; lit...@ala.org
Subject: Reminder - Code4Lib 2014 Conference Diversity Scholarship Application 
Due Dec 14

*** Apologies for the cross-posting***

For the Code4Lib 2014 Conference, 9 scholarships have been sponsored
to promote diversity. http://code4lib.org/conference/2013/scholarship

CLIR/DLF has sponsored 5 scholarships, EBSCO has sponsored 2
scholarships, ProQuest has sponsored 1 full scholarship, and Sumana
Harihareswara has sponsored half a scholarship which was matched by
ProQuest. All sponsors have left it up to the discretion of the
Code4Lib 2014 Scholarship Committee for how to award these diversity
scholarships.

The Code4Lib Scholarship Committee will award 9 diversity scholarships
based on merit and need. Each scholarship will provide up to $1,000 to
cover travel costs and conference fees for a qualified attendee to
attend the 2014 Code4Lib Conference, which will be held in Raleigh,
North Carolina, from March 24 - 27, 2014.

CONFERENCE INFO

For more information on the Code4Lib Conference, please see the
conference website:
http://code4lib.org/conference/2014

You can see write-ups of previous Code4Lib Conferences:
http://journal.code4lib.org/articles/6848
http://journal.code4lib.org/articles/2717
http://journal.code4lib.org/articles/998
http://journal.code4lib.org/articles/72

CODE4LIB 2014 DIVERSITY SCHOLARSHIPS ELIGIBILITY, CRITERIA, AND REQUIREMENTS

To qualify for a scholarship, an applicant must be interested in
actively contributing to the mission and goals of the Code4Lib
Conference.

- Four scholarships will be awarded to any woman or transgendered person.
- Four scholarships will be awarded to any person of Hispanic or
Latino, Black or African-American, Asian, Native Hawaiian or Pacific
Islander, or American Indian or Alaskan Native descent.
- One scholarship will be awarded to the best remaining candidate who
meets any of the previously mentioned eligibility requirements.

Eligible applicants may apply based on multiple criteria, but no
applicant will receive more than one scholarship. Past winners of any
Code4Lib scholarship are not eligible for a scholarship.

The scholarship recipients will be selected based upon their merit and
financial needs.

Scholarship recipients are required to write and submit a brief trip
report to the Code4Lib 2014 Scholarships Committee by April 1, 2014 to
be posted to the Code4Lib wiki. The report should address: (a) what
kind of experience they had at the conference, (b) what they have
learned, (c) what suggestions they have for future attendees and
conference organizers.

All reimbursement forms and receipts must be received by May 26, 2014.

HOW TO APPLY

To apply, please send an email to Jason Ronallo (jrona...@gmail.com)
with the subject heading “Code4Lib 2014 Diversity Scholarship
Application” containing the following (combined into a single attached
PDF, if possible):

1. A brief letter of interest, which:
- Identifies your eligibility for a diversity scholarship
- Describes your interest in the conference and how you intend to
participate
- Discusses your merit and needs for the scholarship
2. A résumé or CV
3. Contact information for two professional or academic references

The application deadline is Dec. 13, 2013, 5pm EST. The scholarship
committee will notify successful candidates the week of Jan. 6, 2013.

SPONSORS

We would like to thank our sponsors for supporting the Code4Lib 2014
Diversity Scholarships.

Council on Library and Information Resources http://www.clir.org/
Digital Library Federation http://www.diglib.org/
EBSCO http://www.ebsco.com/
ProQuest http://www.proquest.com
Sumana Harihareswara http://www.harihareswara.net/


[CODE4LIB] 2 interesting work opportunities in Chattanooga TN

2013-12-09 Thread Nate Hill
Hello friends,
Want to work for the Mozilla Foundation and collaborate with our awesome
public library here in Chattanooga?
Check out these two opportunities.
Documentation  Design Coordinator, Mozilla Gigabit Community
Fundhttps://careers.mozilla.org/en-US/position/obX7XfwW
and
Community Catalyst,
Chattanoogahttps://careers.mozilla.org/en-US/position/o6X7XfwR

Cheers
N


Nate Hill
nathanielh...@gmail.com
http://4thfloor.chattlibrary.org/
http://www.natehill.net


Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Joshua Welker
I checked out the Eclipse option and was not able to get much use out of it.
Maybe someone else will have better luck? It doesn't seem to align very well
with a library use case.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben
Companjen
Sent: Monday, December 09, 2013 11:14 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Hi Josh,

Before you start coding:
http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD
or XSD file. First try with the EAC XSD shows I need to try other options,
but it's promising.

(It's still an interesting problem to try to tackle yourself, of course.)

Ben

On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote:

It's hard-coded to generate the specific elements. But your way sounds
a lot cleaner, so I might try to do that instead :) It will be more
difficult initially but much easier once I start implementing other
metadata formats.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Ben Companjen
Sent: Monday, December 09, 2013 10:52 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Cool!
My first thought on this topic was: give the program an XML schema, and
generate possible documents with the correct datatypes etc. (Something
like that must exist somewhere, right?) Does it happen to work anything
like that, or is it hardcoded to generate these specific elements?

Ben

On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:

Challenge accepted.

http://library.ucmo.edu/dev/metadata-generator.php

Obviously in the prototype phase, but it works. Only MODS is available
for now, and you can only select top-level elements (all child
elements of the top-level selections will be auto-generated). I will
try to expand it to more than just MODS. Admittedly, I know very
little about METS, so I will need some assistance if I am going to make
one of those.

I'll eventually host this somewhere else once it's done, so don't
bookmark it.

Josh Welker
Information Technology Librarian
James C. Kirkpatrick Library
University of Central Missouri
Warrensburg, MO 64093
JCKL 2260
660.543.8022

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
Of Kevin S. Clarke
Sent: Sunday, December 08, 2013 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

When I first read this, I was imagining not having to give it your
metadata but native support for most of our commonly used metadata
records... so the interface is: Give me 100 MODS records and it
spits that out... You could get fancy and say, Give me X number of
METS records that wrap TIFFs and JPGs and that uses MODS, etc.
That's not as trivial as hooking into an lorem ipsum machine, but it'd
be pretty cool, imho.

Kevin


On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I asked this on Google Plus earlier today, but I figured I'd
 better take this question here: my brain is trying to tell me that
 there's a service or app that makes fake metadata, kind of like
 Lorem Ipsum but you feed it your fields and it gives you nonsense
 metadata back. But, it looks right enough for testing. Yesterday, I
 had to make up about 50 rows of fake metadata to test some code that
 handles paging in a UI, and I had to make it all up by hand. This
 hurts my soul. Someone please tell me such a service exists, and
 link me to it, so I never have to do this again. Or else, I may just
 make such a service, to save us all. But I don't want to go coding
 some new service if it already exists, because that sort of thing is
 for chumps.


 --
 HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
 Library Systems http://lso.umsystem.edu/~pottingerhj/
 https://MOspace.umsystem.edu/
 Making things that are beautiful is real fun. --Lou Reed



[CODE4LIB] problem in old etd xml files

2013-12-09 Thread Robertson, Wendy C
Back in 1999-2002 a handful of our theses were submitted  as a collection of 
xml files.  We posted the files in our repository several years ago (we posted 
a zipped folder with all the files).  At that time, if you opened front.xml you 
would be able to access the thesis. We have not touched the files in the close 
to 5 years since we posted them, but the files no longer open correctly. One of 
the problem theses is http://ir.uiowa.edu/etd/189/.

Front.xml begins
?xml version=1.0 encoding=UTF-8?
?xml:stylesheet type=text/css href=UIowa2K1.css ?
!DOCTYPE thesis SYSTEM UIowa2K.dtd

I have tried the following changes but they do not help

1)  Adding standalone=no? to the xml declaration  -- ?xml version=1.0 
 encoding=UTF-8 standalone=no?

2)  Changing the case of UIowa2K1.css and UIowa2K.dtd to match the 
files (which are in all caps)

3)  Changing xml:stylesheet to xml-stylesheet

Chrome shows errors that entities are not defined, but they are defined in the 
dtd.

I would appreciate any assistance in making these documents available again. 
Thanks!

Wendy Robertson
Digital Scholarship Librarian *  The University of Iowa Libraries
1015 Main Library  *  Iowa City, Iowa 52242
wendy-robert...@uiowa.edu * 319-335-5821


Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Ron Gilmour
DTDs and XML namespaces don't like each other very much. I think you're
getting into trouble because your DTD doesn't allow the two
namespace-declaring attributes on the thesis element. Try adding this to
your DTD:

!ATTLIST thesis xmlns:xhtml CDATA #FIXED 'http://www.w3.org/1999/xhtml'
 xmlns:html CDATA #FIXED 'http://www.w3.org/TR/REC-html40'

You're still going to be faced with a number of validity errors, but I
think most of them are self-explanatory (e.g., you have multiple
linebreak elements where your DTD only allows one). Some of these have to
do with validity against your DTD and others are related to HTML validity.

Ron Gilmour
Web Services Librarian
Ithaca College Library



On Mon, Dec 9, 2013 at 3:10 PM, Robertson, Wendy C 
wendy-robert...@uiowa.edu wrote:

 Back in 1999-2002 a handful of our theses were submitted  as a collection
 of xml files.  We posted the files in our repository several years ago (we
 posted a zipped folder with all the files).  At that time, if you opened
 front.xml you would be able to access the thesis. We have not touched the
 files in the close to 5 years since we posted them, but the files no longer
 open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/.

 Front.xml begins
 ?xml version=1.0 encoding=UTF-8?
 ?xml:stylesheet type=text/css href=UIowa2K1.css ?
 !DOCTYPE thesis SYSTEM UIowa2K.dtd

 I have tried the following changes but they do not help

 1)  Adding standalone=no? to the xml declaration  -- ?xml
 version=1.0  encoding=UTF-8 standalone=no?

 2)  Changing the case of UIowa2K1.css and UIowa2K.dtd to match the
 files (which are in all caps)

 3)  Changing xml:stylesheet to xml-stylesheet

 Chrome shows errors that entities are not defined, but they are defined in
 the dtd.

 I would appreciate any assistance in making these documents available
 again. Thanks!

 Wendy Robertson
 Digital Scholarship Librarian *  The University of Iowa Libraries
 1015 Main Library  *  Iowa City, Iowa 52242
 wendy-robert...@uiowa.edu * 319-335-5821



Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jon Gorman
A lot of modern systems won't load entities (or will limit it somehow)
because of the denial of service attack that is possible.  Look for XML
Entity Reference Denial of Service. I can't remember if Public declarations
are treated any differently than System ones. (I would have suspected it to
trust SYSTEM ones more, but they'd still be exploitable by the same bug).


(There's also a fair number of other errors, I'm somewhat skeptical that
the example worked on many browsers even then. It's possible IE was
flexible enough it would have worked).

One thing you might want to do is is take out the entities.

I can't remember why I had to do this, but xmllint seemed to do the trick.
( I found a snippet at
http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-references-in-xml-and-create-a-new-xml-in-c,
but it' smissing the necessary --loaddtd)

xmllint --loaddtd --noent --dropdtd FRONT.xml  FRONT_nodtdent.xml

I mean, you don't need the dtd for validation, particularly since I suspect
given the errors it may not validate anyhow.

It might make the files a little harder to read when reading the raw
source, but I suspect that's not typically a problem.

Jon Gorman
University of Illinois



On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C 
wendy-robert...@uiowa.edu wrote:

 Back in 1999-2002 a handful of our theses were submitted  as a collection
 of xml files.  We posted the files in our repository several years ago (we
 posted a zipped folder with all the files).  At that time, if you opened
 front.xml you would be able to access the thesis. We have not touched the
 files in the close to 5 years since we posted them, but the files no longer
 open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/.

 Front.xml begins
 ?xml version=1.0 encoding=UTF-8?
 ?xml:stylesheet type=text/css href=UIowa2K1.css ?
 !DOCTYPE thesis SYSTEM UIowa2K.dtd

 I have tried the following changes but they do not help

 1)  Adding standalone=no? to the xml declaration  -- ?xml
 version=1.0  encoding=UTF-8 standalone=no?

 2)  Changing the case of UIowa2K1.css and UIowa2K.dtd to match the
 files (which are in all caps)

 3)  Changing xml:stylesheet to xml-stylesheet

 Chrome shows errors that entities are not defined, but they are defined in
 the dtd.

 I would appreciate any assistance in making these documents available
 again. Thanks!

 Wendy Robertson
 Digital Scholarship Librarian *  The University of Iowa Libraries
 1015 Main Library  *  Iowa City, Iowa 52242
 wendy-robert...@uiowa.edu * 319-335-5821



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Peter Binkley
Years ago Bill Moen had a set of radioactive MARC records with unique
tokens in all fields, to test Z39.50 retrieval. I don't know whether they
were ever released anywhere, but I see the specs are here:

http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/

Peter


Peter Binkley
Digital Initiatives Technology Librarian
Information Technology Services
peter.bink...@ualberta.ca

2-10K Cameron Library
University of Alberta
Edmonton, Alberta
Canada T6G 2J8

phone 780-492-3743
fax 780-492-9243


On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu wrote:

 I checked out the Eclipse option and was not able to get much use out of
 it.
 Maybe someone else will have better luck? It doesn't seem to align very
 well
 with a library use case.

 Josh Welker


 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
 Ben
 Companjen
 Sent: Monday, December 09, 2013 11:14 AM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

 Hi Josh,

 Before you start coding:
 http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
 ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an
 DTD
 or XSD file. First try with the EAC XSD shows I need to try other options,
 but it's promising.

 (It's still an interesting problem to try to tackle yourself, of course.)

 Ben

 On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote:

 It's hard-coded to generate the specific elements. But your way sounds
 a lot cleaner, so I might try to do that instead :) It will be more
 difficult initially but much easier once I start implementing other
 metadata formats.
 
 Josh Welker
 
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
 Ben Companjen
 Sent: Monday, December 09, 2013 10:52 AM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
 
 Cool!
 My first thought on this topic was: give the program an XML schema, and
 generate possible documents with the correct datatypes etc. (Something
 like that must exist somewhere, right?) Does it happen to work anything
 like that, or is it hardcoded to generate these specific elements?
 
 Ben
 
 On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:
 
 Challenge accepted.
 
 http://library.ucmo.edu/dev/metadata-generator.php
 
 Obviously in the prototype phase, but it works. Only MODS is available
 for now, and you can only select top-level elements (all child
 elements of the top-level selections will be auto-generated). I will
 try to expand it to more than just MODS. Admittedly, I know very
 little about METS, so I will need some assistance if I am going to make
 one of those.
 
 I'll eventually host this somewhere else once it's done, so don't
 bookmark it.
 
 Josh Welker
 Information Technology Librarian
 James C. Kirkpatrick Library
 University of Central Missouri
 Warrensburg, MO 64093
 JCKL 2260
 660.543.8022
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of Kevin S. Clarke
 Sent: Sunday, December 08, 2013 12:26 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
 
 When I first read this, I was imagining not having to give it your
 metadata but native support for most of our commonly used metadata
 records... so the interface is: Give me 100 MODS records and it
 spits that out... You could get fancy and say, Give me X number of
 METS records that wrap TIFFs and JPGs and that uses MODS, etc.
 That's not as trivial as hooking into an lorem ipsum machine, but it'd
 be pretty cool, imho.
 
 Kevin
 
 
 On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
 pottinge...@missouri.edu wrote:
 
  Hi, I asked this on Google Plus earlier today, but I figured I'd
  better take this question here: my brain is trying to tell me that
  there's a service or app that makes fake metadata, kind of like
  Lorem Ipsum but you feed it your fields and it gives you nonsense
  metadata back. But, it looks right enough for testing. Yesterday, I
  had to make up about 50 rows of fake metadata to test some code that
  handles paging in a UI, and I had to make it all up by hand. This
  hurts my soul. Someone please tell me such a service exists, and
  link me to it, so I never have to do this again. Or else, I may just
  make such a service, to save us all. But I don't want to go coding
  some new service if it already exists, because that sort of thing is
  for chumps.
 
 
  --
  HARDY POTTINGER pottinge...@umsystem.edu University of Missouri
  Library Systems http://lso.umsystem.edu/~pottingerhj/
  https://MOspace.umsystem.edu/
  Making things that are beautiful is real fun. --Lou Reed
 




Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Roy Tennant
I can't help wondering what the half-life of a radioactive MARC record is.
My guess is it is either really, really short or really, really long. ;-)
Roy


On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.cawrote:

 Years ago Bill Moen had a set of radioactive MARC records with unique
 tokens in all fields, to test Z39.50 retrieval. I don't know whether they
 were ever released anywhere, but I see the specs are here:

 http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/

 Peter


 Peter Binkley
 Digital Initiatives Technology Librarian
 Information Technology Services
 peter.bink...@ualberta.ca

 2-10K Cameron Library
 University of Alberta
 Edmonton, Alberta
 Canada T6G 2J8

 phone 780-492-3743
 fax 780-492-9243


 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu wrote:

  I checked out the Eclipse option and was not able to get much use out of
  it.
  Maybe someone else will have better luck? It doesn't seem to align very
  well
  with a library use case.
 
  Josh Welker
 
 
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
  Ben
  Companjen
  Sent: Monday, December 09, 2013 11:14 AM
  To: CODE4LIB@LISTSERV.ND.EDU
  Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
 
  Hi Josh,
 
  Before you start coding:
 
 http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
  ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an
  DTD
  or XSD file. First try with the EAC XSD shows I need to try other
 options,
  but it's promising.
 
  (It's still an interesting problem to try to tackle yourself, of course.)
 
  Ben
 
  On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote:
 
  It's hard-coded to generate the specific elements. But your way sounds
  a lot cleaner, so I might try to do that instead :) It will be more
  difficult initially but much easier once I start implementing other
  metadata formats.
  
  Josh Welker
  
  
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
  Ben Companjen
  Sent: Monday, December 09, 2013 10:52 AM
  To: CODE4LIB@LISTSERV.ND.EDU
  Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
  
  Cool!
  My first thought on this topic was: give the program an XML schema, and
  generate possible documents with the correct datatypes etc. (Something
  like that must exist somewhere, right?) Does it happen to work anything
  like that, or is it hardcoded to generate these specific elements?
  
  Ben
  
  On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:
  
  Challenge accepted.
  
  http://library.ucmo.edu/dev/metadata-generator.php
  
  Obviously in the prototype phase, but it works. Only MODS is available
  for now, and you can only select top-level elements (all child
  elements of the top-level selections will be auto-generated). I will
  try to expand it to more than just MODS. Admittedly, I know very
  little about METS, so I will need some assistance if I am going to make
  one of those.
  
  I'll eventually host this somewhere else once it's done, so don't
  bookmark it.
  
  Josh Welker
  Information Technology Librarian
  James C. Kirkpatrick Library
  University of Central Missouri
  Warrensburg, MO 64093
  JCKL 2260
  660.543.8022
  
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
  Of Kevin S. Clarke
  Sent: Sunday, December 08, 2013 12:26 PM
  To: CODE4LIB@LISTSERV.ND.EDU
  Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
  
  When I first read this, I was imagining not having to give it your
  metadata but native support for most of our commonly used metadata
  records... so the interface is: Give me 100 MODS records and it
  spits that out... You could get fancy and say, Give me X number of
  METS records that wrap TIFFs and JPGs and that uses MODS, etc.
  That's not as trivial as hooking into an lorem ipsum machine, but it'd
  be pretty cool, imho.
  
  Kevin
  
  
  On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
  pottinge...@missouri.edu wrote:
  
   Hi, I asked this on Google Plus earlier today, but I figured I'd
   better take this question here: my brain is trying to tell me that
   there's a service or app that makes fake metadata, kind of like
   Lorem Ipsum but you feed it your fields and it gives you nonsense
   metadata back. But, it looks right enough for testing. Yesterday, I
   had to make up about 50 rows of fake metadata to test some code that
   handles paging in a UI, and I had to make it all up by hand. This
   hurts my soul. Someone please tell me such a service exists, and
   link me to it, so I never have to do this again. Or else, I may just
   make such a service, to save us all. But I don't want to go coding
   some new service if it already exists, because that sort of thing is
   for chumps.
  
  
   --
   HARDY POTTINGER pottinge...@umsystem.edu 

[CODE4LIB] Job: Software Developer at University of Maryland, College Park

2013-12-09 Thread jobs
Software Developer
University of Maryland, College Park
College Park

As the largest university library system in the Washington D.C.-Baltimore
area, the University of Maryland Libraries serve more than 37,500 students and
4,200 faculty of the flagship College Park campus. The University of Maryland
Libraries share the teaching, learning and research goals of the university.
Its role as a key academic resource is evident in its service to the academic
community and its actionable strategic plan. Recent membership in the
Committee on Institutional Cooperation, a robust organization of Big Ten
member institutions, is particularly meaningful to the University Libraries
and will further propel the university's ascendancy in academic excellence.

  
The Software Developer provides broad programming support to the University of
Maryland Libraries for the development and delivery of Java-based software
applications. The applications support development and management of web pages
and large-scale digital collections. They include tools for cataloging,
search, and discovery of digital collections, tools for acquisition of digital
collections, access to and retrieval of digital objects in the collections,
and tools for preservation and maintenance of digital collections over the
long term.

  
In addition to developing new functionality, the Software Developer will
participate in the full life cycle of the Libraries' applications, including
initial configuration and setup, design, testing, updates, and ongoing
support, maintenance, and troubleshooting. Development will also include
interfacing with and expanding functionality of open source, commercial and
production software.

  
The incumbent serves on a growing development team which works collaboratively
using agile methodologies and tools. The incumbent will use and promote tools
such as Eclipse, Maven, JUnit, Jira, and Git with a growing body of
applications and may participate in retrofitting existing applications.

  
Required Education: Bachelor's Degree

  
Required Experience:

  * Three or more years of programming experience
  * Experience creating web applications using a Web Application Framework
  * Experience using database connectivity tools such as JDBC or ORM
  * Experience using version control software such as Subversion or Git
  * Excellent interpersonal skills
  * Excellent written and verbal communication skills
  
Preferred Education: Bachelor's Degree in a field related to information
sciences, computer sciences and engineering, or information management

  
Preferred Experience:

  * Experience programming in Java
  * Experience with XML, Xpath, and XSLT
  * Experience using an integrated development environment such as Eclipse
  * Experience using Lucene or Solr search tools
  * Experience with automated testing tools such as Junit or Selenium
  * Experience with REST/SOAP web services and related tools
  * Experience with Spring or Grails
  * Experience in academic libraries and archives.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/11105/


Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Robertson, Wendy C
Thanks.  I'll see if this helps. 

I'm sure IE was used to view the files 4.5 years ago. I don't think I looked at 
them, but we had super employees (recent grads from library school) that worked 
with the files and I trust that they would have noticed problems.  

Fortunately we only have 7 of these to try to fix. 

Wendy

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon 
Gorman
Sent: Monday, December 09, 2013 3:17 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] problem in old etd xml files

A lot of modern systems won't load entities (or will limit it somehow) because 
of the denial of service attack that is possible.  Look for XML Entity 
Reference Denial of Service. I can't remember if Public declarations are 
treated any differently than System ones. (I would have suspected it to trust 
SYSTEM ones more, but they'd still be exploitable by the same bug).


(There's also a fair number of other errors, I'm somewhat skeptical that the 
example worked on many browsers even then. It's possible IE was flexible enough 
it would have worked).

One thing you might want to do is is take out the entities.

I can't remember why I had to do this, but xmllint seemed to do the trick.
( I found a snippet at
http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-references-in-xml-and-create-a-new-xml-in-c,
but it' smissing the necessary --loaddtd)

xmllint --loaddtd --noent --dropdtd FRONT.xml  FRONT_nodtdent.xml

I mean, you don't need the dtd for validation, particularly since I suspect 
given the errors it may not validate anyhow.

It might make the files a little harder to read when reading the raw source, 
but I suspect that's not typically a problem.

Jon Gorman
University of Illinois



On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C  wendy-robert...@uiowa.edu 
wrote:

 Back in 1999-2002 a handful of our theses were submitted  as a 
 collection of xml files.  We posted the files in our repository 
 several years ago (we posted a zipped folder with all the files).  At 
 that time, if you opened front.xml you would be able to access the 
 thesis. We have not touched the files in the close to 5 years since we 
 posted them, but the files no longer open correctly. One of the problem 
 theses is http://ir.uiowa.edu/etd/189/.

 Front.xml begins
 ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet 
 type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM 
 UIowa2K.dtd

 I have tried the following changes but they do not help

 1)  Adding standalone=no? to the xml declaration  -- ?xml
 version=1.0  encoding=UTF-8 standalone=no?

 2)  Changing the case of UIowa2K1.css and UIowa2K.dtd to match the
 files (which are in all caps)

 3)  Changing xml:stylesheet to xml-stylesheet

 Chrome shows errors that entities are not defined, but they are 
 defined in the dtd.

 I would appreciate any assistance in making these documents available 
 again. Thanks!

 Wendy Robertson
 Digital Scholarship Librarian *  The University of Iowa Libraries
 1015 Main Library  *  Iowa City, Iowa 52242 wendy-robert...@uiowa.edu 
 * 319-335-5821



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Tom Cramer
 I can't help wondering what the half-life of a radioactive MARC record is.
 My guess is it is either really, really short or really, really long. ;-)

It's 42, but Z39.50 will accelerate the rate of decay.

- T


Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Frumkin, Jeremy
Finally, Z39.50 actually speeds up something

-- jaf

Sent from my iPad

On Dec 9, 2013, at 4:59 PM, Tom Cramer tcra...@stanford.edu wrote:

 I can't help wondering what the half-life of a radioactive MARC record is.
 My guess is it is either really, really short or really, really long. ;-)
 
 It's 42, but Z39.50 will accelerate the rate of decay.
 
 - T


Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jason Bengtson
What's killing you is the ampersands. When these were encoded they contained 
characters that hadn't been properly encoded as XML (mainly special linguistic 
characters and non-breaking spaces). Definitely replace your :stylesheet with 
-stylesheet, but then do a find and replace on all of your ampersands. It's the 
number one giant killer with modern XML parsers. I downloaded your file, 
switched in the hyphen and ditched all the ampersands and the solution tested 
good for me in Chrome and Firefox.

Best regards,

Jason Bengtson, MLIS, MA
Head of Library Computing and Information Systems
Assistant Professor, Graduate College
Department of Health Sciences Library and Information Management
University of Oklahoma Health Sciences Center
405-271-2285, opt. 5
405-271-3297 (fax)
jason-bengt...@ouhsc.edu
http://library.ouhsc.edu
www.jasonbengtson.com

NOTICE:
This e-mail is intended solely for the use of the individual to whom it is 
addressed and may contain information that is privileged, confidential or 
otherwise exempt from disclosure. If the reader of this e-mail is not the 
intended recipient or the employee or agent responsible for delivering the 
message to the intended recipient, you are hereby notified that any 
dissemination, distribution, or copying of this communication is strictly 
prohibited. If you have received this communication in error, please 
immediately notify us by replying to the original message at the listed email 
address. Thank You.

On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C wendy-robert...@uiowa.edu 
wrote:

 Thanks.  I'll see if this helps. 
 
 I'm sure IE was used to view the files 4.5 years ago. I don't think I looked 
 at them, but we had super employees (recent grads from library school) that 
 worked with the files and I trust that they would have noticed problems.  
 
 Fortunately we only have 7 of these to try to fix. 
 
 Wendy
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon 
 Gorman
 Sent: Monday, December 09, 2013 3:17 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] problem in old etd xml files
 
 A lot of modern systems won't load entities (or will limit it somehow) 
 because of the denial of service attack that is possible.  Look for XML 
 Entity Reference Denial of Service. I can't remember if Public declarations 
 are treated any differently than System ones. (I would have suspected it to 
 trust SYSTEM ones more, but they'd still be exploitable by the same bug).
 
 
 (There's also a fair number of other errors, I'm somewhat skeptical that the 
 example worked on many browsers even then. It's possible IE was flexible 
 enough it would have worked).
 
 One thing you might want to do is is take out the entities.
 
 I can't remember why I had to do this, but xmllint seemed to do the trick.
 ( I found a snippet at
 http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-references-in-xml-and-create-a-new-xml-in-c,
 but it' smissing the necessary --loaddtd)
 
 xmllint --loaddtd --noent --dropdtd FRONT.xml  FRONT_nodtdent.xml
 
 I mean, you don't need the dtd for validation, particularly since I suspect 
 given the errors it may not validate anyhow.
 
 It might make the files a little harder to read when reading the raw source, 
 but I suspect that's not typically a problem.
 
 Jon Gorman
 University of Illinois
 
 
 
 On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C  
 wendy-robert...@uiowa.edu wrote:
 
 Back in 1999-2002 a handful of our theses were submitted  as a 
 collection of xml files.  We posted the files in our repository 
 several years ago (we posted a zipped folder with all the files).  At 
 that time, if you opened front.xml you would be able to access the 
 thesis. We have not touched the files in the close to 5 years since we 
 posted them, but the files no longer open correctly. One of the problem 
 theses is http://ir.uiowa.edu/etd/189/.
 
 Front.xml begins
 ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet 
 type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM 
 UIowa2K.dtd
 
 I have tried the following changes but they do not help
 
 1)  Adding standalone=no? to the xml declaration  -- ?xml
 version=1.0  encoding=UTF-8 standalone=no?
 
 2)  Changing the case of UIowa2K1.css and UIowa2K.dtd to match the
 files (which are in all caps)
 
 3)  Changing xml:stylesheet to xml-stylesheet
 
 Chrome shows errors that entities are not defined, but they are 
 defined in the dtd.
 
 I would appreciate any assistance in making these documents available 
 again. Thanks!
 
 Wendy Robertson
 Digital Scholarship Librarian *  The University of Iowa Libraries
 1015 Main Library  *  Iowa City, Iowa 52242 wendy-robert...@uiowa.edu 
 * 319-335-5821
 


Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Joshua Welker
Evil ampersands! They have caused me hours of headaches in past XML
projects...

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Jason Bengtson
Sent: Monday, December 09, 2013 4:35 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] problem in old etd xml files

What's killing you is the ampersands. When these were encoded they
contained characters that hadn't been properly encoded as XML (mainly
special linguistic characters and non-breaking spaces). Definitely replace
your :stylesheet with -stylesheet, but then do a find and replace on all
of your ampersands. It's the number one giant killer with modern XML
parsers. I downloaded your file, switched in the hyphen and ditched all
the ampersands and the solution tested good for me in Chrome and Firefox.

Best regards,

Jason Bengtson, MLIS, MA
Head of Library Computing and Information Systems Assistant Professor,
Graduate College Department of Health Sciences Library and Information
Management University of Oklahoma Health Sciences Center 405-271-2285,
opt. 5
405-271-3297 (fax)
jason-bengt...@ouhsc.edu
http://library.ouhsc.edu
www.jasonbengtson.com

NOTICE:
This e-mail is intended solely for the use of the individual to whom it is
addressed and may contain information that is privileged, confidential or
otherwise exempt from disclosure. If the reader of this e-mail is not the
intended recipient or the employee or agent responsible for delivering the
message to the intended recipient, you are hereby notified that any
dissemination, distribution, or copying of this communication is strictly
prohibited. If you have received this communication in error, please
immediately notify us by replying to the original message at the listed
email address. Thank You.

On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C
wendy-robert...@uiowa.edu wrote:

 Thanks.  I'll see if this helps.

 I'm sure IE was used to view the files 4.5 years ago. I don't think I
looked at them, but we had super employees (recent grads from library
school) that worked with the files and I trust that they would have
noticed problems.

 Fortunately we only have 7 of these to try to fix.

 Wendy

 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of Jon Gorman
 Sent: Monday, December 09, 2013 3:17 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] problem in old etd xml files

 A lot of modern systems won't load entities (or will limit it somehow)
because of the denial of service attack that is possible.  Look for XML
Entity Reference Denial of Service. I can't remember if Public
declarations are treated any differently than System ones. (I would have
suspected it to trust SYSTEM ones more, but they'd still be exploitable by
the same bug).


 (There's also a fair number of other errors, I'm somewhat skeptical that
the example worked on many browsers even then. It's possible IE was
flexible enough it would have worked).

 One thing you might want to do is is take out the entities.

 I can't remember why I had to do this, but xmllint seemed to do the
trick.
 ( I found a snippet at
 http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-re
 ferences-in-xml-and-create-a-new-xml-in-c,
 but it' smissing the necessary --loaddtd)

 xmllint --loaddtd --noent --dropdtd FRONT.xml  FRONT_nodtdent.xml

 I mean, you don't need the dtd for validation, particularly since I
suspect given the errors it may not validate anyhow.

 It might make the files a little harder to read when reading the raw
source, but I suspect that's not typically a problem.

 Jon Gorman
 University of Illinois



 On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C 
wendy-robert...@uiowa.edu wrote:

 Back in 1999-2002 a handful of our theses were submitted  as a
 collection of xml files.  We posted the files in our repository
 several years ago (we posted a zipped folder with all the files).  At
 that time, if you opened front.xml you would be able to access the
 thesis. We have not touched the files in the close to 5 years since
 we posted them, but the files no longer open correctly. One of the
problem theses is http://ir.uiowa.edu/etd/189/.

 Front.xml begins
 ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet
 type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM
 UIowa2K.dtd

 I have tried the following changes but they do not help

 1)  Adding standalone=no? to the xml declaration  -- ?xml
 version=1.0  encoding=UTF-8 standalone=no?

 2)  Changing the case of UIowa2K1.css and UIowa2K.dtd to match
the
 files (which are in all caps)

 3)  Changing xml:stylesheet to xml-stylesheet

 Chrome shows errors that entities are not defined, but they are
 defined in the dtd.

 I would appreciate any assistance in making these documents available
 again. Thanks!

 Wendy Robertson
 Digital Scholarship Librarian *  The University of Iowa Libraries
 1015 Main Library  *  Iowa City, Iowa 52242 

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jason Bengtson
Agreed.

Best regards,

Jason Bengtson, MLIS, MA
Head of Library Computing and Information Systems
Assistant Professor, Graduate College
Department of Health Sciences Library and Information Management
University of Oklahoma Health Sciences Center
405-271-2285, opt. 5
405-271-3297 (fax)
jason-bengt...@ouhsc.edu
http://library.ouhsc.edu
www.jasonbengtson.com

NOTICE:
This e-mail is intended solely for the use of the individual to whom it is 
addressed and may contain information that is privileged, confidential or 
otherwise exempt from disclosure. If the reader of this e-mail is not the 
intended recipient or the employee or agent responsible for delivering the 
message to the intended recipient, you are hereby notified that any 
dissemination, distribution, or copying of this communication is strictly 
prohibited. If you have received this communication in error, please 
immediately notify us by replying to the original message at the listed email 
address. Thank You.

On Dec 9, 2013, at 4:42 PM, Joshua Welker wel...@ucmo.edu wrote:

 Evil ampersands! They have caused me hours of headaches in past XML
 projects...
 
 Josh Welker
 
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
 Jason Bengtson
 Sent: Monday, December 09, 2013 4:35 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] problem in old etd xml files
 
 What's killing you is the ampersands. When these were encoded they
 contained characters that hadn't been properly encoded as XML (mainly
 special linguistic characters and non-breaking spaces). Definitely replace
 your :stylesheet with -stylesheet, but then do a find and replace on all
 of your ampersands. It's the number one giant killer with modern XML
 parsers. I downloaded your file, switched in the hyphen and ditched all
 the ampersands and the solution tested good for me in Chrome and Firefox.
 
 Best regards,
 
 Jason Bengtson, MLIS, MA
 Head of Library Computing and Information Systems Assistant Professor,
 Graduate College Department of Health Sciences Library and Information
 Management University of Oklahoma Health Sciences Center 405-271-2285,
 opt. 5
 405-271-3297 (fax)
 jason-bengt...@ouhsc.edu
 http://library.ouhsc.edu
 www.jasonbengtson.com
 
 NOTICE:
 This e-mail is intended solely for the use of the individual to whom it is
 addressed and may contain information that is privileged, confidential or
 otherwise exempt from disclosure. If the reader of this e-mail is not the
 intended recipient or the employee or agent responsible for delivering the
 message to the intended recipient, you are hereby notified that any
 dissemination, distribution, or copying of this communication is strictly
 prohibited. If you have received this communication in error, please
 immediately notify us by replying to the original message at the listed
 email address. Thank You.
 
 On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C
 wendy-robert...@uiowa.edu wrote:
 
 Thanks.  I'll see if this helps.
 
 I'm sure IE was used to view the files 4.5 years ago. I don't think I
 looked at them, but we had super employees (recent grads from library
 school) that worked with the files and I trust that they would have
 noticed problems.
 
 Fortunately we only have 7 of these to try to fix.
 
 Wendy
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of Jon Gorman
 Sent: Monday, December 09, 2013 3:17 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] problem in old etd xml files
 
 A lot of modern systems won't load entities (or will limit it somehow)
 because of the denial of service attack that is possible.  Look for XML
 Entity Reference Denial of Service. I can't remember if Public
 declarations are treated any differently than System ones. (I would have
 suspected it to trust SYSTEM ones more, but they'd still be exploitable by
 the same bug).
 
 
 (There's also a fair number of other errors, I'm somewhat skeptical that
 the example worked on many browsers even then. It's possible IE was
 flexible enough it would have worked).
 
 One thing you might want to do is is take out the entities.
 
 I can't remember why I had to do this, but xmllint seemed to do the
 trick.
 ( I found a snippet at
 http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-re
 ferences-in-xml-and-create-a-new-xml-in-c,
 but it' smissing the necessary --loaddtd)
 
 xmllint --loaddtd --noent --dropdtd FRONT.xml  FRONT_nodtdent.xml
 
 I mean, you don't need the dtd for validation, particularly since I
 suspect given the errors it may not validate anyhow.
 
 It might make the files a little harder to read when reading the raw
 source, but I suspect that's not typically a problem.
 
 Jon Gorman
 University of Illinois
 
 
 
 On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C 
 wendy-robert...@uiowa.edu wrote:
 
 Back in 1999-2002 a handful of our theses were submitted  as a
 collection of xml files.  We posted the 

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jason Bengtson
Of course, the fly in the ointment here is that you still lose all of your 
links, which you need for the document to work as intended. From what I can see 
the original construction of the document relied upon the sloppiness of parsers 
in some of the old browsers (hence the ampersands in html special characters 
that weren't XML compliant). I doubt that modern XML parsers will be convinced 
to turn your namespaced quasi-html tags into actual, event handled html without 
using XSLT. I know this is a pain, but you may have to throw together an XSLT 
frame for these XML documents. The good news is that once you build one it'll 
probably work for the others.

A lower impact solution (if your documents are as few as you've said) might be 
to do some mass find/replace queries (and some header modifications, etc) to 
convert the XML tags into workable html, then just run them as such. You can, 
of course, keep the XML extension on the files after that. I played around with 
that on the file you pointed out and was able to get the links back without too 
much trouble. It'll be a judgement on cost/benefit.

Best regards,

Jason Bengtson, MLIS, MA
Head of Library Computing and Information Systems
Assistant Professor, Graduate College
Department of Health Sciences Library and Information Management
University of Oklahoma Health Sciences Center
405-271-2285, opt. 5
405-271-3297 (fax)
jason-bengt...@ouhsc.edu
http://library.ouhsc.edu
www.jasonbengtson.com

NOTICE:
This e-mail is intended solely for the use of the individual to whom it is 
addressed and may contain information that is privileged, confidential or 
otherwise exempt from disclosure. If the reader of this e-mail is not the 
intended recipient or the employee or agent responsible for delivering the 
message to the intended recipient, you are hereby notified that any 
dissemination, distribution, or copying of this communication is strictly 
prohibited. If you have received this communication in error, please 
immediately notify us by replying to the original message at the listed email 
address. Thank You.

On Dec 9, 2013, at 4:42 PM, Joshua Welker wel...@ucmo.edu wrote:

 Evil ampersands! They have caused me hours of headaches in past XML
 projects...
 
 Josh Welker
 
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
 Jason Bengtson
 Sent: Monday, December 09, 2013 4:35 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] problem in old etd xml files
 
 What's killing you is the ampersands. When these were encoded they
 contained characters that hadn't been properly encoded as XML (mainly
 special linguistic characters and non-breaking spaces). Definitely replace
 your :stylesheet with -stylesheet, but then do a find and replace on all
 of your ampersands. It's the number one giant killer with modern XML
 parsers. I downloaded your file, switched in the hyphen and ditched all
 the ampersands and the solution tested good for me in Chrome and Firefox.
 
 Best regards,
 
 Jason Bengtson, MLIS, MA
 Head of Library Computing and Information Systems Assistant Professor,
 Graduate College Department of Health Sciences Library and Information
 Management University of Oklahoma Health Sciences Center 405-271-2285,
 opt. 5
 405-271-3297 (fax)
 jason-bengt...@ouhsc.edu
 http://library.ouhsc.edu
 www.jasonbengtson.com
 
 NOTICE:
 This e-mail is intended solely for the use of the individual to whom it is
 addressed and may contain information that is privileged, confidential or
 otherwise exempt from disclosure. If the reader of this e-mail is not the
 intended recipient or the employee or agent responsible for delivering the
 message to the intended recipient, you are hereby notified that any
 dissemination, distribution, or copying of this communication is strictly
 prohibited. If you have received this communication in error, please
 immediately notify us by replying to the original message at the listed
 email address. Thank You.
 
 On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C
 wendy-robert...@uiowa.edu wrote:
 
 Thanks.  I'll see if this helps.
 
 I'm sure IE was used to view the files 4.5 years ago. I don't think I
 looked at them, but we had super employees (recent grads from library
 school) that worked with the files and I trust that they would have
 noticed problems.
 
 Fortunately we only have 7 of these to try to fix.
 
 Wendy
 
 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of Jon Gorman
 Sent: Monday, December 09, 2013 3:17 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] problem in old etd xml files
 
 A lot of modern systems won't load entities (or will limit it somehow)
 because of the denial of service attack that is possible.  Look for XML
 Entity Reference Denial of Service. I can't remember if Public
 declarations are treated any differently than System ones. (I would have
 suspected it to trust SYSTEM ones more, but they'd 

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Bill Dueber
The sad thing is that the Library of Congress spent billions of dollars of
taxpayer money building a safe storage facility in the stable caves under
Dublin, OH, but now no one will let them bury them there.


On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.com wrote:

 I can't help wondering what the half-life of a radioactive MARC record is.
 My guess is it is either really, really short or really, really long. ;-)
 Roy


 On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca
 wrote:

  Years ago Bill Moen had a set of radioactive MARC records with unique
  tokens in all fields, to test Z39.50 retrieval. I don't know whether they
  were ever released anywhere, but I see the specs are here:
 
  http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/
 
  Peter
 
 
  Peter Binkley
  Digital Initiatives Technology Librarian
  Information Technology Services
  peter.bink...@ualberta.ca
 
  2-10K Cameron Library
  University of Alberta
  Edmonton, Alberta
  Canada T6G 2J8
 
  phone 780-492-3743
  fax 780-492-9243
 
 
  On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu wrote:
 
   I checked out the Eclipse option and was not able to get much use out
 of
   it.
   Maybe someone else will have better luck? It doesn't seem to align very
   well
   with a library use case.
  
   Josh Welker
  
  
   -Original Message-
   From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of
   Ben
   Companjen
   Sent: Monday, December 09, 2013 11:14 AM
   To: CODE4LIB@LISTSERV.ND.EDU
   Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
  
   Hi Josh,
  
   Before you start coding:
  
 
 http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
   ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an
   DTD
   or XSD file. First try with the EAC XSD shows I need to try other
  options,
   but it's promising.
  
   (It's still an interesting problem to try to tackle yourself, of
 course.)
  
   Ben
  
   On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote:
  
   It's hard-coded to generate the specific elements. But your way sounds
   a lot cleaner, so I might try to do that instead :) It will be more
   difficult initially but much easier once I start implementing other
   metadata formats.
   
   Josh Welker
   
   
   -Original Message-
   From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of
   Ben Companjen
   Sent: Monday, December 09, 2013 10:52 AM
   To: CODE4LIB@LISTSERV.ND.EDU
   Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
   
   Cool!
   My first thought on this topic was: give the program an XML schema,
 and
   generate possible documents with the correct datatypes etc. (Something
   like that must exist somewhere, right?) Does it happen to work
 anything
   like that, or is it hardcoded to generate these specific elements?
   
   Ben
   
   On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:
   
   Challenge accepted.
   
   http://library.ucmo.edu/dev/metadata-generator.php
   
   Obviously in the prototype phase, but it works. Only MODS is
 available
   for now, and you can only select top-level elements (all child
   elements of the top-level selections will be auto-generated). I will
   try to expand it to more than just MODS. Admittedly, I know very
   little about METS, so I will need some assistance if I am going to
 make
   one of those.
   
   I'll eventually host this somewhere else once it's done, so don't
   bookmark it.
   
   Josh Welker
   Information Technology Librarian
   James C. Kirkpatrick Library
   University of Central Missouri
   Warrensburg, MO 64093
   JCKL 2260
   660.543.8022
   
   -Original Message-
   From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
   Of Kevin S. Clarke
   Sent: Sunday, December 08, 2013 12:26 PM
   To: CODE4LIB@LISTSERV.ND.EDU
   Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
   
   When I first read this, I was imagining not having to give it your
   metadata but native support for most of our commonly used metadata
   records... so the interface is: Give me 100 MODS records and it
   spits that out... You could get fancy and say, Give me X number of
   METS records that wrap TIFFs and JPGs and that uses MODS, etc.
   That's not as trivial as hooking into an lorem ipsum machine, but
 it'd
   be pretty cool, imho.
   
   Kevin
   
   
   On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
   pottinge...@missouri.edu wrote:
   
Hi, I asked this on Google Plus earlier today, but I figured I'd
better take this question here: my brain is trying to tell me that
there's a service or app that makes fake metadata, kind of like
Lorem Ipsum but you feed it your fields and it gives you nonsense
metadata back. But, it looks right enough for testing. Yesterday, I
had to make up about 50 rows of fake metadata to test some 

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Jon Gorman
How did you fix the ampersands? I ask, because if you just did a simple
text transform from  to amp;, it would mask the problem of the entity
escaping I think...

Not at work, so I don't have a good example and the file is downloading
very slowly here, so I'll try to do one from memory.

There were several aacute; in the XML which mapped to an accent character
in the DTD via the Entity.

If you just substituted  with amp;, you'd get amp;aacute;, which would
render inline as accute;. It would superficially solve the issue since
browsers would no longer give the errors about the dtd since it wouldn't be
trying to load entities from the DTDs. And depending how you did it, you
likely could also replace a correctly encoded one to make amp;amp;,
leading to some very odd stuff.

I wouldn't be surprised to find some unescaped ampersands, but the solution
I posted will essentially replace the entities with their text, hopefully
causing most characters to appear correctly. You definitely still need to
fix some of the other stuff. (I suspect it never worked for most browsers
and XML systems, most likely only IE).

Jon Gorman
University of Illinois


Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Roy Tennant
I ask you, would you want to work all day sitting on top of a huge pile of
radioactive MARC records? I sure wouldn't...
Roy


On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.com wrote:

 The sad thing is that the Library of Congress spent billions of dollars of
 taxpayer money building a safe storage facility in the stable caves under
 Dublin, OH, but now no one will let them bury them there.


 On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.com wrote:

  I can't help wondering what the half-life of a radioactive MARC record
 is.
  My guess is it is either really, really short or really, really long. ;-)
  Roy
 
 
  On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca
  wrote:
 
   Years ago Bill Moen had a set of radioactive MARC records with unique
   tokens in all fields, to test Z39.50 retrieval. I don't know whether
 they
   were ever released anywhere, but I see the specs are here:
  
   http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/
  
   Peter
  
  
   Peter Binkley
   Digital Initiatives Technology Librarian
   Information Technology Services
   peter.bink...@ualberta.ca
  
   2-10K Cameron Library
   University of Alberta
   Edmonton, Alberta
   Canada T6G 2J8
  
   phone 780-492-3743
   fax 780-492-9243
  
  
   On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu
 wrote:
  
I checked out the Eclipse option and was not able to get much use out
  of
it.
Maybe someone else will have better luck? It doesn't seem to align
 very
well
with a library use case.
   
Josh Welker
   
   
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
  Of
Ben
Companjen
Sent: Monday, December 09, 2013 11:14 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
   
Hi Josh,
   
Before you start coding:
   
  
 
 http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from
 an
DTD
or XSD file. First try with the EAC XSD shows I need to try other
   options,
but it's promising.
   
(It's still an interesting problem to try to tackle yourself, of
  course.)
   
Ben
   
On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote:
   
It's hard-coded to generate the specific elements. But your way
 sounds
a lot cleaner, so I might try to do that instead :) It will be more
difficult initially but much easier once I start implementing other
metadata formats.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
 Behalf
  Of
Ben Companjen
Sent: Monday, December 09, 2013 10:52 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Cool!
My first thought on this topic was: give the program an XML schema,
  and
generate possible documents with the correct datatypes etc.
 (Something
like that must exist somewhere, right?) Does it happen to work
  anything
like that, or is it hardcoded to generate these specific elements?

Ben

On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote:

Challenge accepted.

http://library.ucmo.edu/dev/metadata-generator.php

Obviously in the prototype phase, but it works. Only MODS is
  available
for now, and you can only select top-level elements (all child
elements of the top-level selections will be auto-generated). I
 will
try to expand it to more than just MODS. Admittedly, I know very
little about METS, so I will need some assistance if I am going to
  make
one of those.

I'll eventually host this somewhere else once it's done, so don't
bookmark it.

Josh Welker
Information Technology Librarian
James C. Kirkpatrick Library
University of Central Missouri
Warrensburg, MO 64093
JCKL 2260
660.543.8022

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
 Behalf
Of Kevin S. Clarke
Sent: Sunday, December 08, 2013 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a
 thing?

When I first read this, I was imagining not having to give it your
metadata but native support for most of our commonly used metadata
records... so the interface is: Give me 100 MODS records and it
spits that out... You could get fancy and say, Give me X number of
METS records that wrap TIFFs and JPGs and that uses MODS, etc.
That's not as trivial as hooking into an lorem ipsum machine, but
  it'd
be pretty cool, imho.

Kevin


On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Hi, I asked this on Google Plus earlier today, but I 

Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread jason bengtson
For testing purposes I just nixed them. As I noted, to rework the file a person 
would probably want to use a more critical eye with find and replace. Totally 
doable.


On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote:

 How did you fix the ampersands? I ask, because if you just did a simple
 text transform from  to amp;, it would mask the problem of the entity
 escaping I think...
 
 Not at work, so I don't have a good example and the file is downloading
 very slowly here, so I'll try to do one from memory.
 
 There were several aacute; in the XML which mapped to an accent character
 in the DTD via the Entity.
 
 If you just substituted  with amp;, you'd get amp;aacute;, which would
 render inline as accute;. It would superficially solve the issue since
 browsers would no longer give the errors about the dtd since it wouldn't be
 trying to load entities from the DTDs. And depending how you did it, you
 likely could also replace a correctly encoded one to make amp;amp;,
 leading to some very odd stuff.
 
 I wouldn't be surprised to find some unescaped ampersands, but the solution
 I posted will essentially replace the entities with their text, hopefully
 causing most characters to appear correctly. You definitely still need to
 fix some of the other stuff. (I suspect it never worked for most browsers
 and XML systems, most likely only IE).
 
 Jon Gorman
 University of Illinois

Best regards,

Jason Bengtson, MLIS, MA
Head of Library Computing and Information SystemsAssistant Professor, Graduate 
CollegeDepartment of Health Sciences Library and Information 
ManagementUniversity of Oklahoma Health Sciences Center405-271-2285, opt. 
5405-271-3297 (fax)
jason-bengt...@ouhsc.edu
http://library.ouhsc.edu
www.jasonbengtson.com

NOTICE:
This e-mail is intended solely for the use of the individual to whom it is 
addressed and may contain information that is privileged, confidential or 
otherwise exempt from disclosure. If the reader of this e-mail is not the 
intended recipient or the employee or agent responsible for delivering the 
message to the intended recipient, you are hereby notified that any 
dissemination, distribution, or copying of this communication is strictly 
prohibited. If you have received this communication in error, please 
immediately notify us by replying to the original message at the listed email 
address. Thank You.


Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread jason bengtson
Each ampersand was in a special character, which would have to be replaced with 
something. Most of them were in nonbreaking spaces or the enya in a particular 
name.


On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote:

 How did you fix the ampersands? I ask, because if you just did a simple
 text transform from  to amp;, it would mask the problem of the entity
 escaping I think...
 
 Not at work, so I don't have a good example and the file is downloading
 very slowly here, so I'll try to do one from memory.
 
 There were several aacute; in the XML which mapped to an accent character
 in the DTD via the Entity.
 
 If you just substituted  with amp;, you'd get amp;aacute;, which would
 render inline as accute;. It would superficially solve the issue since
 browsers would no longer give the errors about the dtd since it wouldn't be
 trying to load entities from the DTDs. And depending how you did it, you
 likely could also replace a correctly encoded one to make amp;amp;,
 leading to some very odd stuff.
 
 I wouldn't be surprised to find some unescaped ampersands, but the solution
 I posted will essentially replace the entities with their text, hopefully
 causing most characters to appear correctly. You definitely still need to
 fix some of the other stuff. (I suspect it never worked for most browsers
 and XML systems, most likely only IE).
 
 Jon Gorman
 University of Illinois

Best regards,

Jason Bengtson, MLIS, MA
Head of Library Computing and Information SystemsAssistant Professor, Graduate 
CollegeDepartment of Health Sciences Library and Information 
ManagementUniversity of Oklahoma Health Sciences Center405-271-2285, opt. 
5405-271-3297 (fax)
jason-bengt...@ouhsc.edu
http://library.ouhsc.edu
www.jasonbengtson.com

NOTICE:
This e-mail is intended solely for the use of the individual to whom it is 
addressed and may contain information that is privileged, confidential or 
otherwise exempt from disclosure. If the reader of this e-mail is not the 
intended recipient or the employee or agent responsible for delivering the 
message to the intended recipient, you are hereby notified that any 
dissemination, distribution, or copying of this communication is strictly 
prohibited. If you have received this communication in error, please 
immediately notify us by replying to the original message at the listed email 
address. Thank You.


Re: [CODE4LIB] problem in old etd xml files

2013-12-09 Thread Roy Tennant
For my money, the text transform should look only for exact matches (e.g.,
aacute;, nbsp;, copy;) and replace them with their numeric
counterparts.
Roy


On Mon, Dec 9, 2013 at 5:41 PM, jason bengtson j.bengtson...@gmail.comwrote:

 For testing purposes I just nixed them. As I noted, to rework the file a
 person would probably want to use a more critical eye with find and
 replace. Totally doable.


 On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote:

  How did you fix the ampersands? I ask, because if you just did a simple
  text transform from  to amp;, it would mask the problem of the entity
  escaping I think...
 
  Not at work, so I don't have a good example and the file is downloading
  very slowly here, so I'll try to do one from memory.
 
  There were several aacute; in the XML which mapped to an accent
 character
  in the DTD via the Entity.
 
  If you just substituted  with amp;, you'd get amp;aacute;, which would
  render inline as accute;. It would superficially solve the issue since
  browsers would no longer give the errors about the dtd since it wouldn't
 be
  trying to load entities from the DTDs. And depending how you did it, you
  likely could also replace a correctly encoded one to make amp;amp;,
  leading to some very odd stuff.
 
  I wouldn't be surprised to find some unescaped ampersands, but the
 solution
  I posted will essentially replace the entities with their text, hopefully
  causing most characters to appear correctly. You definitely still need to
  fix some of the other stuff. (I suspect it never worked for most browsers
  and XML systems, most likely only IE).
 
  Jon Gorman
  University of Illinois

 Best regards,

 Jason Bengtson, MLIS, MA
 Head of Library Computing and Information SystemsAssistant Professor,
 Graduate CollegeDepartment of Health Sciences Library and Information
 ManagementUniversity of Oklahoma Health Sciences Center405-271-2285, opt.
 5405-271-3297 (fax)
 jason-bengt...@ouhsc.edu
 http://library.ouhsc.edu
 www.jasonbengtson.com

 NOTICE:
 This e-mail is intended solely for the use of the individual to whom it is
 addressed and may contain information that is privileged, confidential or
 otherwise exempt from disclosure. If the reader of this e-mail is not the
 intended recipient or the employee or agent responsible for delivering the
 message to the intended recipient, you are hereby notified that any
 dissemination, distribution, or copying of this communication is strictly
 prohibited. If you have received this communication in error, please
 immediately notify us by replying to the original message at the listed
 email address. Thank You.



Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Pottinger, Hardy J.
Well it's not a web service, but it does make lots of fake metadata for batch 
loading into DSpace. I will just leave this here:

https://github.com/hardyoyo/random_dspace_batch_metadata

Thanks for the lead on the Faker gem! This was a fun diversion. I especially 
like the titles this script mints. :-)

A possible improvement would be to randomly reuse author names, so author 
facets have more than one item. I'll do that if I ever have to test author 
facets.

--Hardy

Sent from my iPad

On Dec 9, 2013, at 7:36 PM, Roy Tennant 
roytenn...@gmail.commailto:roytenn...@gmail.com wrote:

I ask you, would you want to work all day sitting on top of a huge pile of
radioactive MARC records? I sure wouldn't...
Roy


On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber 
b...@dueber.commailto:b...@dueber.com wrote:

The sad thing is that the Library of Congress spent billions of dollars of
taxpayer money building a safe storage facility in the stable caves under
Dublin, OH, but now no one will let them bury them there.


On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant 
roytenn...@gmail.commailto:roytenn...@gmail.com wrote:

I can't help wondering what the half-life of a radioactive MARC record
is.
My guess is it is either really, really short or really, really long. ;-)
Roy


On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley 
peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca
wrote:

Years ago Bill Moen had a set of radioactive MARC records with unique
tokens in all fields, to test Z39.50 retrieval. I don't know whether
they
were ever released anywhere, but I see the specs are here:

http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/

Peter


Peter Binkley
Digital Initiatives Technology Librarian
Information Technology Services
peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca

2-10K Cameron Library
University of Alberta
Edmonton, Alberta
Canada T6G 2J8

phone 780-492-3743
fax 780-492-9243


On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker 
wel...@ucmo.edumailto:wel...@ucmo.edu
wrote:

I checked out the Eclipse option and was not able to get much use out
of
it.
Maybe someone else will have better luck? It doesn't seem to align
very
well
with a library use case.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
Of
Ben
Companjen
Sent: Monday, December 09, 2013 11:14 AM
To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Hi Josh,

Before you start coding:



http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from
an
DTD
or XSD file. First try with the EAC XSD shows I need to try other
options,
but it's promising.

(It's still an interesting problem to try to tackle yourself, of
course.)

Ben

On 09-12-13 17:59, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu 
wrote:

It's hard-coded to generate the specific elements. But your way
sounds
a lot cleaner, so I might try to do that instead :) It will be more
difficult initially but much easier once I start implementing other
metadata formats.

Josh Welker


-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
Behalf
Of
Ben Companjen
Sent: Monday, December 09, 2013 10:52 AM
To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

Cool!
My first thought on this topic was: give the program an XML schema,
and
generate possible documents with the correct datatypes etc.
(Something
like that must exist somewhere, right?) Does it happen to work
anything
like that, or is it hardcoded to generate these specific elements?

Ben

On 09-12-13 17:27, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu 
wrote:

Challenge accepted.

http://library.ucmo.edu/dev/metadata-generator.php

Obviously in the prototype phase, but it works. Only MODS is
available
for now, and you can only select top-level elements (all child
elements of the top-level selections will be auto-generated). I
will
try to expand it to more than just MODS. Admittedly, I know very
little about METS, so I will need some assistance if I am going to
make
one of those.

I'll eventually host this somewhere else once it's done, so don't
bookmark it.

Josh Welker
Information Technology Librarian
James C. Kirkpatrick Library
University of Central Missouri
Warrensburg, MO 64093
JCKL 2260
660.543.8022

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
Behalf
Of Kevin S. Clarke
Sent: Sunday, December 08, 2013 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a
thing?

When I first read this, I was imagining not having to give it your
metadata but native support for most of our commonly used metadata
records... so the interface is: Give me 100 MODS records and it
spits that out... 

Re: [CODE4LIB] transforming marc to rdf

2013-12-09 Thread Eric Lease Morgan
I have created an initial pile of RDF, mostly.

I am in the process of experimenting with linked data for archives. My goal is 
to use existing (EAD and MARC) metadata to create RDF/XML, and then to expose 
this RDF/XML using linked data principles. Once I get that far I hope to slurp 
up the RDF/XML into a triple store, analyse the data, and learn how the whole 
process could be improved. 

This is what I have done to date:

  * accumulated sets of EAD files and MARC
records

  * identified and cached a few XSL stylesheets
transforming EAD and MARCXML into RDF/XML

  * wrote a couple of Perl script that combine
Bullet #1 and Bullet #2 to create HTML and
RDF/XML

  * write a mod_perl module implementing
rudimentary content negotiation

  * made the whole thing (scripts, sets of data,
HTML, RDF/XML, etc.) available on the Web

You can see the fruits of these labors at http://infomotions.com/sandbox/liam/, 
and there you will find a few directories:

  * bin - my Perl scripts live here as well as
a couple of support files

  * data - full of RDF/XML files -- about 4,000
of them

  * etc - mostly stylesheets

  * id - a placeholder for the URIs and content
negotiation

  * lib - where the actual content negotiation
script lives

  * pages - HTML versions of the original metadata

  * src - a cache for my original metadata

  * tmp - things of brief importance; mostly trash

My Perl scripts read the metadata, create HTML and RDF/XML, and save the result 
in the pages and data directories, respectively. A person can browse these 
directories, but browsing will be difficult because there is nothing there 
except cryptic file names. Selecting any of the files should return valid HTML 
or RDF/XML. 

Each cryptic name is the leaf of a URI prefixed with 
http://infomotions.com/sandbox/liam/id/;. For example, if the leaf is 
mshm510, then the combined leaf and prefix form a resolvable URI -- 
http://infomotions.com/sandbox/liam/id/mshm510. When user-agent says it can 
accept text/html, then the HTTP server redirects the user-agent to 
http://infomotions.com/sandbox/liam/pages/mshm510.html. If the user agent does 
not request a text/html representation, then the RDF/XML version is returned -- 
http://infomotions.com/sandbox/liam/data/mshm510.rdf. This is rudimentary 
content-negotiation. For a good time, here are a few actionable URIs:

  * http://infomotions.com/sandbox/liam/id/4042gwbo
  * http://infomotions.com/sandbox/liam/id/httphdllocgovlocmusiceadmusmu004002
  * http://infomotions.com/sandbox/liam/id/ma117
  * http://infomotions.com/sandbox/liam/id/mshm509
  * http://infomotions.com/sandbox/liam/id/stcmarcocm11422551
  * http://infomotions.com/sandbox/liam/id/vilmarcvil_155543

For a good time, feed them to the W3C RDF Validator. 

The next step is to figure out how to handle file not found errors when a URI 
does not exist. Another thing to figure out is how to make potential robots 
aware of the data set. The bigger problem is to simply make the dataset more 
meaningful the the inclusion of more URIs in the RDF/XML as well as the use of 
a more consistent and standardized set of ontologies. 

Fun with linked data?

— 
Eric Morgan


Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Brian Zelip
Not metadata, but still pretty fun - http://meettheipsums.com - some
curated ipsums.


Brian Zelip

---
Graduate Assistant
Scholarly Commons, University Library
University of Illinois at Urbana-Champaign


On Mon, Dec 9, 2013 at 9:14 PM, Pottinger, Hardy J. 
pottinge...@missouri.edu wrote:

 Well it's not a web service, but it does make lots of fake metadata for
 batch loading into DSpace. I will just leave this here:

 https://github.com/hardyoyo/random_dspace_batch_metadata

 Thanks for the lead on the Faker gem! This was a fun diversion. I
 especially like the titles this script mints. :-)

 A possible improvement would be to randomly reuse author names, so author
 facets have more than one item. I'll do that if I ever have to test author
 facets.

 --Hardy

 Sent from my iPad

 On Dec 9, 2013, at 7:36 PM, Roy Tennant roytenn...@gmail.commailto:
 roytenn...@gmail.com wrote:

 I ask you, would you want to work all day sitting on top of a huge pile of
 radioactive MARC records? I sure wouldn't...
 Roy


 On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.commailto:
 b...@dueber.com wrote:

 The sad thing is that the Library of Congress spent billions of dollars of
 taxpayer money building a safe storage facility in the stable caves under
 Dublin, OH, but now no one will let them bury them there.


 On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.commailto:
 roytenn...@gmail.com wrote:

 I can't help wondering what the half-life of a radioactive MARC record
 is.
 My guess is it is either really, really short or really, really long. ;-)
 Roy


 On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca
 mailto:peter.bink...@ualberta.ca
 wrote:

 Years ago Bill Moen had a set of radioactive MARC records with unique
 tokens in all fields, to test Z39.50 retrieval. I don't know whether
 they
 were ever released anywhere, but I see the specs are here:

 http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/

 Peter


 Peter Binkley
 Digital Initiatives Technology Librarian
 Information Technology Services
 peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca

 2-10K Cameron Library
 University of Alberta
 Edmonton, Alberta
 Canada T6G 2J8

 phone 780-492-3743
 fax 780-492-9243


 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edumailto:
 wel...@ucmo.edu
 wrote:

 I checked out the Eclipse option and was not able to get much use out
 of
 it.
 Maybe someone else will have better luck? It doesn't seem to align
 very
 well
 with a library use case.

 Josh Welker


 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
 Of
 Ben
 Companjen
 Sent: Monday, December 09, 2013 11:14 AM
 To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

 Hi Josh,

 Before you start coding:



 http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
 ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from
 an
 DTD
 or XSD file. First try with the EAC XSD shows I need to try other
 options,
 but it's promising.

 (It's still an interesting problem to try to tackle yourself, of
 course.)

 Ben

 On 09-12-13 17:59, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu
 wrote:

 It's hard-coded to generate the specific elements. But your way
 sounds
 a lot cleaner, so I might try to do that instead :) It will be more
 difficult initially but much easier once I start implementing other
 metadata formats.

 Josh Welker


 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
 Behalf
 Of
 Ben Companjen
 Sent: Monday, December 09, 2013 10:52 AM
 To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

 Cool!
 My first thought on this topic was: give the program an XML schema,
 and
 generate possible documents with the correct datatypes etc.
 (Something
 like that must exist somewhere, right?) Does it happen to work
 anything
 like that, or is it hardcoded to generate these specific elements?

 Ben

 On 09-12-13 17:27, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu
 wrote:

 Challenge accepted.

 http://library.ucmo.edu/dev/metadata-generator.php

 Obviously in the prototype phase, but it works. Only MODS is
 available
 for now, and you can only select top-level elements (all child
 elements of the top-level selections will be auto-generated). I
 will
 try to expand it to more than just MODS. Admittedly, I know very
 little about METS, so I will need some assistance if I am going to
 make
 one of those.

 I'll eventually host this somewhere else once it's done, so don't
 bookmark it.

 Josh Welker
 Information Technology Librarian
 James C. Kirkpatrick Library
 University of Central Missouri
 Warrensburg, MO 64093
 JCKL 2260
 660.543.8022

 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
 

Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?

2013-12-09 Thread Kevin S. Clarke
I was telling Hardy earlier today we needed a metadata ipsum... it would
pull random words from the full-text of the AACR2.

Kevin



On Mon, Dec 9, 2013 at 11:24 PM, Brian Zelip bze...@gmail.com wrote:

 Not metadata, but still pretty fun - http://meettheipsums.com - some
 curated ipsums.


 Brian Zelip

 ---
 Graduate Assistant
 Scholarly Commons, University Library
 University of Illinois at Urbana-Champaign


 On Mon, Dec 9, 2013 at 9:14 PM, Pottinger, Hardy J. 
 pottinge...@missouri.edu wrote:

  Well it's not a web service, but it does make lots of fake metadata for
  batch loading into DSpace. I will just leave this here:
 
  https://github.com/hardyoyo/random_dspace_batch_metadata
 
  Thanks for the lead on the Faker gem! This was a fun diversion. I
  especially like the titles this script mints. :-)
 
  A possible improvement would be to randomly reuse author names, so author
  facets have more than one item. I'll do that if I ever have to test
 author
  facets.
 
  --Hardy
 
  Sent from my iPad
 
  On Dec 9, 2013, at 7:36 PM, Roy Tennant roytenn...@gmail.commailto:
  roytenn...@gmail.com wrote:
 
  I ask you, would you want to work all day sitting on top of a huge pile
 of
  radioactive MARC records? I sure wouldn't...
  Roy
 
 
  On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.commailto:
  b...@dueber.com wrote:
 
  The sad thing is that the Library of Congress spent billions of dollars
 of
  taxpayer money building a safe storage facility in the stable caves under
  Dublin, OH, but now no one will let them bury them there.
 
 
  On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.com
 mailto:
  roytenn...@gmail.com wrote:
 
  I can't help wondering what the half-life of a radioactive MARC record
  is.
  My guess is it is either really, really short or really, really long. ;-)
  Roy
 
 
  On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca
  mailto:peter.bink...@ualberta.ca
  wrote:
 
  Years ago Bill Moen had a set of radioactive MARC records with unique
  tokens in all fields, to test Z39.50 retrieval. I don't know whether
  they
  were ever released anywhere, but I see the specs are here:
 
  http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/
 
  Peter
 
 
  Peter Binkley
  Digital Initiatives Technology Librarian
  Information Technology Services
  peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca
 
  2-10K Cameron Library
  University of Alberta
  Edmonton, Alberta
  Canada T6G 2J8
 
  phone 780-492-3743
  fax 780-492-9243
 
 
  On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edumailto:
  wel...@ucmo.edu
  wrote:
 
  I checked out the Eclipse option and was not able to get much use out
  of
  it.
  Maybe someone else will have better luck? It doesn't seem to align
  very
  well
  with a library use case.
 
  Josh Welker
 
 
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
  Of
  Ben
  Companjen
  Sent: Monday, December 09, 2013 11:14 AM
  To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU
  Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
 
  Hi Josh,
 
  Before you start coding:
 
 
 
 
 http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen
  ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from
  an
  DTD
  or XSD file. First try with the EAC XSD shows I need to try other
  options,
  but it's promising.
 
  (It's still an interesting problem to try to tackle yourself, of
  course.)
 
  Ben
 
  On 09-12-13 17:59, Joshua Welker wel...@ucmo.edumailto:
 wel...@ucmo.edu
  wrote:
 
  It's hard-coded to generate the specific elements. But your way
  sounds
  a lot cleaner, so I might try to do that instead :) It will be more
  difficult initially but much easier once I start implementing other
  metadata formats.
 
  Josh Welker
 
 
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
  Behalf
  Of
  Ben Companjen
  Sent: Monday, December 09, 2013 10:52 AM
  To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU
  Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
 
  Cool!
  My first thought on this topic was: give the program an XML schema,
  and
  generate possible documents with the correct datatypes etc.
  (Something
  like that must exist somewhere, right?) Does it happen to work
  anything
  like that, or is it hardcoded to generate these specific elements?
 
  Ben
 
  On 09-12-13 17:27, Joshua Welker wel...@ucmo.edumailto:
 wel...@ucmo.edu
  wrote:
 
  Challenge accepted.
 
  http://library.ucmo.edu/dev/metadata-generator.php
 
  Obviously in the prototype phase, but it works. Only MODS is
  available
  for now, and you can only select top-level elements (all child
  elements of the top-level selections will be auto-generated). I
  will
  try to expand it to more than just MODS. Admittedly, I know very
  little about METS, so I will need some