[CODE4LIB] Code4Lib 2014 pre-conferences: please indicate interest on wiki
Hi folks, We have [possibly?] a record number of pre-conference proposals up on the wiki -- 19 by my count. Yay for the awesomeness of you all! We haven't crunched the numbers on morning vs. afternoon scheduling and the number of rooms available quite yet, but it would be really, super duper helpful if folks who think they might want to attend a pre-conference could take a quick look in the next couple of days and indicate interest. This is by no means a binding signup (that will happen at time of registration), but it will help us make sure that all pre-conferences with interest find a room somewhere. Sign up here! http://wiki.code4lib.org/index.php/2014_preconference_proposals thanks! -emily -- Emily Lynema Associate Department Head Information Technology, NCSU Libraries 919-513-8031 emily_lyn...@ncsu.edu
[CODE4LIB] Reminder - Code4Lib 2014 Conference Diversity Scholarship Application Due Dec 14
*** Apologies for the cross-posting*** For the Code4Lib 2014 Conference, 9 scholarships have been sponsored to promote diversity. http://code4lib.org/conference/2013/scholarship CLIR/DLF has sponsored 5 scholarships, EBSCO has sponsored 2 scholarships, ProQuest has sponsored 1 full scholarship, and Sumana Harihareswara has sponsored half a scholarship which was matched by ProQuest. All sponsors have left it up to the discretion of the Code4Lib 2014 Scholarship Committee for how to award these diversity scholarships. The Code4Lib Scholarship Committee will award 9 diversity scholarships based on merit and need. Each scholarship will provide up to $1,000 to cover travel costs and conference fees for a qualified attendee to attend the 2014 Code4Lib Conference, which will be held in Raleigh, North Carolina, from March 24 - 27, 2014. CONFERENCE INFO For more information on the Code4Lib Conference, please see the conference website: http://code4lib.org/conference/2014 You can see write-ups of previous Code4Lib Conferences: http://journal.code4lib.org/articles/6848 http://journal.code4lib.org/articles/2717 http://journal.code4lib.org/articles/998 http://journal.code4lib.org/articles/72 CODE4LIB 2014 DIVERSITY SCHOLARSHIPS ELIGIBILITY, CRITERIA, AND REQUIREMENTS To qualify for a scholarship, an applicant must be interested in actively contributing to the mission and goals of the Code4Lib Conference. - Four scholarships will be awarded to any woman or transgendered person. - Four scholarships will be awarded to any person of Hispanic or Latino, Black or African-American, Asian, Native Hawaiian or Pacific Islander, or American Indian or Alaskan Native descent. - One scholarship will be awarded to the best remaining candidate who meets any of the previously mentioned eligibility requirements. Eligible applicants may apply based on multiple criteria, but no applicant will receive more than one scholarship. Past winners of any Code4Lib scholarship are not eligible for a scholarship. The scholarship recipients will be selected based upon their merit and financial needs. Scholarship recipients are required to write and submit a brief trip report to the Code4Lib 2014 Scholarships Committee by April 1, 2014 to be posted to the Code4Lib wiki. The report should address: (a) what kind of experience they had at the conference, (b) what they have learned, (c) what suggestions they have for future attendees and conference organizers. All reimbursement forms and receipts must be received by May 26, 2014. HOW TO APPLY To apply, please send an email to Jason Ronallo (jrona...@gmail.com) with the subject heading “Code4Lib 2014 Diversity Scholarship Application” containing the following (combined into a single attached PDF, if possible): 1. A brief letter of interest, which: - Identifies your eligibility for a diversity scholarship - Describes your interest in the conference and how you intend to participate - Discusses your merit and needs for the scholarship 2. A résumé or CV 3. Contact information for two professional or academic references The application deadline is Dec. 13, 2013, 5pm EST. The scholarship committee will notify successful candidates the week of Jan. 6, 2013. SPONSORS We would like to thank our sponsors for supporting the Code4Lib 2014 Diversity Scholarships. Council on Library and Information Resources http://www.clir.org/ Digital Library Federation http://www.diglib.org/ EBSCO http://www.ebsco.com/ ProQuest http://www.proquest.com Sumana Harihareswara http://www.harihareswara.net/
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
I've had good luck using both the Data::Faker and Text::Lorem Perl modules to generate large amounts (30k+ rows) of Archivists Toolkit test data. Other ports of Data::Faker would probably work just as well, though it needs a bit more code to more than generate more than name, address and contact info. At the time I was mostly just generating person data for the AT names table but I had considered one day extending the code so that more detailed person data could be created but I never got around to it. It never occurred to me that there might actually be a need for something along the lines of a scholarly NPC generator. Rick -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Sean Hannan Sent: Sunday, December 08, 2013 7:00 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? In ruby, there's the ffaker gem (https://github.com/EmmanuelOga/ffaker), which itself is a port of Perl's Data::Faker. -Sean From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] on behalf of Pottinger, Hardy J. [pottinge...@missouri.edu] Sent: Saturday, December 07, 2013 11:51 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Nice! Are you going to put the code on GitHub (or some such place)? I'd be interested in tracking... Kevin On Mon, Dec 9, 2013 at 11:27 AM, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Sure. It's not a fancy reusable class or anything though--just a simple PHP script. I will try to put it up sometime today or tomorrow and will share a link. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Monday, December 09, 2013 10:34 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Nice! Are you going to put the code on GitHub (or some such place)? I'd be interested in tracking... Kevin On Mon, Dec 9, 2013 at 11:27 AM, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Cool! Thanks, Kevin On Mon, Dec 9, 2013 at 11:37 AM, Joshua Welker wel...@ucmo.edu wrote: Sure. It's not a fancy reusable class or anything though--just a simple PHP script. I will try to put it up sometime today or tomorrow and will share a link. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Monday, December 09, 2013 10:34 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Nice! Are you going to put the code on GitHub (or some such place)? I'd be interested in tracking... Kevin On Mon, Dec 9, 2013 at 11:27 AM, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Reminder - Code4Lib 2014 Conference Diversity Scholarship Application Due Dec 14
*** Correcting the typos *** Please see this page: http://code4lib.org/node/491 (This was for the past 2013 conference. http://code4lib.org/conference/2013/scholarship. ) The application deadline is Dec. 13, 2013, 5pm EST. The scholarship committee will notify successful candidates the week of Jan. 6, 2014. From: Bohyun Kim Sent: Monday, December 09, 2013 10:20 AM To: Code for Libraries; lit...@ala.org Subject: Reminder - Code4Lib 2014 Conference Diversity Scholarship Application Due Dec 14 *** Apologies for the cross-posting*** For the Code4Lib 2014 Conference, 9 scholarships have been sponsored to promote diversity. http://code4lib.org/conference/2013/scholarship CLIR/DLF has sponsored 5 scholarships, EBSCO has sponsored 2 scholarships, ProQuest has sponsored 1 full scholarship, and Sumana Harihareswara has sponsored half a scholarship which was matched by ProQuest. All sponsors have left it up to the discretion of the Code4Lib 2014 Scholarship Committee for how to award these diversity scholarships. The Code4Lib Scholarship Committee will award 9 diversity scholarships based on merit and need. Each scholarship will provide up to $1,000 to cover travel costs and conference fees for a qualified attendee to attend the 2014 Code4Lib Conference, which will be held in Raleigh, North Carolina, from March 24 - 27, 2014. CONFERENCE INFO For more information on the Code4Lib Conference, please see the conference website: http://code4lib.org/conference/2014 You can see write-ups of previous Code4Lib Conferences: http://journal.code4lib.org/articles/6848 http://journal.code4lib.org/articles/2717 http://journal.code4lib.org/articles/998 http://journal.code4lib.org/articles/72 CODE4LIB 2014 DIVERSITY SCHOLARSHIPS ELIGIBILITY, CRITERIA, AND REQUIREMENTS To qualify for a scholarship, an applicant must be interested in actively contributing to the mission and goals of the Code4Lib Conference. - Four scholarships will be awarded to any woman or transgendered person. - Four scholarships will be awarded to any person of Hispanic or Latino, Black or African-American, Asian, Native Hawaiian or Pacific Islander, or American Indian or Alaskan Native descent. - One scholarship will be awarded to the best remaining candidate who meets any of the previously mentioned eligibility requirements. Eligible applicants may apply based on multiple criteria, but no applicant will receive more than one scholarship. Past winners of any Code4Lib scholarship are not eligible for a scholarship. The scholarship recipients will be selected based upon their merit and financial needs. Scholarship recipients are required to write and submit a brief trip report to the Code4Lib 2014 Scholarships Committee by April 1, 2014 to be posted to the Code4Lib wiki. The report should address: (a) what kind of experience they had at the conference, (b) what they have learned, (c) what suggestions they have for future attendees and conference organizers. All reimbursement forms and receipts must be received by May 26, 2014. HOW TO APPLY To apply, please send an email to Jason Ronallo (jrona...@gmail.com) with the subject heading “Code4Lib 2014 Diversity Scholarship Application” containing the following (combined into a single attached PDF, if possible): 1. A brief letter of interest, which: - Identifies your eligibility for a diversity scholarship - Describes your interest in the conference and how you intend to participate - Discusses your merit and needs for the scholarship 2. A résumé or CV 3. Contact information for two professional or academic references The application deadline is Dec. 13, 2013, 5pm EST. The scholarship committee will notify successful candidates the week of Jan. 6, 2013. SPONSORS We would like to thank our sponsors for supporting the Code4Lib 2014 Diversity Scholarships. Council on Library and Information Resources http://www.clir.org/ Digital Library Federation http://www.diglib.org/ EBSCO http://www.ebsco.com/ ProQuest http://www.proquest.com Sumana Harihareswara http://www.harihareswara.net/
[CODE4LIB] 2 interesting work opportunities in Chattanooga TN
Hello friends, Want to work for the Mozilla Foundation and collaborate with our awesome public library here in Chattanooga? Check out these two opportunities. Documentation Design Coordinator, Mozilla Gigabit Community Fundhttps://careers.mozilla.org/en-US/position/obX7XfwW and Community Catalyst, Chattanoogahttps://careers.mozilla.org/en-US/position/o6X7XfwR Cheers N Nate Hill nathanielh...@gmail.com http://4thfloor.chattlibrary.org/ http://www.natehill.net
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
[CODE4LIB] problem in old etd xml files
Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the files in our repository several years ago (we posted a zipped folder with all the files). At that time, if you opened front.xml you would be able to access the thesis. We have not touched the files in the close to 5 years since we posted them, but the files no longer open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/. Front.xml begins ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM UIowa2K.dtd I have tried the following changes but they do not help 1) Adding standalone=no? to the xml declaration -- ?xml version=1.0 encoding=UTF-8 standalone=no? 2) Changing the case of UIowa2K1.css and UIowa2K.dtd to match the files (which are in all caps) 3) Changing xml:stylesheet to xml-stylesheet Chrome shows errors that entities are not defined, but they are defined in the dtd. I would appreciate any assistance in making these documents available again. Thanks! Wendy Robertson Digital Scholarship Librarian * The University of Iowa Libraries 1015 Main Library * Iowa City, Iowa 52242 wendy-robert...@uiowa.edu * 319-335-5821
Re: [CODE4LIB] problem in old etd xml files
DTDs and XML namespaces don't like each other very much. I think you're getting into trouble because your DTD doesn't allow the two namespace-declaring attributes on the thesis element. Try adding this to your DTD: !ATTLIST thesis xmlns:xhtml CDATA #FIXED 'http://www.w3.org/1999/xhtml' xmlns:html CDATA #FIXED 'http://www.w3.org/TR/REC-html40' You're still going to be faced with a number of validity errors, but I think most of them are self-explanatory (e.g., you have multiple linebreak elements where your DTD only allows one). Some of these have to do with validity against your DTD and others are related to HTML validity. Ron Gilmour Web Services Librarian Ithaca College Library On Mon, Dec 9, 2013 at 3:10 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the files in our repository several years ago (we posted a zipped folder with all the files). At that time, if you opened front.xml you would be able to access the thesis. We have not touched the files in the close to 5 years since we posted them, but the files no longer open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/. Front.xml begins ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM UIowa2K.dtd I have tried the following changes but they do not help 1) Adding standalone=no? to the xml declaration -- ?xml version=1.0 encoding=UTF-8 standalone=no? 2) Changing the case of UIowa2K1.css and UIowa2K.dtd to match the files (which are in all caps) 3) Changing xml:stylesheet to xml-stylesheet Chrome shows errors that entities are not defined, but they are defined in the dtd. I would appreciate any assistance in making these documents available again. Thanks! Wendy Robertson Digital Scholarship Librarian * The University of Iowa Libraries 1015 Main Library * Iowa City, Iowa 52242 wendy-robert...@uiowa.edu * 319-335-5821
Re: [CODE4LIB] problem in old etd xml files
A lot of modern systems won't load entities (or will limit it somehow) because of the denial of service attack that is possible. Look for XML Entity Reference Denial of Service. I can't remember if Public declarations are treated any differently than System ones. (I would have suspected it to trust SYSTEM ones more, but they'd still be exploitable by the same bug). (There's also a fair number of other errors, I'm somewhat skeptical that the example worked on many browsers even then. It's possible IE was flexible enough it would have worked). One thing you might want to do is is take out the entities. I can't remember why I had to do this, but xmllint seemed to do the trick. ( I found a snippet at http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-references-in-xml-and-create-a-new-xml-in-c, but it' smissing the necessary --loaddtd) xmllint --loaddtd --noent --dropdtd FRONT.xml FRONT_nodtdent.xml I mean, you don't need the dtd for validation, particularly since I suspect given the errors it may not validate anyhow. It might make the files a little harder to read when reading the raw source, but I suspect that's not typically a problem. Jon Gorman University of Illinois On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the files in our repository several years ago (we posted a zipped folder with all the files). At that time, if you opened front.xml you would be able to access the thesis. We have not touched the files in the close to 5 years since we posted them, but the files no longer open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/. Front.xml begins ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM UIowa2K.dtd I have tried the following changes but they do not help 1) Adding standalone=no? to the xml declaration -- ?xml version=1.0 encoding=UTF-8 standalone=no? 2) Changing the case of UIowa2K1.css and UIowa2K.dtd to match the files (which are in all caps) 3) Changing xml:stylesheet to xml-stylesheet Chrome shows errors that entities are not defined, but they are defined in the dtd. I would appreciate any assistance in making these documents available again. Thanks! Wendy Robertson Digital Scholarship Librarian * The University of Iowa Libraries 1015 Main Library * Iowa City, Iowa 52242 wendy-robert...@uiowa.edu * 319-335-5821
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital Initiatives Technology Librarian Information Technology Services peter.bink...@ualberta.ca 2-10K Cameron Library University of Alberta Edmonton, Alberta Canada T6G 2J8 phone 780-492-3743 fax 780-492-9243 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu wrote: I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu University of Missouri Library Systems http://lso.umsystem.edu/~pottingerhj/ https://MOspace.umsystem.edu/ Making things that are beautiful is real fun. --Lou Reed
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) Roy On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.cawrote: Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital Initiatives Technology Librarian Information Technology Services peter.bink...@ualberta.ca 2-10K Cameron Library University of Alberta Edmonton, Alberta Canada T6G 2J8 phone 780-492-3743 fax 780-492-9243 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu wrote: I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some code that handles paging in a UI, and I had to make it all up by hand. This hurts my soul. Someone please tell me such a service exists, and link me to it, so I never have to do this again. Or else, I may just make such a service, to save us all. But I don't want to go coding some new service if it already exists, because that sort of thing is for chumps. -- HARDY POTTINGER pottinge...@umsystem.edu
[CODE4LIB] Job: Software Developer at University of Maryland, College Park
Software Developer University of Maryland, College Park College Park As the largest university library system in the Washington D.C.-Baltimore area, the University of Maryland Libraries serve more than 37,500 students and 4,200 faculty of the flagship College Park campus. The University of Maryland Libraries share the teaching, learning and research goals of the university. Its role as a key academic resource is evident in its service to the academic community and its actionable strategic plan. Recent membership in the Committee on Institutional Cooperation, a robust organization of Big Ten member institutions, is particularly meaningful to the University Libraries and will further propel the university's ascendancy in academic excellence. The Software Developer provides broad programming support to the University of Maryland Libraries for the development and delivery of Java-based software applications. The applications support development and management of web pages and large-scale digital collections. They include tools for cataloging, search, and discovery of digital collections, tools for acquisition of digital collections, access to and retrieval of digital objects in the collections, and tools for preservation and maintenance of digital collections over the long term. In addition to developing new functionality, the Software Developer will participate in the full life cycle of the Libraries' applications, including initial configuration and setup, design, testing, updates, and ongoing support, maintenance, and troubleshooting. Development will also include interfacing with and expanding functionality of open source, commercial and production software. The incumbent serves on a growing development team which works collaboratively using agile methodologies and tools. The incumbent will use and promote tools such as Eclipse, Maven, JUnit, Jira, and Git with a growing body of applications and may participate in retrofitting existing applications. Required Education: Bachelor's Degree Required Experience: * Three or more years of programming experience * Experience creating web applications using a Web Application Framework * Experience using database connectivity tools such as JDBC or ORM * Experience using version control software such as Subversion or Git * Excellent interpersonal skills * Excellent written and verbal communication skills Preferred Education: Bachelor's Degree in a field related to information sciences, computer sciences and engineering, or information management Preferred Experience: * Experience programming in Java * Experience with XML, Xpath, and XSLT * Experience using an integrated development environment such as Eclipse * Experience using Lucene or Solr search tools * Experience with automated testing tools such as Junit or Selenium * Experience with REST/SOAP web services and related tools * Experience with Spring or Grails * Experience in academic libraries and archives. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/11105/
Re: [CODE4LIB] problem in old etd xml files
Thanks. I'll see if this helps. I'm sure IE was used to view the files 4.5 years ago. I don't think I looked at them, but we had super employees (recent grads from library school) that worked with the files and I trust that they would have noticed problems. Fortunately we only have 7 of these to try to fix. Wendy -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon Gorman Sent: Monday, December 09, 2013 3:17 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files A lot of modern systems won't load entities (or will limit it somehow) because of the denial of service attack that is possible. Look for XML Entity Reference Denial of Service. I can't remember if Public declarations are treated any differently than System ones. (I would have suspected it to trust SYSTEM ones more, but they'd still be exploitable by the same bug). (There's also a fair number of other errors, I'm somewhat skeptical that the example worked on many browsers even then. It's possible IE was flexible enough it would have worked). One thing you might want to do is is take out the entities. I can't remember why I had to do this, but xmllint seemed to do the trick. ( I found a snippet at http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-references-in-xml-and-create-a-new-xml-in-c, but it' smissing the necessary --loaddtd) xmllint --loaddtd --noent --dropdtd FRONT.xml FRONT_nodtdent.xml I mean, you don't need the dtd for validation, particularly since I suspect given the errors it may not validate anyhow. It might make the files a little harder to read when reading the raw source, but I suspect that's not typically a problem. Jon Gorman University of Illinois On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the files in our repository several years ago (we posted a zipped folder with all the files). At that time, if you opened front.xml you would be able to access the thesis. We have not touched the files in the close to 5 years since we posted them, but the files no longer open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/. Front.xml begins ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM UIowa2K.dtd I have tried the following changes but they do not help 1) Adding standalone=no? to the xml declaration -- ?xml version=1.0 encoding=UTF-8 standalone=no? 2) Changing the case of UIowa2K1.css and UIowa2K.dtd to match the files (which are in all caps) 3) Changing xml:stylesheet to xml-stylesheet Chrome shows errors that entities are not defined, but they are defined in the dtd. I would appreciate any assistance in making these documents available again. Thanks! Wendy Robertson Digital Scholarship Librarian * The University of Iowa Libraries 1015 Main Library * Iowa City, Iowa 52242 wendy-robert...@uiowa.edu * 319-335-5821
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) It's 42, but Z39.50 will accelerate the rate of decay. - T
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Finally, Z39.50 actually speeds up something -- jaf Sent from my iPad On Dec 9, 2013, at 4:59 PM, Tom Cramer tcra...@stanford.edu wrote: I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) It's 42, but Z39.50 will accelerate the rate of decay. - T
Re: [CODE4LIB] problem in old etd xml files
What's killing you is the ampersands. When these were encoded they contained characters that hadn't been properly encoded as XML (mainly special linguistic characters and non-breaking spaces). Definitely replace your :stylesheet with -stylesheet, but then do a find and replace on all of your ampersands. It's the number one giant killer with modern XML parsers. I downloaded your file, switched in the hyphen and ditched all the ampersands and the solution tested good for me in Chrome and Firefox. Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information Systems Assistant Professor, Graduate College Department of Health Sciences Library and Information Management University of Oklahoma Health Sciences Center 405-271-2285, opt. 5 405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You. On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Thanks. I'll see if this helps. I'm sure IE was used to view the files 4.5 years ago. I don't think I looked at them, but we had super employees (recent grads from library school) that worked with the files and I trust that they would have noticed problems. Fortunately we only have 7 of these to try to fix. Wendy -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon Gorman Sent: Monday, December 09, 2013 3:17 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files A lot of modern systems won't load entities (or will limit it somehow) because of the denial of service attack that is possible. Look for XML Entity Reference Denial of Service. I can't remember if Public declarations are treated any differently than System ones. (I would have suspected it to trust SYSTEM ones more, but they'd still be exploitable by the same bug). (There's also a fair number of other errors, I'm somewhat skeptical that the example worked on many browsers even then. It's possible IE was flexible enough it would have worked). One thing you might want to do is is take out the entities. I can't remember why I had to do this, but xmllint seemed to do the trick. ( I found a snippet at http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-references-in-xml-and-create-a-new-xml-in-c, but it' smissing the necessary --loaddtd) xmllint --loaddtd --noent --dropdtd FRONT.xml FRONT_nodtdent.xml I mean, you don't need the dtd for validation, particularly since I suspect given the errors it may not validate anyhow. It might make the files a little harder to read when reading the raw source, but I suspect that's not typically a problem. Jon Gorman University of Illinois On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the files in our repository several years ago (we posted a zipped folder with all the files). At that time, if you opened front.xml you would be able to access the thesis. We have not touched the files in the close to 5 years since we posted them, but the files no longer open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/. Front.xml begins ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM UIowa2K.dtd I have tried the following changes but they do not help 1) Adding standalone=no? to the xml declaration -- ?xml version=1.0 encoding=UTF-8 standalone=no? 2) Changing the case of UIowa2K1.css and UIowa2K.dtd to match the files (which are in all caps) 3) Changing xml:stylesheet to xml-stylesheet Chrome shows errors that entities are not defined, but they are defined in the dtd. I would appreciate any assistance in making these documents available again. Thanks! Wendy Robertson Digital Scholarship Librarian * The University of Iowa Libraries 1015 Main Library * Iowa City, Iowa 52242 wendy-robert...@uiowa.edu * 319-335-5821
Re: [CODE4LIB] problem in old etd xml files
Evil ampersands! They have caused me hours of headaches in past XML projects... Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jason Bengtson Sent: Monday, December 09, 2013 4:35 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files What's killing you is the ampersands. When these were encoded they contained characters that hadn't been properly encoded as XML (mainly special linguistic characters and non-breaking spaces). Definitely replace your :stylesheet with -stylesheet, but then do a find and replace on all of your ampersands. It's the number one giant killer with modern XML parsers. I downloaded your file, switched in the hyphen and ditched all the ampersands and the solution tested good for me in Chrome and Firefox. Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information Systems Assistant Professor, Graduate College Department of Health Sciences Library and Information Management University of Oklahoma Health Sciences Center 405-271-2285, opt. 5 405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You. On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Thanks. I'll see if this helps. I'm sure IE was used to view the files 4.5 years ago. I don't think I looked at them, but we had super employees (recent grads from library school) that worked with the files and I trust that they would have noticed problems. Fortunately we only have 7 of these to try to fix. Wendy -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon Gorman Sent: Monday, December 09, 2013 3:17 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files A lot of modern systems won't load entities (or will limit it somehow) because of the denial of service attack that is possible. Look for XML Entity Reference Denial of Service. I can't remember if Public declarations are treated any differently than System ones. (I would have suspected it to trust SYSTEM ones more, but they'd still be exploitable by the same bug). (There's also a fair number of other errors, I'm somewhat skeptical that the example worked on many browsers even then. It's possible IE was flexible enough it would have worked). One thing you might want to do is is take out the entities. I can't remember why I had to do this, but xmllint seemed to do the trick. ( I found a snippet at http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-re ferences-in-xml-and-create-a-new-xml-in-c, but it' smissing the necessary --loaddtd) xmllint --loaddtd --noent --dropdtd FRONT.xml FRONT_nodtdent.xml I mean, you don't need the dtd for validation, particularly since I suspect given the errors it may not validate anyhow. It might make the files a little harder to read when reading the raw source, but I suspect that's not typically a problem. Jon Gorman University of Illinois On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the files in our repository several years ago (we posted a zipped folder with all the files). At that time, if you opened front.xml you would be able to access the thesis. We have not touched the files in the close to 5 years since we posted them, but the files no longer open correctly. One of the problem theses is http://ir.uiowa.edu/etd/189/. Front.xml begins ?xml version=1.0 encoding=UTF-8? ?xml:stylesheet type=text/css href=UIowa2K1.css ? !DOCTYPE thesis SYSTEM UIowa2K.dtd I have tried the following changes but they do not help 1) Adding standalone=no? to the xml declaration -- ?xml version=1.0 encoding=UTF-8 standalone=no? 2) Changing the case of UIowa2K1.css and UIowa2K.dtd to match the files (which are in all caps) 3) Changing xml:stylesheet to xml-stylesheet Chrome shows errors that entities are not defined, but they are defined in the dtd. I would appreciate any assistance in making these documents available again. Thanks! Wendy Robertson Digital Scholarship Librarian * The University of Iowa Libraries 1015 Main Library * Iowa City, Iowa 52242
Re: [CODE4LIB] problem in old etd xml files
Agreed. Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information Systems Assistant Professor, Graduate College Department of Health Sciences Library and Information Management University of Oklahoma Health Sciences Center 405-271-2285, opt. 5 405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You. On Dec 9, 2013, at 4:42 PM, Joshua Welker wel...@ucmo.edu wrote: Evil ampersands! They have caused me hours of headaches in past XML projects... Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jason Bengtson Sent: Monday, December 09, 2013 4:35 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files What's killing you is the ampersands. When these were encoded they contained characters that hadn't been properly encoded as XML (mainly special linguistic characters and non-breaking spaces). Definitely replace your :stylesheet with -stylesheet, but then do a find and replace on all of your ampersands. It's the number one giant killer with modern XML parsers. I downloaded your file, switched in the hyphen and ditched all the ampersands and the solution tested good for me in Chrome and Firefox. Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information Systems Assistant Professor, Graduate College Department of Health Sciences Library and Information Management University of Oklahoma Health Sciences Center 405-271-2285, opt. 5 405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You. On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Thanks. I'll see if this helps. I'm sure IE was used to view the files 4.5 years ago. I don't think I looked at them, but we had super employees (recent grads from library school) that worked with the files and I trust that they would have noticed problems. Fortunately we only have 7 of these to try to fix. Wendy -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon Gorman Sent: Monday, December 09, 2013 3:17 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files A lot of modern systems won't load entities (or will limit it somehow) because of the denial of service attack that is possible. Look for XML Entity Reference Denial of Service. I can't remember if Public declarations are treated any differently than System ones. (I would have suspected it to trust SYSTEM ones more, but they'd still be exploitable by the same bug). (There's also a fair number of other errors, I'm somewhat skeptical that the example worked on many browsers even then. It's possible IE was flexible enough it would have worked). One thing you might want to do is is take out the entities. I can't remember why I had to do this, but xmllint seemed to do the trick. ( I found a snippet at http://stackoverflow.com/questions/614067/how-to-resolve-all-entity-re ferences-in-xml-and-create-a-new-xml-in-c, but it' smissing the necessary --loaddtd) xmllint --loaddtd --noent --dropdtd FRONT.xml FRONT_nodtdent.xml I mean, you don't need the dtd for validation, particularly since I suspect given the errors it may not validate anyhow. It might make the files a little harder to read when reading the raw source, but I suspect that's not typically a problem. Jon Gorman University of Illinois On Mon, Dec 9, 2013 at 2:10 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Back in 1999-2002 a handful of our theses were submitted as a collection of xml files. We posted the
Re: [CODE4LIB] problem in old etd xml files
Of course, the fly in the ointment here is that you still lose all of your links, which you need for the document to work as intended. From what I can see the original construction of the document relied upon the sloppiness of parsers in some of the old browsers (hence the ampersands in html special characters that weren't XML compliant). I doubt that modern XML parsers will be convinced to turn your namespaced quasi-html tags into actual, event handled html without using XSLT. I know this is a pain, but you may have to throw together an XSLT frame for these XML documents. The good news is that once you build one it'll probably work for the others. A lower impact solution (if your documents are as few as you've said) might be to do some mass find/replace queries (and some header modifications, etc) to convert the XML tags into workable html, then just run them as such. You can, of course, keep the XML extension on the files after that. I played around with that on the file you pointed out and was able to get the links back without too much trouble. It'll be a judgement on cost/benefit. Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information Systems Assistant Professor, Graduate College Department of Health Sciences Library and Information Management University of Oklahoma Health Sciences Center 405-271-2285, opt. 5 405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You. On Dec 9, 2013, at 4:42 PM, Joshua Welker wel...@ucmo.edu wrote: Evil ampersands! They have caused me hours of headaches in past XML projects... Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jason Bengtson Sent: Monday, December 09, 2013 4:35 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files What's killing you is the ampersands. When these were encoded they contained characters that hadn't been properly encoded as XML (mainly special linguistic characters and non-breaking spaces). Definitely replace your :stylesheet with -stylesheet, but then do a find and replace on all of your ampersands. It's the number one giant killer with modern XML parsers. I downloaded your file, switched in the hyphen and ditched all the ampersands and the solution tested good for me in Chrome and Firefox. Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information Systems Assistant Professor, Graduate College Department of Health Sciences Library and Information Management University of Oklahoma Health Sciences Center 405-271-2285, opt. 5 405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You. On Dec 9, 2013, at 3:57 PM, Robertson, Wendy C wendy-robert...@uiowa.edu wrote: Thanks. I'll see if this helps. I'm sure IE was used to view the files 4.5 years ago. I don't think I looked at them, but we had super employees (recent grads from library school) that worked with the files and I trust that they would have noticed problems. Fortunately we only have 7 of these to try to fix. Wendy -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Jon Gorman Sent: Monday, December 09, 2013 3:17 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] problem in old etd xml files A lot of modern systems won't load entities (or will limit it somehow) because of the denial of service attack that is possible. Look for XML Entity Reference Denial of Service. I can't remember if Public declarations are treated any differently than System ones. (I would have suspected it to trust SYSTEM ones more, but they'd
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
The sad thing is that the Library of Congress spent billions of dollars of taxpayer money building a safe storage facility in the stable caves under Dublin, OH, but now no one will let them bury them there. On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.com wrote: I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) Roy On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca wrote: Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital Initiatives Technology Librarian Information Technology Services peter.bink...@ualberta.ca 2-10K Cameron Library University of Alberta Edmonton, Alberta Canada T6G 2J8 phone 780-492-3743 fax 780-492-9243 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu wrote: I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I figured I'd better take this question here: my brain is trying to tell me that there's a service or app that makes fake metadata, kind of like Lorem Ipsum but you feed it your fields and it gives you nonsense metadata back. But, it looks right enough for testing. Yesterday, I had to make up about 50 rows of fake metadata to test some
Re: [CODE4LIB] problem in old etd xml files
How did you fix the ampersands? I ask, because if you just did a simple text transform from to amp;, it would mask the problem of the entity escaping I think... Not at work, so I don't have a good example and the file is downloading very slowly here, so I'll try to do one from memory. There were several aacute; in the XML which mapped to an accent character in the DTD via the Entity. If you just substituted with amp;, you'd get amp;aacute;, which would render inline as accute;. It would superficially solve the issue since browsers would no longer give the errors about the dtd since it wouldn't be trying to load entities from the DTDs. And depending how you did it, you likely could also replace a correctly encoded one to make amp;amp;, leading to some very odd stuff. I wouldn't be surprised to find some unescaped ampersands, but the solution I posted will essentially replace the entities with their text, hopefully causing most characters to appear correctly. You definitely still need to fix some of the other stuff. (I suspect it never worked for most browsers and XML systems, most likely only IE). Jon Gorman University of Illinois
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
I ask you, would you want to work all day sitting on top of a huge pile of radioactive MARC records? I sure wouldn't... Roy On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.com wrote: The sad thing is that the Library of Congress spent billions of dollars of taxpayer money building a safe storage facility in the stable caves under Dublin, OH, but now no one will let them bury them there. On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.com wrote: I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) Roy On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca wrote: Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital Initiatives Technology Librarian Information Technology Services peter.bink...@ualberta.ca 2-10K Cameron Library University of Alberta Edmonton, Alberta Canada T6G 2J8 phone 780-492-3743 fax 780-492-9243 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edu wrote: I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out... You could get fancy and say, Give me X number of METS records that wrap TIFFs and JPGs and that uses MODS, etc. That's not as trivial as hooking into an lorem ipsum machine, but it'd be pretty cool, imho. Kevin On Sat, Dec 7, 2013 at 11:51 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Hi, I asked this on Google Plus earlier today, but I
Re: [CODE4LIB] problem in old etd xml files
For testing purposes I just nixed them. As I noted, to rework the file a person would probably want to use a more critical eye with find and replace. Totally doable. On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote: How did you fix the ampersands? I ask, because if you just did a simple text transform from to amp;, it would mask the problem of the entity escaping I think... Not at work, so I don't have a good example and the file is downloading very slowly here, so I'll try to do one from memory. There were several aacute; in the XML which mapped to an accent character in the DTD via the Entity. If you just substituted with amp;, you'd get amp;aacute;, which would render inline as accute;. It would superficially solve the issue since browsers would no longer give the errors about the dtd since it wouldn't be trying to load entities from the DTDs. And depending how you did it, you likely could also replace a correctly encoded one to make amp;amp;, leading to some very odd stuff. I wouldn't be surprised to find some unescaped ampersands, but the solution I posted will essentially replace the entities with their text, hopefully causing most characters to appear correctly. You definitely still need to fix some of the other stuff. (I suspect it never worked for most browsers and XML systems, most likely only IE). Jon Gorman University of Illinois Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information SystemsAssistant Professor, Graduate CollegeDepartment of Health Sciences Library and Information ManagementUniversity of Oklahoma Health Sciences Center405-271-2285, opt. 5405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You.
Re: [CODE4LIB] problem in old etd xml files
Each ampersand was in a special character, which would have to be replaced with something. Most of them were in nonbreaking spaces or the enya in a particular name. On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote: How did you fix the ampersands? I ask, because if you just did a simple text transform from to amp;, it would mask the problem of the entity escaping I think... Not at work, so I don't have a good example and the file is downloading very slowly here, so I'll try to do one from memory. There were several aacute; in the XML which mapped to an accent character in the DTD via the Entity. If you just substituted with amp;, you'd get amp;aacute;, which would render inline as accute;. It would superficially solve the issue since browsers would no longer give the errors about the dtd since it wouldn't be trying to load entities from the DTDs. And depending how you did it, you likely could also replace a correctly encoded one to make amp;amp;, leading to some very odd stuff. I wouldn't be surprised to find some unescaped ampersands, but the solution I posted will essentially replace the entities with their text, hopefully causing most characters to appear correctly. You definitely still need to fix some of the other stuff. (I suspect it never worked for most browsers and XML systems, most likely only IE). Jon Gorman University of Illinois Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information SystemsAssistant Professor, Graduate CollegeDepartment of Health Sciences Library and Information ManagementUniversity of Oklahoma Health Sciences Center405-271-2285, opt. 5405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You.
Re: [CODE4LIB] problem in old etd xml files
For my money, the text transform should look only for exact matches (e.g., aacute;, nbsp;, copy;) and replace them with their numeric counterparts. Roy On Mon, Dec 9, 2013 at 5:41 PM, jason bengtson j.bengtson...@gmail.comwrote: For testing purposes I just nixed them. As I noted, to rework the file a person would probably want to use a more critical eye with find and replace. Totally doable. On Dec 9, 2013, at 7:37 PM, Jon Gorman jonathan.gor...@gmail.com wrote: How did you fix the ampersands? I ask, because if you just did a simple text transform from to amp;, it would mask the problem of the entity escaping I think... Not at work, so I don't have a good example and the file is downloading very slowly here, so I'll try to do one from memory. There were several aacute; in the XML which mapped to an accent character in the DTD via the Entity. If you just substituted with amp;, you'd get amp;aacute;, which would render inline as accute;. It would superficially solve the issue since browsers would no longer give the errors about the dtd since it wouldn't be trying to load entities from the DTDs. And depending how you did it, you likely could also replace a correctly encoded one to make amp;amp;, leading to some very odd stuff. I wouldn't be surprised to find some unescaped ampersands, but the solution I posted will essentially replace the entities with their text, hopefully causing most characters to appear correctly. You definitely still need to fix some of the other stuff. (I suspect it never worked for most browsers and XML systems, most likely only IE). Jon Gorman University of Illinois Best regards, Jason Bengtson, MLIS, MA Head of Library Computing and Information SystemsAssistant Professor, Graduate CollegeDepartment of Health Sciences Library and Information ManagementUniversity of Oklahoma Health Sciences Center405-271-2285, opt. 5405-271-3297 (fax) jason-bengt...@ouhsc.edu http://library.ouhsc.edu www.jasonbengtson.com NOTICE: This e-mail is intended solely for the use of the individual to whom it is addressed and may contain information that is privileged, confidential or otherwise exempt from disclosure. If the reader of this e-mail is not the intended recipient or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, please immediately notify us by replying to the original message at the listed email address. Thank You.
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Well it's not a web service, but it does make lots of fake metadata for batch loading into DSpace. I will just leave this here: https://github.com/hardyoyo/random_dspace_batch_metadata Thanks for the lead on the Faker gem! This was a fun diversion. I especially like the titles this script mints. :-) A possible improvement would be to randomly reuse author names, so author facets have more than one item. I'll do that if I ever have to test author facets. --Hardy Sent from my iPad On Dec 9, 2013, at 7:36 PM, Roy Tennant roytenn...@gmail.commailto:roytenn...@gmail.com wrote: I ask you, would you want to work all day sitting on top of a huge pile of radioactive MARC records? I sure wouldn't... Roy On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.commailto:b...@dueber.com wrote: The sad thing is that the Library of Congress spent billions of dollars of taxpayer money building a safe storage facility in the stable caves under Dublin, OH, but now no one will let them bury them there. On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.commailto:roytenn...@gmail.com wrote: I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) Roy On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca wrote: Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital Initiatives Technology Librarian Information Technology Services peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca 2-10K Cameron Library University of Alberta Edmonton, Alberta Canada T6G 2J8 phone 780-492-3743 fax 780-492-9243 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu wrote: I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Kevin S. Clarke Sent: Sunday, December 08, 2013 12:26 PM To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? When I first read this, I was imagining not having to give it your metadata but native support for most of our commonly used metadata records... so the interface is: Give me 100 MODS records and it spits that out...
Re: [CODE4LIB] transforming marc to rdf
I have created an initial pile of RDF, mostly. I am in the process of experimenting with linked data for archives. My goal is to use existing (EAD and MARC) metadata to create RDF/XML, and then to expose this RDF/XML using linked data principles. Once I get that far I hope to slurp up the RDF/XML into a triple store, analyse the data, and learn how the whole process could be improved. This is what I have done to date: * accumulated sets of EAD files and MARC records * identified and cached a few XSL stylesheets transforming EAD and MARCXML into RDF/XML * wrote a couple of Perl script that combine Bullet #1 and Bullet #2 to create HTML and RDF/XML * write a mod_perl module implementing rudimentary content negotiation * made the whole thing (scripts, sets of data, HTML, RDF/XML, etc.) available on the Web You can see the fruits of these labors at http://infomotions.com/sandbox/liam/, and there you will find a few directories: * bin - my Perl scripts live here as well as a couple of support files * data - full of RDF/XML files -- about 4,000 of them * etc - mostly stylesheets * id - a placeholder for the URIs and content negotiation * lib - where the actual content negotiation script lives * pages - HTML versions of the original metadata * src - a cache for my original metadata * tmp - things of brief importance; mostly trash My Perl scripts read the metadata, create HTML and RDF/XML, and save the result in the pages and data directories, respectively. A person can browse these directories, but browsing will be difficult because there is nothing there except cryptic file names. Selecting any of the files should return valid HTML or RDF/XML. Each cryptic name is the leaf of a URI prefixed with http://infomotions.com/sandbox/liam/id/;. For example, if the leaf is mshm510, then the combined leaf and prefix form a resolvable URI -- http://infomotions.com/sandbox/liam/id/mshm510. When user-agent says it can accept text/html, then the HTTP server redirects the user-agent to http://infomotions.com/sandbox/liam/pages/mshm510.html. If the user agent does not request a text/html representation, then the RDF/XML version is returned -- http://infomotions.com/sandbox/liam/data/mshm510.rdf. This is rudimentary content-negotiation. For a good time, here are a few actionable URIs: * http://infomotions.com/sandbox/liam/id/4042gwbo * http://infomotions.com/sandbox/liam/id/httphdllocgovlocmusiceadmusmu004002 * http://infomotions.com/sandbox/liam/id/ma117 * http://infomotions.com/sandbox/liam/id/mshm509 * http://infomotions.com/sandbox/liam/id/stcmarcocm11422551 * http://infomotions.com/sandbox/liam/id/vilmarcvil_155543 For a good time, feed them to the W3C RDF Validator. The next step is to figure out how to handle file not found errors when a URI does not exist. Another thing to figure out is how to make potential robots aware of the data set. The bigger problem is to simply make the dataset more meaningful the the inclusion of more URIs in the RDF/XML as well as the use of a more consistent and standardized set of ontologies. Fun with linked data? — Eric Morgan
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
Not metadata, but still pretty fun - http://meettheipsums.com - some curated ipsums. Brian Zelip --- Graduate Assistant Scholarly Commons, University Library University of Illinois at Urbana-Champaign On Mon, Dec 9, 2013 at 9:14 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Well it's not a web service, but it does make lots of fake metadata for batch loading into DSpace. I will just leave this here: https://github.com/hardyoyo/random_dspace_batch_metadata Thanks for the lead on the Faker gem! This was a fun diversion. I especially like the titles this script mints. :-) A possible improvement would be to randomly reuse author names, so author facets have more than one item. I'll do that if I ever have to test author facets. --Hardy Sent from my iPad On Dec 9, 2013, at 7:36 PM, Roy Tennant roytenn...@gmail.commailto: roytenn...@gmail.com wrote: I ask you, would you want to work all day sitting on top of a huge pile of radioactive MARC records? I sure wouldn't... Roy On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.commailto: b...@dueber.com wrote: The sad thing is that the Library of Congress spent billions of dollars of taxpayer money building a safe storage facility in the stable caves under Dublin, OH, but now no one will let them bury them there. On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.commailto: roytenn...@gmail.com wrote: I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) Roy On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca mailto:peter.bink...@ualberta.ca wrote: Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital Initiatives Technology Librarian Information Technology Services peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca 2-10K Cameron Library University of Alberta Edmonton, Alberta Canada T6G 2J8 phone 780-492-3743 fax 780-492-9243 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edumailto: wel...@ucmo.edu wrote: I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edumailto:wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some assistance if I am going to make one of those. I'll eventually host this somewhere else once it's done, so don't bookmark it. Josh Welker Information Technology Librarian James C. Kirkpatrick Library University of Central Missouri Warrensburg, MO 64093 JCKL 2260 660.543.8022 -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing?
I was telling Hardy earlier today we needed a metadata ipsum... it would pull random words from the full-text of the AACR2. Kevin On Mon, Dec 9, 2013 at 11:24 PM, Brian Zelip bze...@gmail.com wrote: Not metadata, but still pretty fun - http://meettheipsums.com - some curated ipsums. Brian Zelip --- Graduate Assistant Scholarly Commons, University Library University of Illinois at Urbana-Champaign On Mon, Dec 9, 2013 at 9:14 PM, Pottinger, Hardy J. pottinge...@missouri.edu wrote: Well it's not a web service, but it does make lots of fake metadata for batch loading into DSpace. I will just leave this here: https://github.com/hardyoyo/random_dspace_batch_metadata Thanks for the lead on the Faker gem! This was a fun diversion. I especially like the titles this script mints. :-) A possible improvement would be to randomly reuse author names, so author facets have more than one item. I'll do that if I ever have to test author facets. --Hardy Sent from my iPad On Dec 9, 2013, at 7:36 PM, Roy Tennant roytenn...@gmail.commailto: roytenn...@gmail.com wrote: I ask you, would you want to work all day sitting on top of a huge pile of radioactive MARC records? I sure wouldn't... Roy On Mon, Dec 9, 2013 at 5:08 PM, Bill Dueber b...@dueber.commailto: b...@dueber.com wrote: The sad thing is that the Library of Congress spent billions of dollars of taxpayer money building a safe storage facility in the stable caves under Dublin, OH, but now no one will let them bury them there. On Mon, Dec 9, 2013 at 4:50 PM, Roy Tennant roytenn...@gmail.com mailto: roytenn...@gmail.com wrote: I can't help wondering what the half-life of a radioactive MARC record is. My guess is it is either really, really short or really, really long. ;-) Roy On Mon, Dec 9, 2013 at 1:39 PM, Peter Binkley peter.bink...@ualberta.ca mailto:peter.bink...@ualberta.ca wrote: Years ago Bill Moen had a set of radioactive MARC records with unique tokens in all fields, to test Z39.50 retrieval. I don't know whether they were ever released anywhere, but I see the specs are here: http://digital.library.unt.edu/ark:/67531/metadc111015/m1/1/ Peter Peter Binkley Digital Initiatives Technology Librarian Information Technology Services peter.bink...@ualberta.camailto:peter.bink...@ualberta.ca 2-10K Cameron Library University of Alberta Edmonton, Alberta Canada T6G 2J8 phone 780-492-3743 fax 780-492-9243 On Mon, Dec 9, 2013 at 12:50 PM, Joshua Welker wel...@ucmo.edumailto: wel...@ucmo.edu wrote: I checked out the Eclipse option and was not able to get much use out of it. Maybe someone else will have better luck? It doesn't seem to align very well with a library use case. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 11:14 AM To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Hi Josh, Before you start coding: http://stackoverflow.com/questions/17106/how-to-generate-sample-xml-documen ts-from-their-dtd-or-xsd suggests that Eclipse can generate XML from an DTD or XSD file. First try with the EAC XSD shows I need to try other options, but it's promising. (It's still an interesting problem to try to tackle yourself, of course.) Ben On 09-12-13 17:59, Joshua Welker wel...@ucmo.edumailto: wel...@ucmo.edu wrote: It's hard-coded to generate the specific elements. But your way sounds a lot cleaner, so I might try to do that instead :) It will be more difficult initially but much easier once I start implementing other metadata formats. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Ben Companjen Sent: Monday, December 09, 2013 10:52 AM To: CODE4LIB@LISTSERV.ND.EDUmailto:CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Lorem Ipsum metadata? Is there such a thing? Cool! My first thought on this topic was: give the program an XML schema, and generate possible documents with the correct datatypes etc. (Something like that must exist somewhere, right?) Does it happen to work anything like that, or is it hardcoded to generate these specific elements? Ben On 09-12-13 17:27, Joshua Welker wel...@ucmo.edumailto: wel...@ucmo.edu wrote: Challenge accepted. http://library.ucmo.edu/dev/metadata-generator.php Obviously in the prototype phase, but it works. Only MODS is available for now, and you can only select top-level elements (all child elements of the top-level selections will be auto-generated). I will try to expand it to more than just MODS. Admittedly, I know very little about METS, so I will need some