Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
David, That sounds like a definite option. Thanks. Does S3 has an API for uploading so that the upload process could be scripted, or do you manually upload each file? Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Gary McGath
Concerns have been raised about how expensive Glacier gets if you need to recover a lot of files in a short time period. http://www.wired.com/wiredenterprise/2012/08/glacier/ On 1/10/13 5:56 PM, Roy Tennant wrote: I'd also take a look at Amazon Glacier. Recently I parked about 50GB of data

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
Glacier sounds even better than S3 for what we're looking for. We are only going to be retrieving the files in the case of corruption, so the pay-per-retrieval model would work well. I heard of Glacier in the past but forgot all about it. Thank you. Josh Welker -Original Message-

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
Good point. But since campus IT will be creating regular disaster-recovery backups, the odds that we'd need ever need to retrieve more than a handful of files from Glacier at a time is pretty low. Josh Welker -Original Message- From: Code for Libraries

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Matt Schultz
Hi Josh, Glad you are looking into LOCKSS as a potential solution for your needs and that you are thinking beyond simple backup solutions for more long-term preservation. Here at MetaArchive Cooperative we make use of LOCKSS to preserve a range of content/collections from our member institutions.

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Al Matthews
We use LOCKSS as part of MetaArchive. LOCKSS as I understand it is typically spec-d for consumer hardware, and so, presumably as a result of SE Asia flooding, there have been some drive failures and cache downtimes and adjustments accordingly. However, that is the worst of it, first. LOCKSS is

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
Thanks, Al. I think we'd join a LOCKSS network rather than run multiple LOCKSS boxes ourselves. Does anyone have any experience with one of those, like the LOCKSS Global Alliance? Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Al

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Al Matthews
http://metaarchive.org/costs in our case. Interested to hear other experiences. Al On 1/11/13 10:01 AM, Joshua Welker jwel...@sbuniv.edu wrote: Thanks, Al. I think we'd join a LOCKSS network rather than run multiple LOCKSS boxes ourselves. Does anyone have any experience with one of those, like

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread James Gilbert
Hi Josh, I lurked on this thread, as I did not know the size of your institution. Being a public library serving about 24,000 residents - we have the small-institution issues as well for this type of project. We recently tackled a similar situation and the solution: 1) Purchase a 3TB SeaGate

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
James, Definitely a simple and elegant solution, but that is not a viable long-term option for us. We currently have tons of old CDs and DVDs full of data, and one of our goals is to wean off those media completely. Most consumer-grade CDs and DVDs are very poor in terms of long-term data

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Matt Schultz
Josh, Totally understand the resource constraints and the price comparison up-front. As Roy alluded to earlier, it pays with Glacier to envision what your content retrieval scenarios might be, because that $368 up-front could very easily balloon in situations where you are needing to restore a

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Cary Gordon
Restoring 3 Tb from Glacier is $370. Add about $90 if you use AWS Import/Export (you provide the device). Hopefully, this is not something that you would do often. Cary On Fri, Jan 11, 2013 at 8:14 AM, Matt Schultz matt.schu...@metaarchive.org wrote: Josh, Totally understand the resource

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Aaron Trehub
Hello Josh, Auburn University is a member of two Private LOCKSS Networks: the MetaArchive Cooperative and the Alabama Digital Preservation Network (ADPNet). Here's a link to a recent conference paper that describes both networks, including their current pricing structures:

[CODE4LIB] code4lib 2013 location

2013-01-11 Thread Erik Hetzner
Hi all, Apparently code4lib 2013 is going to be held at the UIC Forum http://www.uic.edu/depts/uicforum/ I assumed it would be at the conference hotel. This is just a note so that others do not make the same assumption, since nowhere in the information about the conference is the location

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
Thanks, I missed the part about DuraCloud as an abstraction layer. I might look into hosting an install of it on the primary server running the digitization platform. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Tim Donohue

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
The only scenario I can think of where we'd need to do a full restore is if the server crashes, and for those cases, we are going to have typical short-term imaging setups in place. Our needs beyond that are to make sure our original files are backed up redundantly in some non-volatile location

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
Thanks for bringing up the issue of the cost of making sure the data is consistent. We will be using DSpace for now, and I know DSpace has some checksum functionality built in out-of-the-box. It shouldn't be too difficult to write a script that loops through DSpace's checksum data and compares

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Francis Kayiwa
On Fri, Jan 11, 2013 at 10:41:54AM -0800, Erik Hetzner wrote: Hi all, Apparently code4lib 2013 is going to be held at the UIC Forum http://www.uic.edu/depts/uicforum/ I assumed it would be at the conference hotel. This is just a note so that others do not make the same assumption,

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Thomas Kula
On Fri, Jan 11, 2013 at 07:45:21PM +, Joshua Welker wrote: Thanks for bringing up the issue of the cost of making sure the data is consistent. We will be using DSpace for now, and I know DSpace has some checksum functionality built in out-of-the-box. It shouldn't be too difficult to

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Tim Donohue
Hi Josh, Now that you bring up DSpace as being part of the equation... You might want to look at the newly released Replication Task Suite plugin/addon for DSpace (supports DSpace versions 1.8.x 3.0): https://wiki.duraspace.org/display/DSPACE/ReplicationTaskSuite This DSpace plugin does

[CODE4LIB] Job Posting / Metadata Specialist / Washington, DC

2013-01-11 Thread Suzanne Richards
Apologies for the cross postings . . . . . . . LAC Group is seeking a Metadata Specialist to work on a long-term contract for a prestigious government agency located in Washington, DC. This position includes reconciling existing schemas and vocabularies to create an enterprise schema and

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Joshua Welker
Awesome! Thanks. I will look into this for sure. Josh Welker -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Tim Donohue Sent: Friday, January 11, 2013 2:30 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Digital collection backups Hi

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Patrick Berry
I'll take this opportunity to remind folks that if you spot anything amiss, please let me know (or sign up and fix it!) and I will clean it up. Thanks! On Fri, Jan 11, 2013 at 11:51 AM, Francis Kayiwa kay...@uic.edu wrote: On Fri, Jan 11, 2013 at 10:41:54AM -0800, Erik Hetzner wrote: Hi

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread ddwiggins
Be careful about assuming too much on this. When I started working with S3, the system required an MD5 sum to upload, and would respond to requests with this etag in the header as well. I therefor assumed that this was integral to the system, and was a good way to compare local files against

Re: [CODE4LIB] Digital collection backups

2013-01-11 Thread Randy Fischer
On Fri, Jan 11, 2013 at 2:45 PM, Joshua Welker jwel...@sbuniv.edu wrote: Reading the Glacier FAQ on Amazon's site, it looks like they provide an archive inventory (updated daily) that can be downloaded as JSON. I read some users saying that this inventory includes checksum data. So hopefully

Re: [CODE4LIB] XMP Metadata to tab-delemited file

2013-01-11 Thread Misty De Meo
Hi, Andrea, XMP is natively an RDF-based format, so getting out XML isn't hard at all. You have a couple of XML-based options with exiftool: exiftool -X foo.jpg # prints the metadata in exiftool's own RDF/XML schema to stdout exiftool -tagsfromfile foo.jpg -o foo.xmp # writes the metadata in an

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Cynthia Ng
I'm sorry, but that doesn't actually clear up anything for me. The location on the layrd page just says Chicago. So, is the conference still happening at UIC? Since the conference hotel isn't super close, does that mean there will be transportation provided? While we're on the subject, are the

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Patrick Berry
So, the location field on Lanyrd is not super specific. It likes big things, like cities. On the reg page for code4lib the UIC Forum is listed as the location though. http://www.regonline.com/builder/site/Default.aspx?EventID=1167723 I'll see if I can put that somewhere in the event on Lanyrd.

[CODE4LIB] Job: Tools Lab Operations Engineer (Contractor) at Wikimedia Foundation

2013-01-11 Thread jobs
**Background Information** The Wikimedia Foundation, Inc. is a nonprofit charitable organization dedicated to the growth, development and distribution of free, multilingual content, and to providing the full content of these wiki-based projects to the public free of charge. The Wikimedia

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Francis Kayiwa
On Fri, Jan 11, 2013 at 06:41:26PM -0500, Cynthia Ng wrote: I'm sorry, but that doesn't actually clear up anything for me. The location on the layrd page just says Chicago. So, is the conference still happening at UIC? Since the conference hotel isn't super close, does that mean there will be

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Bill Dueber
Because it seems like it might be useful, I've started a publicly-editable google map at http://goo.gl/maps/LWqay Right now, it has two points: the hotel and the conference location. Please add stuff as appropriate if the urge strikes you. On Fri, Jan 11, 2013 at 7:54 PM, Francis Kayiwa

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Wilhelmina Randtke
It takes about 15 minutes to walk a mile. It's really not that far for people without health problems that affect mobility. In most cases, driving, then parking will take more time than walking to cover such a short distance. Just saying... -Wilhelmina Randtke On Fri, Jan 11, 2013 at 7:12 PM,

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Cary Gordon
FWIW, the # 8 bus runs every 10 min. Cary On Fri, Jan 11, 2013 at 5:12 PM, Bill Dueber b...@dueber.com wrote: Because it seems like it might be useful, I've started a publicly-editable google map at http://goo.gl/maps/LWqay Right now, it has two points: the hotel and the conference

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Francis Kayiwa
On Fri, Jan 11, 2013 at 05:51:17PM -0800, Cary Gordon wrote: FWIW, the # 8 bus runs every 10 min. Good point. It may be worth your while getting the 3 day pass for $US 14 http://www.transitchicago.com/travel_information/fares/unlimitedridecards.aspx Not for traveling to the conference but any

Re: [CODE4LIB] code4lib 2013 location

2013-01-11 Thread Jon Gorman
Gah, I think I forgot to announce this on the list, but there's also this google map: https://maps.google.com/maps/ms?msid=213549257652679418473.0004ce6c25e6cdeb0319dmsa=0 which I put on the social page http://wiki.code4lib.org/index.php/2013_social_activities I'll go ahead and add the hotel and