David,
That sounds like a definite option. Thanks. Does S3 has an API for uploading so
that the upload process could be scripted, or do you manually upload each file?
Josh Welker
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
Concerns have been raised about how expensive Glacier gets if you need
to recover a lot of files in a short time period.
http://www.wired.com/wiredenterprise/2012/08/glacier/
On 1/10/13 5:56 PM, Roy Tennant wrote:
I'd also take a look at Amazon Glacier. Recently I parked about 50GB
of data
Glacier sounds even better than S3 for what we're looking for. We are only
going to be retrieving the files in the case of corruption, so the
pay-per-retrieval model would work well. I heard of Glacier in the past but
forgot all about it. Thank you.
Josh Welker
-Original Message-
Good point. But since campus IT will be creating regular disaster-recovery
backups, the odds that we'd need ever need to retrieve more than a handful of
files from Glacier at a time is pretty low.
Josh Welker
-Original Message-
From: Code for Libraries
Hi Josh,
Glad you are looking into LOCKSS as a potential solution for your needs and
that you are thinking beyond simple backup solutions for more long-term
preservation. Here at MetaArchive Cooperative we make use of LOCKSS to
preserve a range of content/collections from our member institutions.
We use LOCKSS as part of MetaArchive. LOCKSS as I understand it is
typically spec-d for consumer hardware, and so, presumably as a result of
SE Asia flooding, there have been some drive failures and cache downtimes
and adjustments accordingly.
However, that is the worst of it, first.
LOCKSS is
Thanks, Al. I think we'd join a LOCKSS network rather than run multiple LOCKSS
boxes ourselves. Does anyone have any experience with one of those, like the
LOCKSS Global Alliance?
Josh Welker
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Al
http://metaarchive.org/costs in our case. Interested to hear other
experiences. Al
On 1/11/13 10:01 AM, Joshua Welker jwel...@sbuniv.edu wrote:
Thanks, Al. I think we'd join a LOCKSS network rather than run multiple
LOCKSS boxes ourselves. Does anyone have any experience with one of
those, like
Hi Josh,
I lurked on this thread, as I did not know the size of your institution.
Being a public library serving about 24,000 residents - we have the
small-institution issues as well for this type of project. We recently
tackled a similar situation and the solution:
1) Purchase a 3TB SeaGate
James,
Definitely a simple and elegant solution, but that is not a viable long-term
option for us. We currently have tons of old CDs and DVDs full of data, and one
of our goals is to wean off those media completely. Most consumer-grade CDs
and DVDs are very poor in terms of long-term data
Josh,
Totally understand the resource constraints and the price comparison
up-front. As Roy alluded to earlier, it pays with Glacier to envision what
your content retrieval scenarios might be, because that $368 up-front could
very easily balloon in situations where you are needing to restore a
Restoring 3 Tb from Glacier is $370. Add about $90 if you use AWS
Import/Export (you provide the device).
Hopefully, this is not something that you would do often.
Cary
On Fri, Jan 11, 2013 at 8:14 AM, Matt Schultz
matt.schu...@metaarchive.org wrote:
Josh,
Totally understand the resource
Hello Josh,
Auburn University is a member of two Private LOCKSS Networks: the MetaArchive
Cooperative and the Alabama Digital Preservation Network (ADPNet). Here's a
link to a recent conference paper that describes both networks, including their
current pricing structures:
Hi all,
Apparently code4lib 2013 is going to be held at the UIC Forum
http://www.uic.edu/depts/uicforum/
I assumed it would be at the conference hotel. This is just a note so
that others do not make the same assumption, since nowhere in the
information about the conference is the location
Thanks, I missed the part about DuraCloud as an abstraction layer. I might look
into hosting an install of it on the primary server running the digitization
platform.
Josh Welker
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Tim
Donohue
The only scenario I can think of where we'd need to do a full restore is if the
server crashes, and for those cases, we are going to have typical short-term
imaging setups in place. Our needs beyond that are to make sure our original
files are backed up redundantly in some non-volatile location
Thanks for bringing up the issue of the cost of making sure the data is
consistent. We will be using DSpace for now, and I know DSpace has some
checksum functionality built in out-of-the-box. It shouldn't be too difficult
to write a script that loops through DSpace's checksum data and compares
On Fri, Jan 11, 2013 at 10:41:54AM -0800, Erik Hetzner wrote:
Hi all,
Apparently code4lib 2013 is going to be held at the UIC Forum
http://www.uic.edu/depts/uicforum/
I assumed it would be at the conference hotel. This is just a note so
that others do not make the same assumption,
On Fri, Jan 11, 2013 at 07:45:21PM +, Joshua Welker wrote:
Thanks for bringing up the issue of the cost of making sure the data is
consistent. We will be using DSpace for now, and I know DSpace has some
checksum functionality built in out-of-the-box. It shouldn't be too difficult
to
Hi Josh,
Now that you bring up DSpace as being part of the equation...
You might want to look at the newly released Replication Task Suite
plugin/addon for DSpace (supports DSpace versions 1.8.x 3.0):
https://wiki.duraspace.org/display/DSPACE/ReplicationTaskSuite
This DSpace plugin does
Apologies for the cross postings . . . . . . .
LAC Group is seeking a Metadata Specialist to work on a long-term contract for
a prestigious government agency located in Washington, DC. This position
includes reconciling existing schemas and vocabularies to create an enterprise
schema and
Awesome! Thanks. I will look into this for sure.
Josh Welker
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Tim
Donohue
Sent: Friday, January 11, 2013 2:30 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Digital collection backups
Hi
I'll take this opportunity to remind folks that if you spot anything amiss,
please let me know (or sign up and fix it!) and I will clean it up. Thanks!
On Fri, Jan 11, 2013 at 11:51 AM, Francis Kayiwa kay...@uic.edu wrote:
On Fri, Jan 11, 2013 at 10:41:54AM -0800, Erik Hetzner wrote:
Hi
Be careful about assuming too much on this.
When I started working with S3, the system required an MD5 sum to upload, and
would respond to requests with this etag in the header as well. I therefor
assumed that this was integral to the system, and was a good way to compare
local files against
On Fri, Jan 11, 2013 at 2:45 PM, Joshua Welker jwel...@sbuniv.edu wrote:
Reading the Glacier FAQ on Amazon's site, it looks like they provide an
archive inventory (updated daily) that can be downloaded as JSON. I read
some users saying that this inventory includes checksum data. So hopefully
Hi, Andrea,
XMP is natively an RDF-based format, so getting out XML isn't hard at all.
You have a couple of XML-based options with exiftool:
exiftool -X foo.jpg # prints the metadata in exiftool's own RDF/XML schema
to stdout
exiftool -tagsfromfile foo.jpg -o foo.xmp # writes the metadata in an
I'm sorry, but that doesn't actually clear up anything for me. The
location on the layrd page just says Chicago. So, is the conference
still happening at UIC? Since the conference hotel isn't super close,
does that mean there will be transportation provided?
While we're on the subject, are the
So, the location field on Lanyrd is not super specific. It likes big
things, like cities. On the reg page for code4lib the UIC Forum is listed
as the location though.
http://www.regonline.com/builder/site/Default.aspx?EventID=1167723
I'll see if I can put that somewhere in the event on Lanyrd.
**Background Information**
The Wikimedia Foundation, Inc. is a nonprofit charitable organization
dedicated to the growth, development and distribution of free, multilingual
content, and to providing the full content of these wiki-based projects to the
public free of charge. The Wikimedia
On Fri, Jan 11, 2013 at 06:41:26PM -0500, Cynthia Ng wrote:
I'm sorry, but that doesn't actually clear up anything for me. The
location on the layrd page just says Chicago. So, is the conference
still happening at UIC? Since the conference hotel isn't super close,
does that mean there will be
Because it seems like it might be useful, I've started a publicly-editable
google map at
http://goo.gl/maps/LWqay
Right now, it has two points: the hotel and the conference location. Please
add stuff as appropriate if the urge strikes you.
On Fri, Jan 11, 2013 at 7:54 PM, Francis Kayiwa
It takes about 15 minutes to walk a mile. It's really not that far for
people without health problems that affect mobility. In most cases,
driving, then parking will take more time than walking to cover such a
short distance. Just saying...
-Wilhelmina Randtke
On Fri, Jan 11, 2013 at 7:12 PM,
FWIW, the # 8 bus runs every 10 min.
Cary
On Fri, Jan 11, 2013 at 5:12 PM, Bill Dueber b...@dueber.com wrote:
Because it seems like it might be useful, I've started a publicly-editable
google map at
http://goo.gl/maps/LWqay
Right now, it has two points: the hotel and the conference
On Fri, Jan 11, 2013 at 05:51:17PM -0800, Cary Gordon wrote:
FWIW, the # 8 bus runs every 10 min.
Good point. It may be worth your while getting the 3 day pass for $US 14
http://www.transitchicago.com/travel_information/fares/unlimitedridecards.aspx
Not for traveling to the conference but any
Gah, I think I forgot to announce this on the list, but there's also
this google map:
https://maps.google.com/maps/ms?msid=213549257652679418473.0004ce6c25e6cdeb0319dmsa=0
which I put on the social page
http://wiki.code4lib.org/index.php/2013_social_activities
I'll go ahead and add the hotel and
35 matches
Mail list logo