There are several options - depending on the type of datasets. Can you provide 
a little more info? In the meantime - 

Have you checked out DCC and Dataverse?

http://www.dcc.ac.uk/resources/how-guides/cite-datasets

http://datascience.iq.harvard.edu/dataverse


Yvonne


-----Original Message-----
From: Code for Libraries [mailto:[email protected]] On Behalf Of Kyle 
Banerjee
Sent: Wednesday, July 23, 2014 4:29 PM
To: [email protected]
Subject: [CODE4LIB] Publishing large datasets

We've been facing increasing requests to help researchers publish datasets.
There are many dimensions to this problem, but one of them is applying 
appropriate metadata and mounting them so they can be explored with a regular 
web browser or downloaded by expert users using specialized tools.

Datasets often are large. One that we used for a pilot project contained well 
over 10,000 objects with a total size of about 1 TB. We've been asked to help 
with much larger and more complex datasets.

The pilot was successful but our current process is neither scalable nor 
sustainable. We have some ideas on how to proceed, but we're mostly making 
things up. Are there methods/tools/etc you've found helpful? Also, where should 
we look for ideas? Thanks,

kyle

Reply via email to