To be fair to those concerned about cost, a more conservative estimate
from the NSF RDLM workshop last summer in Princeton is $1,000 to $3,000
per terabyte per year for long term storage allowing for overhead in
moderate-sized institutions such as the PDB. Larger entities, such
as Google are able to do it for much lower annual costs in the range of
$100 to $300 per terabyte per year. Indeed, if this becomes a serious
effort, one might wish to consider involving the large storage farm
businesses such as Google and Amazon. They might be willing to help
support science partially in exchange for eyeballs going to their sites.
Regards,
H. J. Bernstein
At 1:56 PM -0600 10/25/11, James Stroud wrote:
On Oct 24, 2011, at 3:56 PM, James Holton wrote:
The PDB only gets about 8000 depositions per year
Just to put this into dollars. If each dataset is about 17 GB in
size, then that's about 14 TB of storage that needs to come online
every year to store the raw data for every structure. A two second
search reveals that Newegg has a 3GB hitachi for $200. So that's
about $1000 / year of storage for the raw data behind PDB deposits.
James
--
=====================================================
Herbert J. Bernstein, Professor of Computer Science
Dowling College, Brookhaven Campus, B111B
1300 William Floyd Parkway, Shirley, NY, 11967
+1-631-244-1328
Lab: +1-631-244-1935
Cell: +1-631-428-1397
y...@dowling.edu
=====================================================