Hi-
I'm wondering whether bitstreams with anonymous read access that belong to
locked-down items are at risk of being indexed and made available through
search engines. Do search engines stop at locked-down item pages, or is there
another way for them to reach and index readable bitstreams?

We have a number of thesis collections in which some items are publicly
available and others are locked down to our campus IP range. Over the
weekend I looked at implementing Peter Dietz's code for embedding the
Google Docs viewer:
https://wiki.duraspace.org/display/DSPACE/Document+Preview+with+Google+Docs+viewer

What I noticed was that if I was on campus and the bitstream in question was
locked down by IP, the viewer could not access the PDF, presumably because
the request for the document comes from a Google IP address rather than from
mine. The viewer does work when I am on campus and the item is locked down
but the bitstream is open.
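
For reference, here is a minimal sketch of how I understand the embed to
work. The bitstream URL is a made-up example, and the viewer endpoint is the
commonly used docs.google.com/viewer form, which is an assumption on my part
and not necessarily what Peter's code emits. The point is that the page only
hands Google a URL; Google's servers then fetch the PDF themselves, so the
request never comes from a campus IP.

```python
from urllib.parse import urlencode

# Assumed viewer endpoint; check it against the embed code you actually deploy.
VIEWER_ENDPOINT = "https://docs.google.com/viewer"


def viewer_embed_url(bitstream_url: str) -> str:
    """Build the iframe src that asks the viewer to render a document.

    The bitstream URL is passed as a query parameter, so the viewer
    service (not the reader's browser) is what downloads the file.
    """
    query = urlencode({"url": bitstream_url, "embedded": "true"})
    return f"{VIEWER_ENDPOINT}?{query}"


if __name__ == "__main__":
    # Hypothetical bitstream URL, for illustration only.
    print(viewer_embed_url(
        "https://dspace.example.edu/bitstream/123456789/1/thesis.pdf"
    ))
```

If that is right, any bitstream the viewer can display must be readable by
an anonymous request from outside our IP range.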

So, if I leave items restricted but open up their bitstreams so that they
work with the embedded viewer, is there any risk that the bitstreams will
get crawled and show up in Google results? I would love to implement the
viewer to make our users' lives easier, but it doesn't make sense if it
doesn't work for huge chunks of our collection or if enabling it exposes
private data.

Thanks!
Spencer

-- 
Spencer Lamm
Digital Initiatives Librarian
Swarthmore College Library
[email protected]
610-328-8541
_______________________________________________
Dspace-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-general