[Dspace-tech] Ideal hardware configuration for the server

2009-02-24 Thread Ayesha Khatoon
Dear Professionals, We are creating a Digital Online Repository on Indian Cultural Heritage. We have approx 3.75 lac digital records of 50 Terabytes in size, in the form of text, image, audio, video and using Fedora Core 4 and dspace-1.4.2. Kindly suggest an ideal configuration (hardware) for the s

Re: [Dspace-tech] script to validate all PDFs ?

2009-02-24 Thread Larry Stone
> Does anyone have a script that checks all of the previously uploaded > PDFs and find ones that are malformed and reports their URLs/record IDs? I think it's most appropriate to do this with the MediaFilter mechanism. The default DSpace (1.5.1) distribution includes the plugin: org.dspace.app.me

Re: [Dspace-tech] citations, journals, volumes, issues, , articles and dublin core

2009-02-24 Thread Scott Yeadon
Hi Mark, In a DSpace context I think a major problem is lack of agreement on, or default implementation for a storage model for various classes of content (typically aggregations). If this could be achieved then value-add services could be far more easily connected - dissemination services suc

Re: [Dspace-tech] script to validate all PDFs ?

2009-02-24 Thread Kim Shepherd
Hi Stuart, Example assetstore file: ${dspace.dir}/assetstore/95/80/98/95809816172544348784747013964495251419 The filename itself is in bitstream.internal_id in the dspace database, and the directory names are just the first 6 numbers of the internal ID. Here's a SQL query that resolves interna

[Dspace-tech] script to validate all PDFs ?

2009-02-24 Thread stuart yeates
Does anyone have a script that checks all of the previously uploaded PDFs and find ones that are malformed and reports their URLs/record IDs? I can see how to write a script that uses the unix command line 'file' and 'pdftops' tools to check that every file that looks like a PDF is a good and v

[Dspace-tech] institutional repository ranking and page rank

2009-02-24 Thread Claudia Jürgen
Hi all, most of you are aware of the repository ranking: http://repositories.webometrics.info/index.html Due to a question on GUDE I was curious about the relation of this ranking to page ranking and did a check on the top 300 institutional repositories in the CSIC ranking: Of the top 100 inst

Re: [Dspace-tech] robots.txt question

2009-02-24 Thread Tim Donohue
Here's another example similar to Keith's...This is something I'm planning to add as an out-of-the-box setting for DSpace XMLUI (so it should be default for the 1.5.2 release). In this case, you want to add this code to your *theme's* sitemap.xmap file. I'd suggest adding it near the top, dire

Re: [Dspace-tech] robots.txt question

2009-02-24 Thread Keith Gilbertson
Hi Eric. One way to do this could be to make a small change to the sitemap.xmap file in the xmlui directory. Here's an example from the non-bitstream pipeline in the sitemap.xmap file of one of our DSpace instances: In our case, we placed

Re: [Dspace-tech] Manakin Documentation

2009-02-24 Thread Mark H. Wood
On Mon, Feb 23, 2009 at 07:23:29PM +, K. Jones wrote: > Does anyone know a good link or site that has documentation for Manakin in > Dspace 1.5. > > I need some information for getting the xmlui running under tomcat. You mean, beyond Chapter 5 section 5 ("XMLUI Interface Customizations (Man

[Dspace-tech] robots.txt question

2009-02-24 Thread Eric Luhrs
I'm trying to figure out how to add a robots.txt file to xmlui. I tried placing it in [dspace]/webapps/xmlui (which is where Tomcat's ROOT points) but that didn't work. When I attempt to access it, I get the standard "Page not found" message. Do I need to create a static manakin page that overri

Re: [Dspace-tech] citations, journals, volumes, issues, articles and dublin core

2009-02-24 Thread Mark H. Wood
May I suggest that we should never, never, never! get used to being shocked and surprised by some of the aspects of digital libraries, but rather to the warm feeling of having done something about them. It seems to me that metadata support for journal articles, while fundamental, is the least of o