[Dspace-tech] METS and DSpace

2008-12-12 Thread Andrew Marlow
Hello James, I saw your note at http://www.loc.gov/standards/mets/mets-registry.html about adding METS support to DSpace. I am new to DSpace and to digital libraries generally, so I could be wrong, but AFAIK DSpace does not support METS yet. Am I right? If so then I wonder what the status of this w

Re: [Dspace-tech] searching, PDFs, HTML and XML

2008-12-12 Thread Andrew Marlow
On Fri, Dec 12, 2008 at 3:31 PM, Shane Beers wrote: > Andrew: > Performing OCR on a PDF document is, as far as I know, the most widely used > method to search a PDF document. I see. I didn't know that. > Is there a specific reason you do not want the PDFs to be searchable? I ***do*** want th

Re: [Dspace-tech] searching, PDFs, HTML and XML

2008-12-12 Thread Brian Freels-Stendel
It should be able to filter these files. The OCR (text) is kept in a different 'layer': http://www.dclab.com/pdfconversion3.asp. Caveat is that there may be more ways to include OCR information that I don't know about. B-- >>> On 12/12/2008 at 1:11 PM, in message , "Thornton, Susan M. (LARC-

Re: [Dspace-tech] searching, PDFs, HTML and XML

2008-12-12 Thread Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
Question: If a .pdf document contains, let's say, 1 page in the middle of a document that contains an image (a drawing for instance), is filter-media going to fail on the filtering of this document or will it just skip the image and continue to filter what it can? I have made some mo

[Dspace-tech] DSpace Reaches Over 500 Instances Worldwide

2008-12-12 Thread Valorie Hollister
The DSpace platform now has over 500 instances in 60 different countries worldwide! DSpace continues to be the most popular repository solution, with over a third of the known institutional repositories using the DSpace software. Each month there are between 10 and 15 new instances of DSpace. C

Re: [Dspace-tech] searching, PDFs, HTML and XML

2008-12-12 Thread Mark H. Wood
On Fri, Dec 12, 2008 at 08:44:49AM +, Andrew Marlow wrote: > Now that I have loaded a few PDFs into my DSpace repo, I am wondering how to > enable full text searching. The PDFs happen to be in a form that means they > cannot be searched directly. So when I search in DSpace I get no results Do

Re: [Dspace-tech] searching, PDFs, HTML and XML

2008-12-12 Thread Shane Beers
Andrew: Performing OCR on a PDF document is, as far as I know, the most widely used method to search a PDF document. Is there a specific reason you do not want the PDFs to be searchable? Even the archival "standard" of PDF/A (archival PDF) allows for OCR. I use the commercial product ABBYY F

Re: [Dspace-tech] Zotero with Manakin Front End

2008-12-12 Thread Dorothea Salo
On Thu, Dec 11, 2008 at 9:42 PM, Diggory Mark wrote: > Dorothea, > > That would make an excellent contribution. We should consider bringing > it into the XMLUI base template library. Gank away. There may be other useful tidbits in that file as well, though a lot of it is... idiosyncratic. If I n

Re: [Dspace-tech] dc.subject when the item is a book?

2008-12-12 Thread Poulter, Dale
Andrew, It sounds like you have different templates. You will need to add the dc.subject to the template your collection is using in config/input-forms.xml. --Dale --- Dale Poulter Automation Coordinator Library Information Technology Services Vande
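As an illustration of the suggestion above, here is a minimal Python sketch that builds the kind of field entry one would add to a form in config/input-forms.xml. The element names are recalled from the stock DSpace file and should be checked against your own copy; the label, input type, and hint text are placeholder assumptions, and the function name is made up for this example.

```python
import xml.etree.ElementTree as ET


def subject_field() -> ET.Element:
    """Build a <field> element describing a repeatable dc.subject input.

    Element names follow the stock input-forms.xml as remembered here
    (an assumption); the label, input type and hint are placeholders.
    """
    field = ET.Element("field")
    entries = [
        ("dc-schema", "dc"),
        ("dc-element", "subject"),
        ("dc-qualifier", ""),
        ("repeatable", "true"),
        ("label", "Subject Keywords"),
        ("input-type", "onebox"),
        ("hint", "Enter one subject keyword or phrase per box."),
        ("required", ""),
    ]
    for tag, text in entries:
        ET.SubElement(field, tag).text = text
    return field
```

Calling ET.tostring(subject_field(), encoding="unicode") gives the XML text, which would then be placed inside the page of whichever form the collection is mapped to.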

[Dspace-tech] dc.subject when the item is a book?

2008-12-12 Thread Andrew Marlow
I just tried adding a book to my DSpace collection for the first time today. After adding it I went to edit the metadata and was surprised to see that, unlike journal articles, book metadata does not have dc.subject! How do I specify subject(s) for a book please? -- Regards, Andrew M.

Re: [Dspace-tech] pseudonym vs email-address

2008-12-12 Thread Bruc Liong
What could be done is: 1) the user in DSpace is assigned a pseudonym as identifier (e.g. using netid) 2) the user has his email address "updated" from Shibboleth (hence if he changes email, it will be reflected right away) I think this will satisfy your requirements (whereby there is a chance for
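The two steps above can be sketched in a few lines of Python. This is not DSpace code: the EPerson class, the login function, and the in-memory dictionary are hypothetical stand-ins for the eperson table and the authentication step, and the attribute values are assumed to arrive from the identity provider.

```python
from dataclasses import dataclass


@dataclass
class EPerson:
    netid: str   # stable pseudonymous identifier (hypothetical field names)
    email: str   # contact address, refreshed at every login


def login(people: dict, netid: str, email_from_idp: str) -> EPerson:
    """Look the user up by netid and refresh the stored email address."""
    person = people.get(netid)
    if person is None:
        # First login: create the account keyed by the pseudonym.
        person = EPerson(netid=netid, email=email_from_idp)
        people[netid] = person
    elif person.email != email_from_idp:
        # The address changed at the identity provider: update it in place.
        person.email = email_from_idp
    return person
```

Because the lookup key is the netid, a changed email address never creates a second account; it only updates the existing record.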

Re: [Dspace-tech] netid using for Shibboleth

2008-12-12 Thread Bruc Liong
Yup, that's where targetedID could go for as well. The only implication is that if the items need to be imported/exported to other repositories (SP), you may need to do some transformation for it (if you've chosen ePTID). Using principal name attribute could be another alternative. Bruc -Or

Re: [Dspace-tech] pseudonym vs email-address

2008-12-12 Thread François Parmentier
Hi. I can only speak for v1.4.2, but I don't think it is much different for 1.5.x: since email is a column of the eperson table (which also has an eperson_id column), why couldn't you change the email value instead of using a pseudonym? This would be better if the user could change the email himself.

[Dspace-tech] searching, PDFs, HTML and XML

2008-12-12 Thread Andrew Marlow
Hello, Now that I have loaded a few PDFs into my DSpace repo, I am wondering how to enable full text searching. The PDFs happen to be in a form that means they cannot be searched directly. So when I search in DSpace I get no results returned (unless the text also appears in the abstract I entered

Re: [Dspace-tech] netid using for Shibboleth

2008-12-12 Thread Andrea Bollini
Hi Jochen, that's exactly what it is for. Actually, LDAP auth uses it. Any custom authentication should use it instead of email if the external "username" is not an email address. Please note that netid needs to be unique. Best, Andrea Jochen Lienhard wrote: > Hi, > > I found the netid column

Re: [Dspace-tech] FW: HANDLE update issue

2008-12-12 Thread François Parmentier
I once rewrote the update-handle-prefix script to work with Oracle, instead of PostgreSQL (in version 1.4.2 of DSpace). I mainly replaced ---8<--- echo "update handle set handle=overlay(handle placing '$2' from 1 for $oldPrefixLen) where handle like '$1%';" | psql # update the metadatavalu
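For readers unfamiliar with the overlay call quoted above: in PostgreSQL, overlay(s placing r from 1 for n) replaces the first n characters of s with r. A minimal Python sketch of the same prefix swap follows; the function name and the handle values used to try it out are made up for illustration, and it is not the Oracle rewrite mentioned in the message.

```python
def replace_handle_prefix(handle: str, old_prefix: str, new_prefix: str) -> str:
    """Swap old_prefix for new_prefix at the start of a handle.

    Mirrors the effect of
        overlay(handle placing new from 1 for length(old))
    applied only to rows where the handle starts with the old prefix.
    """
    if not handle.startswith(old_prefix):
        # Rows that do not match the filter are left untouched.
        return handle
    return new_prefix + handle[len(old_prefix):]
```

For example, replace_handle_prefix("123456789/42", "123456789", "10673") keeps the "/42" suffix and changes only the prefix.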

[Dspace-tech] netid using for Shibboleth

2008-12-12 Thread Jochen Lienhard
Hi, I found the netid column in the eperson table. Can this be used for a login too instead of the email address? This could be the solution for the pseudonym login with Shibboleth. Greetings Jochen -- Dr. rer. nat. Jochen Lienhard UB Universität Freiburg Dezernat EDV Tel: (0761) 203-3908 Ema

Re: [Dspace-tech] dublin core and multiple subject categorizations

2008-12-12 Thread Andrew Marlow
On Thu, Dec 11, 2008 at 11:02 PM, Kim Shepherd wrote: > Hi Andrew, > > Metadata doesn't quite work this way. There may only be one element > defined in your DC schema, but it can be repeated in an item's metadata for > as many values as you need. > I didn't know that. DSpace has only one value
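Kim's point about repeating one element can be pictured with a small Python sketch. The list-of-pairs layout and the sample values are assumptions chosen for illustration, not DSpace's internal representation.

```python
from collections import defaultdict

# One item's metadata as (field, value) pairs. The same field name may
# appear any number of times; the sample values are invented.
item_metadata = [
    ("dc.title", "An Example Book"),
    ("dc.subject", "Digital libraries"),
    ("dc.subject", "Metadata"),
    ("dc.subject", "Institutional repositories"),
]


def values_by_field(metadata):
    """Group the values of repeated fields under their field name."""
    grouped = defaultdict(list)
    for field, value in metadata:
        grouped[field].append(value)
    return dict(grouped)
```

Here values_by_field(item_metadata)["dc.subject"] holds all three subject values, even though the schema defines dc.subject only once.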