Hi All,

I'm only just now "re-discovering" this thread, as I've found this same 
behavior in DSpace OAI-PMH servers. Here's how to see this behavior (and 
I've double checked it on http://demo.dspace.org, running 5.2):

1. Create an Item
2. Run the OAI-PMH import (./dspace oai import).
3. The item now appears in OAI-PMH
4. Withdraw that Item
5. Re-Run the OAI-PMH import (./dspace oai import)
6. The Item still appears in OAI-PMH and is NEVER flagged as withdrawn.

So, you can still access all its metadata, etc. The ONLY way to trigger 
an update to the OAI-PMH record (of the withdrawn item) is to re-import 
EVERYTHING (./dspace oai import -c). However, as noted, since withdrawn 
items are not included in the XOAI "indexAll()" command, the item will 
now disappear from OAI-PMH entirely.

Here's that indexAll() command:
https://github.com/DSpace/DSpace/blob/master/dspace-oai/src/main/java/org/dspace/xoai/app/XOAI.java#L191

This behavior seems to be counter to what is documented at:
https://wiki.duraspace.org/pages/viewpage.action?pageId=45548245#OAI-PMHDataProvider2.0%28Internals%29-Deletions

The documentation specifically states that "DSpace keeps a permanent 
record of withdrawn items". It also states that a request for a 
withdrawn item "will yield the 'record deleted' header".

This sounds like a bug to me. The documentation definitely does not 
match with the behavior.

I've created a new bug ticket for this. It will need a volunteer to resolve.

https://jira.duraspace.org/browse/DS-2593

- Tim

On 3/6/2015 5:21 AM, helix84 wrote:
> Yes, that makes sense. Still, it needs a brief verification because
> sometimes our understanding of a concept and its implementation may
> differ.
>
>
> On Fri, Mar 6, 2015 at 10:12 AM, Kristian Roberto Salcedo
> <[email protected]> wrote:
>> Hi Ivan,
>>
>> I might have overlooked something, but from your comments on 
>> https://jira.duraspace.org/browse/DS-2491 wouldnt it be meaningful
>> to do both your declaration change and Ondřej's suggestion in order to match 
>> both what is declared and what the documentation says about OAI and deleted 
>> items?
>>
>> regards,
>> Kristian
>>
>>> -----Original Message-----
>>> From: [email protected] [mailto:[email protected]] On Behalf Of
>>> helix84
>>> Sent: Thursday, March 05, 2015 3:46 PM
>>> To: Kristian Roberto Salcedo
>>> Cc: [email protected]; João Melo
>>> Subject: Re: [Dspace-tech] OAI-PMH data provider 2.0 not persistent?
>>>
>>> Hi Kristian,
>>>
>>> I think you're right that the declared
>>> <deletedRecord>persistent</deletedRecord> doesn't match how DSpace
>>> behaves. I filed a Jira issue and created a pull request to change the 
>>> declared
>>> status to <deletedRecord>transient</deletedRecord>.
>>>
>>> https://jira.duraspace.org/browse/DS-2491
>>>
>>>
>>> Regards,
>>> ~~helix84
>>>
>>> Compulsory reading: DSpace Mailing List Etiquette
>>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>>>
>>>
>>> On Thu, Mar 5, 2015 at 2:52 PM, Kristian Roberto Salcedo
>>> <[email protected]> wrote:
>>>> Hi all,
>>>>
>>>> In Dspace 4.2 we are currently seeing that our OAI feeds are not being
>>> persistent as per these definitions when it comes to deleted (withdrawn)
>>> items:
>>>>
>>>> http://www.openarchives.org/OAI/openarchivesprotocol.html#deletion
>>>> https://wiki.duraspace.org/pages/viewpage.action?pageId=34640887#OAI-
>>> P
>>>> MHDataProvider2.0(Internals)-Deletions
>>>>
>>>> Withdrawn items do not get the <header status=deleted> as they should,
>>> like in this example:
>>>>
>>>>
>>> http://webservices.itcs.umich.edu/mediawiki/oaibp/index.php/Deleted_Re
>>>> cord_Example_1
>>>>
>>>> Withdrawing an item is not reflected in the OAI entry in any way.
>>>>
>>>> ----------------------------------------------
>>>>
>>>> This is our Identify page at the moment:
>>>>
>>>> https://www.duo.uio.no/oai/request?verb=Identify
>>>>
>>>> ----------------------------------------------
>>>>
>>>> I believe we're doing everything right regarding config parameters and
>>> maintenance of the solr oai index.
>>>>
>>>> The only place I can find a "persistent"-parameter is in this config file:
>>>> /www/var/data/dspace/config/oaicat.properties
>>>> which I thought was deprecated, but we still set it just to be sure:
>>>> Identify.deletedRecord=persistent
>>>>
>>>> All other relevant config parameters are set in these two files as far as 
>>>> I can
>>> tell:
>>>>
>>>> /www/var/data/dspace/config/modules/oai.cfg
>>>> /www/var/data/dspace/config/crosswalks/oai/description.xml
>>>>
>>>> We are running the oai import -o command nightly.
>>>>
>>>> ----------------------------------------------
>>>>
>>>> The only way we've found to remove withdrawn items from the OAI feed is
>>> by running a complete re-indexing of the oai solr index:
>>>>
>>>> /www/var/data/dspace/bin/dspace oai import -c
>>>>
>>>> with a subsequent
>>>>
>>>> /www/var/data/dspace/bin/dspace oai clean-cache
>>>>
>>>> This removes a withdrawn item:
>>>>
>>>> https://www.duo.uio.no/handle/10852/42670
>>>>
>>>> completely from the feed:
>>>>
>>>>
>>> https://www.duo.uio.no/oai/request?verb=GetRecord&metadataPrefix=oai
>>> _d
>>>> c&identifier=oai:localhost:10852/42670
>>>>
>>>> which is not what we want...
>>>>
>>>> ----------------------------------------------
>>>>
>>>>
>>>> Is anyone else experiencing the same problem?
>>>>
>>>> Maybe I've missed something - If this actually works differently than we
>>> expect or has been addressed in OAI 2.1, please let me know.
>>>>
>>>>
>>>> regards,
>>>> Kristian Salcedo
>>>> Universitetet of Oslo Library
>>>> Department of digital services
>>>>
>>>>
>>>> ----------------------------------------------------------------------
>>>> -------- Dive into the World of Parallel Programming The Go Parallel
>>>> Website, sponsored by Intel and developed in partnership with Slashdot
>>>> Media, is your hub for all things parallel software development, from
>>>> weekly thought leadership blogs to news, videos, case studies,
>>>> tutorials and more. Take a look and join the conversation now.
>>>> http://goparallel.sourceforge.net/
>>>> _______________________________________________
>>>> DSpace-tech mailing list
>>>> [email protected]
>>>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>>> List Etiquette:
>>>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
> ------------------------------------------------------------------------------
> Dive into the World of Parallel Programming The Go Parallel Website, sponsored
> by Intel and developed in partnership with Slashdot Media, is your hub for all
> things parallel software development, from weekly thought leadership blogs to
> news, videos, case studies, tutorials and more. Take a look and join the
> conversation now. http://goparallel.sourceforge.net/
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette: 
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>

------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to