Hi Ferran,

I have been working a bit with BibHarvest lately, and perhaps I can
help answer some of your questions. See comments below.

On Fri, 2011-04-01 at 14:00 +0200, Ferran Jorba wrote:
> After appying it localy, now I've forced another harvest, but it seems
> that it doesn't collect older-than-last-harvesting-time records, even if
> those records do not exist in my site.  In a very related way, we are
> not sure if older records of a this-oaiset-that-I'm-checking-now are
> going to be collected next harvesting session.

The OAI harvester looks in the table oaiHARVEST column 'lastrun' to
determine if lastrun + frequency > today. If true, no harvesting
happens. In your case, perhaps a dirty fix could be to do an explicit
UPDATE statement to change lastrun to a previous date, then run the
harvesting job again. E.g.

>>> run_sql("UPDATE oaiHARVEST SET lastrun = 'yyyy-mm-dd hh:mm:ss' WHERE
id=SOMEID")

> I can do a manual harvesting-converting-and-uploading (h-c-u) of the
> records that I've identified, no problem.  But I'd like to know how does
> Invenio decides that a record has to be collected for the two related
> scenarios that I've tried to explain in my previous paragraph.
> 
> Do I have to do any post-processing after doing my manual h-c-u action?
> Or, is there a way that I can feed a known list of local records (or
> remote identifiers) to oaiharvest?

I believe if you have a simple post-process workflow i.e. h-c-u, you
should be able to run single harvesting runs through OAI Harvest Admin
Interface. Only caveat is that it only accepts single(!) identifiers.

Support for these sorts of identifier lists is something that would be
nice to have, and hopefully something we can look into adding soon.
Similar updates to the interface and harvester is already planned. See
http://invenio-software.org/ticket/483 

(I see from the OAI v2.0 spec that harvesting lists of identifiers is
not directly supported by the protocol either)

Hope this helps.

Regards,
Jan

-- 
-------
Jan Age Lavik <[email protected]>
CERN Technical Student
Open-Access Group (GS-SIS-OA)

Phone: +41 22 767 9092
Office: 3-1-011

Reply via email to