On 15.12.2010 10:21, Ferran Jorba wrote:

Hi!

When importing from external OAI sources, I'm finding records with empty
fields, for example:

     <datafield tag="500" ind1="" ind2="">
       <subfield code="a" />
     </datafield>

[...]
I'm seeking some advice about how, where and when to deal with them.
Should it be done during the Dublin Core to MarcXML conversion
(say, etc/bibconvert/config/ojs2marcxml.xsl) or in the MarcXML parser
(lib/python/invenio/bibrecord.py), either in the general function
(create_record) or in each of the low-level parsers (create_record_RXP,
create_record_minidom, create_record_4suite)?

After some thought, I think it would be nice to have a "Marc Sanitiser"
that takes care of issues like the above before ingest. One can imagine
quite a few reasons why such records exist. As far as I can see, they are
valid XML, just not valid Marc.

--

Kind regards,

Alexander Wagner
Subject Specialist
Central Library
52425 Juelich

mail : [email protected]
phone: +49 2461 61-1586
Fax  : +49 2461 61-6103
http://www.fz-juelich.de/zb/mitarbeiter/fachinformation#wagner


