Michael,
In a MARC UCS/Unicode UTF-8 environment, the Esc (0x1B) character doesn't serve any purpose
Correct re. the Esc character. The presence of an Esc is probably a good indication that the record is in MARC-8.
So, I'm wondering if for MARC record testing, it would make sense to tighten up the ASCII part of the regexp a bit to this: [\x1D-\x7E]
That would almost certainly do. I don't think I've ever seen a newline or a tab in a MARC record. However, knowing the amount of c**p we do get in records, it wouldn't surprise me
if one did appear somewhere. Ashley. -- Ashley Sanders [EMAIL PROTECTED] Copac http://copac.ac.uk A MIMAS Service funded by JISC