On Thu 08 Mar 2018 17:06:12 Peter Schaffter wrote:
> This seems to be an "on again, off again" bug. We discussed adding
> LC_ALL=C to the command string in pdfmom, but I see it's not there.
> Deri--any objections to adding it? The alternative is to pass the
> -a flag to the various greps, but Steffen Nurpmeso pointed out that
> -a is not standardized.

I've got an example which is meant to show the problem (camus.mom), but
unfortunately I can't make it generate the error which others are seeing.
camus.mom is a UTF-8 file. I have tried it with -k in a UTF-8 user account
(LC_CTYPE=en_GB.UTF-8) and with -Kutf8 in an old-style account
(LC_CTYPE=en_GB); neither run produced an error from grep.

This leads me to suspect there is something in my version of grep which
"understands" that UTF-8 files are not binary data. I believe compiling
grep with NLS support is optional, so maybe the people who get this error
are using a grep built without language support.

One possible solution would be to replace the greps with a perl regexp,
but I also have no objection to LC_ALL=C, if this fixes it. Just take care
that it applies only to the greps and not to anything else.

Cheers

Deri
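
P.S. To illustrate what "only applies to the greps" could look like, here is a minimal shell sketch. It is not the actual pdfmom command string: the temporary sample file and the variable names (sample, before, after, count) are made up for the illustration. It relies on the standard shell behaviour that a variable assignment written in front of a command is passed to that one command only.

```sh
#!/bin/sh
# Hypothetical stand-in for a UTF-8 source file (not the real camus.mom).
sample=$(mktemp)
printf 'L\303\251tranger by Camus\n' > "$sample"   # \303\251 = UTF-8 e-acute

# Remember the caller's setting so we can compare afterwards.
before=${LC_ALL-unset}

# The LC_ALL=C prefix is placed in the environment of this one grep only.
count=$(LC_ALL=C grep -c 'Camus' "$sample")

# The surrounding shell (and anything run later) keeps its own locale.
after=${LC_ALL-unset}

echo "matching lines: $count"
echo "LC_ALL before: $before, after: $after"

rm -f "$sample"
```

Written this way, groff and the other commands in the pipeline still see the user's locale, which would not be the case if LC_ALL=C were exported for the whole command string.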