At 01:52 AM 3/15/02, Jens Stavnstrup wrote:
The editing has been done on a Unix platform with Emacs. Occasionally,
when copying text from a Word document, Saxon protests (actually
Aelfred protests), complaining about a bad continuation of multi-byte
On Fri, 15 Mar 2002, Christopher R. Maden wrote:
At 01:52 AM 3/15/02, Jens Stavnstrup wrote:
The editing has been done on a Unix platform with Emacs. Occasionally,
when copying text from a Word document, Saxon protests (actually
Aelfred
At 02:58 AM 3/15/02, Jens Stavnstrup wrote:
On Fri, 15 Mar 2002, Christopher R. Maden wrote:
1) Do all of your entities (i.e., files) have encoding declarations? What
are they? Remember that UTF-8 is the default unless you explicitly
specify
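
A minimal sketch of the point about encoding declarations, using Python's standard xml.etree.ElementTree (which wraps expat) rather than Saxon or Aelfred, so the exact error text will differ from what was reported in this thread. The sample element name and the helper parses() are made up for illustration. It feeds the parser the same ISO 8859-1 bytes twice, once without and once with an encoding declaration:

```python
import xml.etree.ElementTree as ET

# Hypothetical sample content: "é" in ISO 8859-1 is the single byte 0xE9.
body = "<para>caf\u00e9</para>".encode("iso-8859-1")

def parses(document: bytes) -> bool:
    """Return True if the byte string is accepted as well-formed XML."""
    try:
        ET.fromstring(document)
        return True
    except ET.ParseError:
        return False

# No declaration: the parser assumes UTF-8 and rejects the lone 0xE9 byte.
without_decl = body

# Explicit declaration: the same bytes are now read as ISO 8859-1.
with_decl = b'<?xml version="1.0" encoding="ISO-8859-1"?>' + body

print(parses(without_decl))
print(parses(with_decl))
```

Every external entity (each separate file) needs its own declaration; declaring the encoding only in the main document does not cover the files it pulls in.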
On Fri, 15 Mar 2002, Jirka Kosek wrote:
Jens Stavnstrup wrote:
Now I am going to let my colleagues loose on the document. They are going to
use a myriad of Windows editors (Word, Notepad, etc., in different language
versions), and I predict this is going to cause a lot of problems.
Does
Jens Stavnstrup wrote:
If your documents will contain a lot of characters outside ISO Latin 1
or ASCII, UTF-8 is the best choice, assuming that all the editors used can
deal with UTF-8.
Not really. The problem is basically that Word, which might be used to
edit the XML sources,
Christopher R. Maden wrote at 15 Mar 2002 02:06:47 -0800:
The parser obviously is not aware that you have chosen ISO 8859-1. That is
the expected error message if an 8859-1 document contains any high bytes
(128+) and the parser is trying to parse it as UTF-8.
1) Do all of your
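
The failure described above can be reproduced at the byte level without any XML parser. This is only a sketch under assumptions: the sample word and the helper decode_error() are invented for illustration, and Python's decoder stands in for the parser's own UTF-8 reader.

```python
# Hypothetical sample text containing one character above U+007F.
text = "na\u00efve"

# In ISO 8859-1 the character becomes a single high byte, 0xEF.
latin1_bytes = text.encode("iso-8859-1")

def decode_error(data: bytes, encoding: str):
    """Return the decoder's complaint, or None if decoding succeeds."""
    try:
        data.decode(encoding)
        return None
    except UnicodeDecodeError as exc:
        return exc.reason

# Read as ISO 8859-1, the bytes decode cleanly.
print(decode_error(latin1_bytes, "iso-8859-1"))

# Read as UTF-8, 0xEF announces a multi-byte sequence, but the next
# byte is plain ASCII rather than a continuation byte, so decoding fails.
print(decode_error(latin1_bytes, "utf-8"))
```

Pure ASCII text decodes identically under both encodings, which is why the problem only shows up once pasted text brings in a byte of 128 or above.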
On Fri, 15 Mar 2002, Christopher R. Maden wrote:
At 02:58 AM 3/15/02, Jens Stavnstrup wrote:
On Fri, 15 Mar 2002, Christopher R. Maden wrote:
1) Do all of your entities (i.e., files) have encoding declarations? What
are they? Remember
On Fri, 15 Mar 2002, Jirka Kosek wrote:
Jens Stavnstrup wrote:
If your documents will contain a lot of characters outside ISO Latin 1
or ASCII, UTF-8 is the best choice, assuming that all the editors used can
deal with UTF-8.
Not really. The problem is basically that Word,