Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Christopher R. Maden
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 At 01:52 AM 3/15/02, Jens Stavnstrup wrote: The editing have been done on a Unix platform with Emacs. Occasionally, when copying text from a word document, Saxon protests (actually Aelfred protests), complaining over bad continuation of multi-byte

Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Jens Stavnstrup
On Fri, 15 Mar 2002, Christopher R. Maden wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 At 01:52 AM 3/15/02, Jens Stavnstrup wrote: The editing have been done on a Unix platform with Emacs. Occasionally, when copying text from a word document, Saxon protests (actually Aelfred

Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Christopher R. Maden
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 At 02:58 AM 3/15/02, Jens Stavnstrup wrote: On Fri, 15 Mar 2002, Christopher R. Maden wrote: 1) Do all of your entities (i.e., files) have encoding declarations? What are they? Remember that UTF-8 is the default unless you explicitly specify

Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Jens Stavnstrup
On Fri, 15 Mar 2002, Jirka Kosek wrote: Jens Stavnstrup wrote: Now I am going to release my colleague on the document. They are going to use a myriad of windows editors (Word, Notepad, etc in different language versions), and I predict this is going to cause a lot of problems. Does

Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Jirka Kosek
Jens Stavnstrup wrote: If your documents will contain a lot of character outside of ISO Latin 1 or ASCII using UTF-8 is best choice, assuming that all editors used can deal with UTF-8. Not really, the problem is basically, that Word, which might be used to to edit the XML sources,

Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Tony Graham
Christopher R. Maden wrote at 15 Mar 2002 02:06:47 -0800: The parser obviously is not aware that you have chosen ISO 8859-1. That is the expected error message if an 8859-1 document contains any high bytes (128+) and the parser is trying to parse it as UTF-8. 1) Do all of your

Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Jens Stavnstrup
On Fri, 15 Mar 2002, Christopher R. Maden wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 At 02:58 AM 3/15/02, Jens Stavnstrup wrote: On Fri, 15 Mar 2002, Christopher R. Maden wrote: 1) Do all of your entities (i.e., files) have encoding declarations? What are they? Remember

Re: DOCBOOK-APPS: Choosing a characterset for DocBook

2002-03-15 Thread Jens Stavnstrup
On Fri, 15 Mar 2002, Jirka Kosek wrote: Jens Stavnstrup wrote: If your documents will contain a lot of character outside of ISO Latin 1 or ASCII using UTF-8 is best choice, assuming that all editors used can deal with UTF-8. Not really, the problem is basically, that Word,