"Egon Schmid (@work)" wrote:
> 
> Jim Winstead wrote:
> 
> > it is in some character set other than utf-8. it needs to be in utf-8.
> > (the html output will be whatever is appropriate for the language, but
> > the xml needs to be in utf-8.)
> 
> There is no need for utf-8. It's a European language and could be
> handled just like the French or German manual. But you are right there
> are many non SGML characters in the Czech manual.

I looked briefly at files with Czech translations. They are not in fact
correct XML files. They should start with correct encoding declaration
on the first line of each file:

<?xml encoding="windows-1250"?>

Jade silently ignores this error, but other XML based tools may have big
problems with it.

> The first non SGML character is in cs/bookinfo.xml line 73 column 6.
> Roman Neuhauser aka Cynic can you look for it. Do you have a Czech
> keyboard? I have no problems with my German keyboard and Umlauts.

I look at the files, and they are using windows-1250 instead of
iso-8859-2 encoding (both encodings are valid for Czech langauge). I
recommend to switch to iso-8859-2. This encoding is ISO standard and
Jade will not produce errors, because iso-8859-2 is not using characters
in 128-160 positions.

Another argument may be, that there are plenty of editors on Windows,
which are able to work with iso-8859-2, but not many Unix editors are
capable of handling windows-1250.

And finally, if Jade encoding method (SP_ENCODING) is set to XML, it
should automatically recognize iso-8859-2. Windows-1250 is not
recognized, because Jade uses incorecct encoding name Windows. AFAIK.

Hope helps.

Jirka

-----------------------------------------------------------------
  Jirka Kosek                        
  e-mail: [EMAIL PROTECTED]
  http://www.kosek.cz

Reply via email to