On Wed, 24 Jan 2001, Egon Schmid (@work) wrote:

> Jim Winstead wrote:
> 
> > it is in some character set other than utf-8. it needs to be in utf-8.
> > (the html output will be whatever is appropriate for the language, but
> > the xml needs to be in utf-8.)
> 
> There is no need for utf-8. It's a European language and could be
> handled just like the French or German manual. But you are right there
> are many non SGML characters in the Czech manual.
> 
> 
> [egon@dilbert;0]/osiris/sw/src/phpdoc$ make
> /usr/bin/jade  -d ./html.dsl -V use-output-dir -t sgml ./phpdocxml.dcl
> manual.xml
> /usr/bin/jade:./cs/bookinfo.xml:73:6:E: non SGML character number 154
> /usr/bin/jade:./cs/bookinfo.xml:73:14:E: non SGML character number 158
> /usr/bin/jade:./cs/bookinfo.xml:73:37:E: non SGML character number 154
> /usr/bin/jade:./cs/preface.xml:7:30:E: non SGML character number 158
> /usr/bin/jade:./cs/preface.xml:10:42:E: non SGML character number 158
> /usr/bin/jade:./cs/preface.xml:18:56:E: non SGML character number 158
> /usr/bin/jade:./cs/preface.xml:21:52:E: non SGML character number 158
> /usr/bin/jade:./cs/preface.xml:22:17:E: non SGML character number 158
> /usr/bin/jade:./cs/preface.xml:28:31:E: non SGML character number 158
> /usr/bin/jade:./cs/preface.xml:31:22:E: non SGML character number 158
> /usr/bin/jade:./cs/preface.xml:40:7:E: non SGML character number 154
> /usr/bin/jade:./cs/preface.xml:42:22:E: non SGML character number 158
> [...]
> 
> The first non SGML character is in cs/bookinfo.xml line 73 column 6.
> Roman Neuhauser aka Cynic can you look for it. Do you have a Czech
> keyboard? I have no problems with my German keyboard and Umlauts. 

Oh, I think I guess what the problem is, it's called 'Microsoft'. They
seem to have placed some characters used in the Czech language in the
range 0x80-0x9f, not in in the official ISO-8859-2. Although, Hungarian
uses the same ISO character set, Goba is using it without any
difficulties, and he's on Windows platform too. I don't know very
much about Windows, does it have it's own character set for each language
that doesn't use ISO-8859-1 or what? Maybe Goba is just lucky, or knows
something we don't. Cynic, I think you should contact him.

-- Jouni

Reply via email to