> On Mon, Feb 16, 2004 at 10:20:30AM -0000, Simon Marlow wrote:
> > Wolfgang Jeltsch <[EMAIL PROTECTED]> writes:
> > > I meant non-ASCII characters in source code comments like this:
> > > {-|
> > > The execution time of this function is /n�/.
> > > -}
> > > Currently, Haddock seems to copy the bytes making up the
> > > non-ASCII character
> > > verbatim to the HTML file. But since the HTML file
> doesn't contain a
> > > character set specification, it is illformed and it depends
> > > on the browser how this situation is handled.
> >
> > It shouldn't be too hard to fix this, at least for Latin-1 (full
> > Unicode would be somewhat harder). I'll add it to the TODO list.
>
> While Haskell's source charset is specified as Unicode, Haskell source
> files don't specify the byte encoding they use, so any source
> file using non-ASCII characters isn't portable. Entrenching Latin-1
> would make the move to Unicode more difficult.
True, but GHC currently assumes Latin-1 as the encoding for source files. I don't
see it as entrenching Latin-1, just that we only accept Latin-1 encoded source files;
at some point in the future we might accept other encodings. Making the same
simplifying assumption in Haddock doesn't seem that big a deal.
Cheers,
Simon
_______________________________________________
Haskell mailing list
[EMAIL PROTECTED]
http://www.haskell.org/mailman/listinfo/haskell