Allan Odgaard <[EMAIL PROTECTED]> wrote on 10/9/06 at
11:02 PM:
This raises two questions:
1. Should Markdown convert tabs to spaces in pre-formated text?
2. If yes, should Markdown be aware of multi-byte characters?
I’d say yes to #1 -- Markdown converts to (X)HTML which
does not define the tab size, and a good rule of thumb is to
always convert to spaces before publishing on the net.
For #1, that's exactly why it does it.
As for #2, Markdown doesn’t know the encoding of the source
document, so that would mean it can’t really be aware of
things such as UTF-8 mb sequences, OTOH if it changes my
pre-formatted text, I would like to have it do the right thing.
If Markdown.pl ever gains explicit support for text encodings, the
rules will be simple: UTF-8 in, UTF-8 out, no exceptions.
This would break the way some people are using it, I'm sure. I
don't really have much sympathy for people who are clinging to
other encodings, though.
I don't think the rules for the syntax (as opposed to the
implementation) need to mention it, though, at least not yet.
I say "yet" because from the get-go I've always considered using
non-ASCII punctuation characters for certain features.
I don't think there's any reason that someone couldn't write a
UTF-8 savvy Markdown implementation using the 1.0 syntax, though.
-J.G.
_______________________________________________
Markdown-Discuss mailing list
Markdown-Discuss@six.pairlist.net
http://six.pairlist.net/mailman/listinfo/markdown-discuss