On 08/03/2011, at 11:56 , Doug McNutt wrote: > There is nothing like an overzealous email client that defaults to changing > ASCII quotes to curly versions when the information being transmitted is a > bunch of shell script or C source.
Ahh, the subtle difference between "Smart Quotes" and curly quotes. > And more on topic it appears that Tidy is trying to be helpful. My experience > with such things is pretty much always bad. Microsoft Excel is terrible that > way. >From the web page of ‘demoronizer’ >(http://www.fourmilab.ch/webtools/demoroniser/): > A little detective work revealed that, as is usually the case when you > encounter something shoddy in the vicinity of a computer, Microsoft > incompetence and gratuitous incompatibility were to blame. Western language > HTML documents are written in the ISO 8859-1 Latin-1 character set, with a > specified set of escapes for special characters. Blithely ignoring this > prescription, as usual, Microsoft use their own "extension" to Latin-1, in > which a variety of characters which do not appear in Latin-1 are inserted in > the range 0x82 through 0x95--this having the merit of being incompatible with > both Latin-1 and Unicode, which reserve this region for additional control > characters. Thus my expectation is that “Remove bogus markup” is written by people who don’t mind throwing the baby (typographical punctuation) out with the bathwater (Microsoft’s “Smart Quotes”). They’re trying to be ‘helpful’ but in their typically overzealous manner they aren’t settling for tilting at windmills, they’re razing them. Rather than improve screen readers to be Unicode compatible, these people are dumbing down the content to suit the screen readers. Would they insist that Georges Seurat use only rectilinear grids for, “A Sunday Afternoon on the Island of La Grande Jatte”? > We are on the verge of HTML-5. Does any one know if such things as &xxx; are > going to get deprecated in favor of unicode? Apart from a small set, HTML 5 deprecates named entities in favour of Unicode numerical entities, expressed in decimal. That is, rather than ‘“’ or ‘“’ use ‘“’. I don’t follow the HTML5 community, so I’m not aware of any plans to deprecate entities in favour of straight Unicode. I prefer using Unicode characters, since that’s what I’m using in my day to day work. Alex -- You received this message because you are subscribed to the "BBEdit Talk" discussion group on Google Groups. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at <http://groups.google.com/group/bbedit?hl=en> If you have a feature request or would like to report a problem, please email "[email protected]" rather than posting to the group. Follow @bbedit on Twitter: <http://www.twitter.com/bbedit>
