On 22/12/10 12:13, Anthony wrote: > On Tue, Dec 21, 2010 at 7:51 PM, Tim Starling <tstarl...@wikimedia.org> wrote: >> In XML 1.1: >> >> "Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | >> [#xE000-#xFFFD] | [#x10000-#x10FFFF] /* any Unicode character, >> excluding the surrogate blocks, FFFE, and FFFF. */" > > Where are you reading that? At http://www.w3.org/TR/xml11/#charsets I read:
Ah yes, that would be the XML 1.0 spec. My fault. [...] > >> Without this change, importDump.php gives a fatal error. > > Have you tried escaping them? Does importDump.php work with XML 1.1, > or only XML 1.0? Is the file defined as XML 1.1 or XML 1.0? If the > file is designated as XML 1.1 (*), the control characters are escaped, > and importDump.php still gives a fatal error, it sounds like a bug in > importDump.php. I provided both versions of the XML if you want to muck around with that. I don't think there's much historical value in the control characters. Speaking of historical value, I found the argument between Lars Aronsson and Larry Sanger, which caused Lars to quit and found susning.nu. It happened on May 21. Lars had just spent several days writing dictionary-like articles, and he wrote [[Short words]] to organise the effort. At 12:48, Larry complained about this on [[LA2]], and at 12:53, he created [[Wikipedia is not a dictionary]], which was clearly an attack on what Lars was doing. At 12:54, Lars announced that he was leaving, as a comment on [[Wikipedia is not a dictionary]]. There were several responses. On July 24, Larry erased all the comments from [[Wikipedia is not a dictionary]], and on July 28, he rewrote the original text, toning down the language. Before I found this backup, the earliest version we had of this policy page was from August 17. -- Tim Starling _______________________________________________ WikiEN-l mailing list WikiEN-l@lists.wikimedia.org To unsubscribe from this mailing list, visit: https://lists.wikimedia.org/mailman/listinfo/wikien-l