[issue10733] plistlib rejects strings containing control characters
Behdad Esfahbod added the comment: Replacing all control characters by entities before trying to parse the Plist XML would likely be the best way forward. That wouldn't work. Control characters are disallowed in XML's character set, so they are invalid even if input as entities. Unfortunately this causes a lot of trouble for clients [0], because it means that XML cannot represent the full Unicode repertoire. I'm curious about alternates. Perhaps the expat module can be extended to allow recovering from this if the client chooses to... [0] eg. https://github.com/behdad/fonttools/issues/249 -- nosy: +Behdad.Esfahbod ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Changes by Ezio Melotti ezio.melo...@gmail.com: -- versions: +Python 3.3 -Python 3.1 ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Terry J. Reedy tjre...@udel.edu added the comment: Mitchell: 2.6 is closed to revision except for security issues -- nosy: +terry.reedy resolution: wont fix - versions: +Python 3.2 -Python 2.6 ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Changes by Terry J. Reedy tjre...@udel.edu: -- nosy: -terry.reedy ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Mitchell Model m...@acm.org added the comment: Thanks for letting me know (and with a personalized message, yet!). I wasn't paying attention -- i verified that the problem exists in 2.7 and 3.1 and I just dragged 3.1 down to 2.6. Although I've been working furiously in Python for the past six months, I haven't been writing or teaching so I haven't been combing the documentation or testing examples or using obscure or forgotten features, which together are the source of nearly all my bug reports. So I just automatically did what I used to do. I understand the issue and difference; I just didn't realize 2.6 was closed -- I didn't really mean that this should be fixed in anything other than the current or even next release of anything. On Dec 24, 2010, at 3:36 PM, Terry J. Reedy wrote: Terry J. Reedy tjre...@udel.edu added the comment: Mitchell: 2.6 is closed to revision except for security issues -- nosy: +terry.reedy resolution: wont fix - versions: +Python 3.2 -Python 2.6 ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Ronald Oussoren ronaldousso...@mac.com added the comment: I agree with Martin that this is a tricky one. The file is problematic because it is invalid XML[1], however Apple's tools are perfectly happy to proces the file and as Mitchell notes plistlib exists to interoperate with Apple's plist files. I'm therefore reopening the issue, but with a low priority. It is unlikely that I'll work on this in the near future though. Replacing all control characters by entities before trying to parse the Plist XML would likely be the best way forward. A patch (including testcases) would definitely be appreciated. BTW. I've checked that Apple's Cocoa libraries will read the file, this is not just a bug in the xml1 output formatter of plutil. Using PyObjC: from Foundation import NSDictionary d = NSDictionary.dictionaryWithContentsOfFile_('com.apple.Terminal.plist') [1] It is invalid XML because it contains control characters which are invalid according to the XML specification (http://www.w3.org/TR/2000/REC-xml-20001006#NT-Char). -- priority: normal - low stage: committed/rejected - unit test needed status: closed - open ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Changes by R. David Murray rdmur...@bitdance.com: -- resolution: - wont fix stage: - committed/rejected status: open - closed ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
New submission from Mitchell Model m...@acm.org: plistlib rejects control characters found in XML plists that Apple's 'plutil lint' accepts. I have attached my Terminal preferences as an example. (plistlib accepts the contents of the default Terminal preferences file) -- assignee: ronaldoussoren components: Library (Lib), Macintosh files: com.apple.Terminal.plist messages: 124311 nosy: MLModel, ronaldoussoren priority: normal severity: normal status: open title: plistlib rejects strings containing control characters type: behavior versions: Python 2.6, Python 2.7, Python 3.1 Added file: http://bugs.python.org/file20104/com.apple.Terminal.plist ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Martin v. Löwis mar...@v.loewis.de added the comment: This is tricky. It's clearly ill-formed XML, so I'm not sure that this needs to be bug-compatible with Apple's implementation. -- nosy: +loewis ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
[issue10733] plistlib rejects strings containing control characters
Mitchell Model m...@acm.org added the comment: I can see where that does make it tricky. (I also tried reading the plist after opening the file as binary, but no luck.) The problem here, of course, is that the only reason for the existence of this library is to read Apple's plist files, however XML-invalid some may be. (It is only a small number of my very many .plist files that have invalid characters -- I just happened to pick one of them to try to access in order to print a simple summary of its contents.) I guess since the plist is read using xml.parsers.expat, there's not much that can be done, and it wouldn't be worth anyone's time to hack around this for plistlib, especially since nearly all .plist files appear to be conforming. Thanks for the clarification. -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10733 ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com