Re: Emacs puts binary junk into the clipboard, marking it as text

Stefan Monnier Tue, 19 Sep 2006 09:16:48 -0700

>> Also IIRC a perfectly valid utf-8 buffer may contain eight-bit-* chars, use
>> to keep track of valid unicode chars that have no corresponding character in
>> emacs-mule.  So the presence of eight-bit-* chars does not imply that the
>> utf-8 encoded form of the text will contain an invalid utf-8 byte sequence.


> Yes, but such eight-bit-* chars can be detected by checking
> `untranslated-utf-8' property.

Sure, but the current code doesn't do that.

>> > And, if Emacs owns a unibyte string, perhaps the right thing
>> > is to make it multibyte according to the current
>> > lang. env. (by string-make-multibyte) at first, then encode
>> > it by utf-8.

>> That sounds terribly fragile/buggy.

> Then, what do you think Emacs should do in such a case?

I think we can't know what should be done, so we should strive for
simplicity and try to avoid losing information.  I.e. just return the
unibyte string as-is.


        Stefan


_______________________________________________
emacs-pretest-bug mailing list
emacs-pretest-bug@gnu.org
http://lists.gnu.org/mailman/listinfo/emacs-pretest-bug

Re: Emacs puts binary junk into the clipboard, marking it as text

Reply via email to