Hello,
I can reproduce this.
C:\Users\ruipb>R -q -e "writeClipboard('categoría'); sessionInfo()"
> writeClipboard('categoría'); sessionInfo()
R version 4.2.1 (2022-06-23 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 22000)
Matrix products: default
locale:
[1] LC_COLLATE=Portuguese_Portugal.utf8 LC_CTYPE=Portuguese_Portugal.utf8
[3] LC_MONETARY=Portuguese_Portugal.utf8 LC_NUMERIC=C
[5] LC_TIME=Portuguese_Portugal.utf8
attached base packages:
[1] stats graphics grDevices utils datasets methods base
loaded via a namespace (and not attached):
[1] compiler_4.2.1
# quoting Andrew: Pasting the result into this e-mail message yields
categorÃa
And with the same sessionInfo() output
R -q -e "writeClipboard('categoría', format = 13)"
# <Ctrl+V> paste clipboard here
categoría
Hope this helps,
Rui Barradas
Às 14:05 de 30/09/2022, Andrew Hart via R-help escreveu:
Hi everyone,
Recently I upgraded to R 4.2.1 which now uses UTF-8 internally as its
native encoding. Very nice. However, I've discovered that if I use
writeClipboard to try and move a string containing accented characters
to the Windows clipboard and then try and paste that into another
application (e.g. notepad, Eclipse, etc.), the accents turn out all
garbled. Here's an example:
writeClipboard("categoría")
Pasting the result into this e-mail message yields
CategorÃa
As near as I can tell, the problem seems to have something to do with
the format parameter of writeClipboard. By default, format has a value
of 1, which tells the clipboard to receive Text in the machine's locale.
If I set format=13 in the call, the accents transfer to the clipboard
correctly:
writeClipboard("categoría", format=13)
and the result is
Categoría
It seems that format=13 may be a better default now that R is using
UTF-8. It would be nice not to have to specify the format every time I
want to copy text to the clipboard with writeClipboard.
Is writeClipboard supposed to perform any kind of encoding conversion or
is the format parameter merely informing the clipboard of the kind of
payload it's being handed?
Btw, with pre-4.2.0 versions of R, this wasn't a problem. I am very much
in favour of R using some kind of Unicode encoding natively, but this
wrinkle seems to be something the user shouldn't have to deal with since
the Windows clipboard is capable of holding Unicode text. Any advice
would be gratefully received.
Thanks,
Andrew.
______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.