On Wednesday, 27 March 2019 at 13:39:07 UTC, Andrey wrote:
I have got some text with UTF-8. For example this part:
<title>ΠαÏάλληλη αναζήÏηÏη</title>
This looks like a UTF-8 sequence which has been UTF-8 encoded.
How to decode it to get this result?
<title>Παράλληλη αναζήτηση</title>
Undo the second UTF-8 encoding by transcoding the UTF-8 into an
8-bit character set (latin1, windows-1252 etc.) which you have to
guess.
I have tried functions like "decode", "byUTF", "to!wchar"...
but no success.
Input string is correct - checked it with
"https://www.browserling.com/tools/utf8-decode".
```decode.d
import std.stdio;
import std.encoding;
void main ()
{
string src = "<title>Î\u00a0αÏάλληλη
αναζήÏηÏη</title>";
Latin1String ls;
transcode (src, ls);
string targ = cast (string) ls;
targ.writeln;
}
```
$ ./decode
<title>Παράλληλη αναζήτηση</title>