2023年11月29日(水) 21:16 youkidearitai <[email protected]>:
>
> 2023年11月29日(水) 20:42 Hans Henrik Bergan <[email protected]>:
> >
> > i think Shift_JIS can also be automatically converted to UTF-8, does
> > this seem right?
> > https://github.com/divinity76/php2utf8/commit/6e08c4c16312961170cce821195816a8d24e23f6
> >
>
> Sorry if it's harsh, not right.
> Shift_JIS is very ambiguous, What will we do if SJIS-2004 or SJIS-win comes?
> How do we guess(detect) SJIS-2004, SJIS-win and SJIS-mac?
>
> <?php
> // Comparison table from https://uic.io/en/charset/compare/shiftjis2004/cp932/
> var_dump("\xfc\x40"); // What is 0xFC40, 騱(SJIS-2004) or 髜(SJIS-win)?
> ?>
>
> In the first place, We **should not** change PHP script character encoding.
> In addition to this, We have to think about various things.
> This is not just a Japanese problem.
>
> --
> ---------------------------
> Yuya Hamada (tekimen)
> - https://tekitoh-memdhoi.info
> - https://github.com/youkidearitai
> -----------------------------
I'm sorry if offend and reposting.
The problem is easy understand.
What do we detect ISO-8859 series?
<?php
var_dump("\xca"); // What is character?
?>
Regards
Yuya
--
---------------------------
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-----------------------------
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php