2024年10月5日(土) 1:20 Tim Düsterhus :
>
> Hi
>
> Am 2024-09-25 09:21, schrieb youkidearitai:
> > I tried implement mb_levenshtein function and create an RFC.
> > https://wiki.php.net/rfc/mb_levenshtein
> > https://github.com/php/php-src/pull/16043
> >
> > I w
://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
f grapheme function was missing.
I think grapheme functions is still lacks functionality.
I would like to add grapheme function.
PHP's Unicode support is still not sufficient.
I would like to strengthen PHP's Unicode support.
After a while, I have plans I would like add RFC for Unicode functions.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
apheme clusters.
UTF-8 can't solve everything, what to program is important for
programmers (byte programming, character programming etc).
Also, other character encodings are also important in mainly CJK.
Character set has a lot of consider of many things.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
functions?)
https://www.php.net/manual/en/book.mbstring.php
There is no consistent solution I think, because you have to think a
lot about multibyte characters.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
2024年7月1日(月) 21:43 youkidearitai :
>
> 2024年7月1日(月) 21:19 Ilija Tovilo :
> >
> > Hi Yuya!
> >
> > On Sat, Jun 29, 2024 at 11:11 PM youkidearitai
> > wrote:
> > >
> > > 2024年4月9日(火) 7:55 youkidearitai :
> > > >
&g
2024年7月1日(月) 21:19 Ilija Tovilo :
>
> Hi Yuya!
>
> On Sat, Jun 29, 2024 at 11:11 PM youkidearitai
> wrote:
> >
> > 2024年4月9日(火) 7:55 youkidearitai :
> > >
> > > Mainly review and approve pull request to mbstring extension.
> > > (prob
2024年4月9日(火) 7:55 youkidearitai :
>
> Mainly review and approve pull request to mbstring extension.
> (probably everything related to Unicode and other character encoding)
> Alex Dowad (alexdowad) suggested that give to me.
> https://github.com/php/php-src/pull/13906#issuecom
sr\log\LoggerInterface". The problem with this is that it's not
> really going to save CPU nor memory because it still has to lowercase
> the namespace.
>
> We could refactor the engine to store the namespace separately from
> the type name. This is a lot more work and will increase the size of
> some types, which might be difficult at a technical level.
>
> I can't think of other implementations right now. If nobody can come
> up with a better implementation, I think we should consider going with
> split-sensitivity on namespaces where it matches the sensitivity of
> the thing it is attached to. A namespaced class would have a case
> sensitive namespace but a namesped function would still have a case
> insensitive one.
Hi
I'm worried that have an impact on Windows (case-insensitive file
system). Even if it's only the Class name.
Looks like need to more discussion.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
s when exactly the RFC has been approved.
It was not good. Restored original an RFC.
Thank you
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
2024年4月12日(金) 18:32 Nicolas Grekas :
>
> Hi
>
> Le jeu. 4 avr. 2024 à 07:41, youkidearitai a écrit :
>>
>> 2024年4月4日(木) 6:30 Tim Düsterhus :
>> >
>> > Hi
>> >
>> > On 4/3/24 10:02, youkidearitai wrote:
>> > > Therefore, I thi
2024年3月26日(火) 23:49 youkidearitai :
>
> 2024年3月26日(火) 21:58 Peter Kokot :
> >
> > On Tue, 26 Mar 2024 at 06:41, youkidearitai wrote:
> > >
> > > Hi, Internals
> > >
> > > Sorry I mistake.
> > > Send again.
> > >
> > &g
(tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
2024年4月4日(木) 6:30 Tim Düsterhus :
>
> Hi
>
> On 4/3/24 10:02, youkidearitai wrote:
> > Therefore, I think require an RFC, I have written a draft an RFC that
> > fixes these issues.
> > https://wiki.php.net/rfc/mb_trim_change_characters
>
> I don't think
/mb_trim_change_characters
Feel free to comments on this.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
ards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
2024年3月26日(火) 21:58 Peter Kokot :
>
> On Tue, 26 Mar 2024 at 06:41, youkidearitai wrote:
> >
> > Hi, Internals
> >
> > Sorry I mistake.
> > Send again.
> >
> > grapheme_str_split going to "Voting" phase.
>
Hi, Internals
Sorry I mistake.
Send again.
grapheme_str_split going to "Voting" phase.
Vote end is 10th April 00:00 GMT
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
2024年3月26日(火) 5:43 David CARLIER :
>
> I second this, I think it is a good addition which makes a lot of sense.
>
> Cheers.
>
> On Mon, 25 Mar 2024 at 20:36, Ayesh Karunaratne wrote:
>>
>> >
>> > 2024年3月9日(土) 15:26 youkidearitai :
>> > >
>&
2024年3月9日(土) 15:26 youkidearitai :
>
> Hello, Internals
>
> I created an wiki for `grapheme_str_split` function.
> Please see:
> https://wiki.php.net/rfc/grapheme_str_split
>
> I would like to "Under Discussion" section.
>
> Best Regards
> Yuya
>
&g
github.com/youkidearitai
-
2024年2月21日(水) 9:16 youkidearitai :
>
> 2024年2月19日(月) 18:55 youkidearitai :
> >
> > -- Forwarded message -----
> > From: youkidearitai
> > Date: 2024年2月19日(月) 18:46
> > Subject: Re: [PHP-DEV][VOTE][RFC] mb_ucfirst and mb_lcfirst functions
&g
2024年3月6日(水) 18:42 Niels Dossche :
>
> On 06/03/2024 01:37, youkidearitai wrote:
> > 2024年3月6日(水) 9:22 youkidearitai :
> >>
> >> Hi, Larry
> >> Hi, Niels
> >>
> >> 2024年3月6日(水) 6:47 Niels Dossche :
> >>>
> >>> Hi Lar
rily familiar with the internals of Zend Engine, but
I'm proud that I've taken particular care in maintaining versions.
Also, My company (Cybozu Inc.) I currently work for understands my PHP
Internals activities and gives me time, so I would like to contribute
further.
Regards
Yuya Hamada
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
2024年3月6日(水) 9:22 youkidearitai :
>
> Hi, Larry
> Hi, Niels
>
> 2024年3月6日(水) 6:47 Niels Dossche :
> >
> > Hi Larry
> > Hi Yuya
> >
> > So first of all, I meant the error handling in cases like these:
> > h
t; The implementation still returns NULL here, so the signature is still
> incorrect. Either it should return false to match the other functions, or
> throw something and not return a value.
>
> On 05/03/2024 18:40, Larry Garfield wrote:
> > On Tue, Mar 5, 2024, at 7:25 AM, youk
//tekitoh-memdhoi.info
> - https://github.com/youkidearitai
> -
Sorry, again.
I checked behavior of mb_str_split function. So Illegal byte sequences
are returned as is.
```
sapi/cli/php -r 'var_dump(mb_str_split("あ\xc2\xf4\x80あ"));'
array(4) {
[0]=
ems you return NULL on failure, but the
> return type in the signature isn't nullable.
> Also, from a quick look, it seems other functions return false instead of
> null on failure. So perhaps the return type should be array|false.
>
> What do you think? :)
>
> Kind rega
ctors using ICU.
If it's fine, I'll create an RFC.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
; Tim Düsterhus
Hi, Internals
There are times when I explore archives (mbstring and character code
etc), so it's helpful to have something left in communication form.
Therefore a mailing list would be better for that.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
2024年2月19日(月) 18:55 youkidearitai :
>
> -- Forwarded message -
> From: youkidearitai
> Date: 2024年2月19日(月) 18:46
> Subject: Re: [PHP-DEV][VOTE][RFC] mb_ucfirst and mb_lcfirst functions
> To:
>
>
> 2024年2月7日(水) 5:19 youkidearitai :
> >
&g
-- Forwarded message -
From: youkidearitai
Date: 2024年2月19日(月) 18:46
Subject: Re: [PHP-DEV][VOTE][RFC] mb_ucfirst and mb_lcfirst functions
To:
2024年2月7日(水) 5:19 youkidearitai :
>
> 2024年2月7日(水) 4:49 youkidearitai :
> >
> > 2024年2月7日(水) 2:56 Juliet
ase map will be needed. But if we document it as
> being for natural language, and set the right expectations, we can fix
> the edge cases later.
>
> -- Tim Starling
>
Hi, Tim
Thank you for Georgian test case.
I added to test case.
If other any comments, please feel free.
Regards
Yuya
2024年2月7日(水) 4:49 youkidearitai :
>
> 2024年2月7日(水) 2:56 Juliette Reinders Folmer
> :
> >
> > On 6-2-2024 3:40, youkidearitai wrote:
> > > 2024年2月6日(火) 8:33 Tim Starling :
> > >> On 2/2/24 20:27, youkidearitai wrote:
> > >>
> > >> I
2024年2月7日(水) 2:56 Juliette Reinders Folmer :
>
> On 6-2-2024 3:40, youkidearitai wrote:
> > 2024年2月6日(火) 8:33 Tim Starling :
> >> On 2/2/24 20:27, youkidearitai wrote:
> >>
> >> I see. I'll change mb_ucfirst using titlecase.
> >>
> >> P
2024年2月6日(火) 8:33 Tim Starling :
>
> On 2/2/24 20:27, youkidearitai wrote:
>
> I see. I'll change mb_ucfirst using titlecase.
>
> Per my comments a month ago on the GitHub issue , I think it is much better
> to use title case for mb_ucfirst() than to use upper case,
2024年2月2日(金) 18:15 Ayesh Karunaratne :
>>
>> On Fri, Feb 2, 2024 at 2:00 AM youkidearitai
>> wrote:
>>
>> > Hi, Internals
>> >
>> > I have just opened the voting "Multibyte ucfirst and lcfirst functions"
>> > RFC.
>&g
fo
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
fy this paragraph.
>
> I'm fine with the concept, but the text needs to be tightened.
>
Thank you very much for pointing that out.
I use translate Google translate, and add example.
If this is difficult to understand, ask someone who knows English.
--
---
Yu
Hi, Internals
I starting discussion to mb_ucfirst and mb_lcfirst.
RFC Link: https://wiki.php.net/rfc/mb_ucfirst
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals
Hi, Internals
I have been create an RFC: https://wiki.php.net/rfc/mb_ucfirst
Please feel free comment.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP
kitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
Cheers for new year.
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
lot of mbstring operations should not be used on invalid strings,
> > and an exception seems to be an appropriate answer despite the huge BC
> > impact.
>
> I think it is only a security issue when people accidentally think
> mb_* functions should be used if it is available. I'
ytes are
> > considered one character, and the parsing is started over again at the
> > invalid byte. Let's consider the following example:
> >
> > mb_strpos("\xf0\x9fABCD", "B"); // int(2)
Yes, that's true. Because mb_strpos is convert to UTF-8 in internal.
However, other mbstring function is temporary convert to UTF-32, then
reconvert to original character encoding.
Anyway, I'll wait Alex's reply.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
en)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
y with PSR-1 "MUST use only UTF-8 without BOM".
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
2023年11月29日(水) 21:16 youkidearitai :
>
> 2023年11月29日(水) 20:42 Hans Henrik Bergan :
> >
> > i think Shift_JIS can also be automatically converted to UTF-8, does
> > this seem right?
> > https://github.com/divinity76/php2utf8/commit/6e08c4c16312961170cce821195816a8
nge PHP script character encoding.
In addition to this, We have to think about various things.
This is not just a Japanese problem.
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals
g.php
array(7) {
["biao_hex"]=>
string(4) "955c"
["zend.multibyte"]=>
string(1) "1"
["zend.script_encoding"]=>
string(4) "sjis"
["zend.detect_unicode"]=>
string(1) "1"
["mbstring.interna
string.internal_encoding" => ini_get("mbstring.internal_encoding"),
> "mbstring.func_overload" => ini_get("mbstring.func_overload"),
> "PHP_VERSION" => PHP_VERSION,
> ));
> ```
Hi, Hans
I'm trying to above code.
Nothing config
2023年11月29日(水) 8:07 Hans Henrik Bergan :
>
> @youkidearitai right now the code specifically deals with
> - UTF8: removing UTF8 BOM and removing `declare(encoding='UTF-8');
> - UTF16LE/UTF16BE/UTF32LE/UTF32BE: converting to UTF8 removing the BOM
> and removing declare(encod
gt; > > It is not necessary as simple: because your code base may contain literal
> > > strings, and changing the encoding of the source file will effectively
> > > change the contents of the strings.
> > >
> > > —Claude
> > >
>
> -
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
2023年11月3日(金) 0:10 youkidearitai :
>
> Hi, Internals
>
> I have just opened voting on the RFC to mb_trim.
> Voting started now, and will run until November 17th, 24:00 GMT
>
> Link:
> https://wiki.php.net/rfc/mb_trim#voting
>
> (It's my first time so please tell
ffected to below.
- https://github.com/php/php-src/issues/10192
- https://github.com/php/php-src/issues/7871
If PHP 8.0's EOL is one more year, There is a one-year grace period
for upgrades.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
uya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
2023年10月19日(木) 22:33 Niels Dossche :
>
> Hi Yuya
>
> On 19/10/2023 13:57, youkidearitai wrote:
> > Hi, internals.
> >
> > 8ctopus san can't send email, so I'm writing new RFC for multibyte
> > trim function.
> > https://wiki.php.net/rfc/m
://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
Hi, internals.
I writing to trim for multibyte support function, mb_trim, mb_ltrim
and mb_rtrim.
https://github.com/php/php-src/pull/12459
Please give me RFC Karma.
Username: youkidearitai
Regards.
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https
--
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
he string_view stuff with
>
> size_t teststrlen = strlen(teststr);
> str_dump(teststrlen, teststr);
> mb_trim(&teststrlen, &teststr, trim_lengths_num, trim_lengths,
> trim_chars);
> str_dump(teststrlen, teststr);
>
> and it should be pure C, i think
>
>
Hi, Hans.
Thank you very much for your code. And sorry for late.
I confirmed works fine.
https://gist.github.com/youkidearitai/0018dee27353c00aebaff3bf57c5b8c6
However, this code is C++17, php-src is C code.
If you would like contribute, I would like written to C code.
Regards
Yuya
2023年10月1日
n.
Please give me some time.
If have any idea, feel free to comment to me.
Regards
Yuya
--
---
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php
Hi, Internals.
When I was watching https://github.com/php/php-src/issues/9216,
I also want to trim function of multibyte trim functions.
I think that in addition to mb_trim,
mb_ltrim and mb_rtrim are also necessary.
What do you think about this?
Regards
Yuya
--
PHP Internals - PHP Runtime Dev
65 matches
Mail list logo