2025年8月26日(火) 19:15 Derick Rethans <[email protected]>:
>
> On Mon, 25 Aug 2025, Christoph M. Becker wrote:
>
> > On 25.08.2025 at 09:26, youkidearitai wrote:
> >
> > > I improvement this RFC. https://wiki.php.net/rfc/eol-oniguruma
> > >
> > > Added more information about maintenance versions. What do you think
> > > about Oniguruma maintenance ended. Please watch and feel free to
> > > comment.
> >
> > First, thank you for caring about this!  I agree that we need a long
> > term solution for this issue.  As I understand it, Oniguruma's
> > greatest advantage over PCRE2 is that it supports other character
> > encodings than Unicode and ANSI, so deprecating mbregex might be a
> > problem for some users.
>
> Yes, but I think Yuya mentioned somewhere else (I can't find it now) in
> an earlier discussion, that many of these users now also moved to UTF-8.
> It would also be possible to rewrite these uses from using mbregex to
> UConverter::convert+pcre.
>
> Incidently, icu also has a regular expression engine, but of course
> that'll operate on UTF-16, and we'd have to create a full new
> implementation for that:
>
> - https://unicode-org.github.io/icu-docs/apidoc/released/icu4c/uregex_8h.html
>
> > Still, the alternative would likely be to bundle liboniguruma, and I
> > don't think that would be a good idea.  So deprecating mbregex as of
> > PHP 8.6.0 seems prudent; if there would be lots of objections, we
> > could still reconsider.
>
> I agree with that.
>
> > Now I wonder how much trouble it would be to separate mbregex from
> > ext-mbstring.  If that can be done with a reasonable amount of work,
> > that would likely be the best course of action (in addition to
> > deprecating mbregex).  We could than move the extension to PECL/PIE,
> > and let users deal with it (I'm not happy what happened to ext-imap,
> > but it's still better than relying on an unmaintained library from a
> > bundled extension).
>
> Seeing code like in mbstring.c
>
> #ifdef HAVE_MBREGEX
>     PHP_MINIT(mb_regex) (INIT_FUNC_ARGS_PASSTHRU);
> #endif
>
> And:
>
> php_mbregex.h:PHP_MINIT_FUNCTION(mb_regex);
> php_mbregex.h:PHP_MSHUTDOWN_FUNCTION(mb_regex);
> php_mbregex.h:PHP_RINIT_FUNCTION(mb_regex);
> php_mbregex.h:PHP_RSHUTDOWN_FUNCTION(mb_regex);
> php_mbregex.h:PHP_MINFO_FUNCTION(mb_regex);
>
> makes it feel that it already sort-of operates as a sub-extension, and
> it wouldn't be *too* much work. But it will still be work. Is it worth
> it?
>
> cheers,
> Derick
> --
> https://derickrethans.nl | https://xdebug.org | https://dram.io
>
> Author of Xdebug. Like it? Consider supporting me: https://xdebug.org/support
>
> mastodon: @[email protected] @[email protected]

Hi, Internals

I created extension mb_onig that separate mbregex functions.
This package include Oniguruma that my update(Unicode 17.0).
https://packagist.org/packages/mb_onig/mb_onig
(This package is experimental)

With this idea, I think it would be good to separate mbregex.
What do you think?

Regards
Yuya

-- 
---------------------------
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-----------------------------

Reply via email to