Regarding naming, I deliberately switched from "irregex" (no "p") to
"regexp", though I can't recall why.  Perhaps just to differentiate.
Shouldn't the plural be "regexen"? :)

Regarding match semantics, the intention was to deliberately leave this
unspecified, though it's such a crucial issue that at least that decision
should be stated clearly.
Specifically, I didn't want to force implementors to use or reinvent the
reference implementation, but to have the option of just compiling SREs to
their native regexp implementations, like SCSH does.

I think there should be a PFN clarifying this and also making it clear that
leftmost-longest is preferred if at all possible.
It's also worth noting that the non-greedy patterns, if supported, should
be leftmost-shortest.

Separately it may be worth noting that the sample implementation now
supports all features except back references (which it likely never will).

-- 
Alex

On Sat, Oct 19, 2024 at 2:29 PM Arthur A. Gleckler <s...@speechcode.com>
wrote:

> On Fri, Oct 18, 2024 at 7:18 AM Daphne Preston-Kendal <d...@nonceword.org>
> wrote:
>
>
>> Also, it’s been raised by John before (on my instigation, heh), but I
>> really do feel a PFN could clarify that leftmost-longest semantics are
>> intended for submatches, if that is indeed the intention. (chibi regexp)
>> uses leftmost-longest; I haven’t tested the ‘other’ sample implementation,
>> irregex (no ‘p’!), but based on what Alex said when the issue was raised
>> last I assume it does as well.
>> <https://srfi-email.schemers.org/srfi-115/msg/3693241/>
>>
>
> Alex, would you please chime in?
>
> Thanks.
>

Reply via email to