Re: [Python-ideas] Membership of infinite iterators

MRAB Wed, 18 Oct 2017 11:25:09 -0700

On 2017-10-18 15:48, Nick Coghlan wrote:

On 18 October 2017 at 22:36, Koos Zevenhoven wrote:
    On Wed, Oct 18, 2017 at 2:08 PM, Nick Coghlan wrote:

        That one can only be fixed in count() - list already checks
        operator.length_hint(), so implementing
        itertools.count.__length_hint__() to always raise an exception
        would be enough to handle the container constructor case.


    While that may be a convenient hack to solve some of the cases,
    maybe it's possible for list(..) etc. to give Ctrl-C a chance every
    now and then? (Without a noticeable performance penalty, that is.)
    That would also help with *finite* C-implemented iterables that are
    just slow to turn into a list.

    If I'm not mistaken, we're talking about C-implemented functions
    that iterate over C-implemented iterators. It's not at all obvious
    to me that it's the iterator that should handle Ctrl-C.
It isn't, it's the loop's responsibility. The problem is that one of the core design assumptions in the CPython interpreter implementation is that signals from the operating system get handled by the opcode eval loop in the main thread, and Ctrl-C is one of those signals.
This is why "for x in itertools.cycle(): pass" can be interrupted, while "sum(itertools.cycle())" can't: in the latter case, the opcode eval loop isn't running, as we're inside a tight loop inside the sum() implementation.
It's easy to say "Well those loops should all be checking for signals then", but I expect folks wouldn't actually like the consequences of doing something about it, as:
1. It will make those loops slower, due to the extra overhead of checking for signals (even the opcode eval loop includes all sorts of tricks to avoid actually checking for new signals, since doing so is relatively slow)
2. It will make those loops harder to maintain, since the high cost of checking for signals means the existing flat loops will need to be replaced with nested ones to reduce the per-iteration cost of the more expensive checks

The re module increments a counter on each iteration and checks for signals when the bottom 12 bits are 0.

The regex module increments a 16-bit counter on each iteration and checks for signals when it wraps around to 0.

3. It means making the signal checking even harder to reason about than it already is, since even C implemented methods that avoid invoking arbitrary Python code could now still end up checking for signals
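
The counter-based checks mentioned above for the re and regex modules can be sketched in Python. This is only an illustration of the amortization idea, not the actual C code of either module: the names amortized_sum, check_signals and CHECK_MASK are made up for this example, and in a real C loop the check would be something like a call to PyErr_CheckSignals().

```python
# Hypothetical mask: check only when the bottom 12 bits of the counter are 0.
CHECK_MASK = 0xFFF


def amortized_sum(iterable, check_signals):
    """Sum the values, calling check_signals() once every 4096 iterations."""
    total = 0
    counter = 0
    for value in iterable:
        total += value
        counter += 1
        # The cheap bit test runs on every iteration; the expensive check
        # runs only when the low 12 bits of the counter are all zero.
        if (counter & CHECK_MASK) == 0:
            check_signals()
    return total


# Record each check so the amortization is visible.
checks = []
result = amortized_sum(range(10000), lambda: checks.append(1))
print(result, len(checks))
```

With a 12-bit mask the expensive check runs once per 4096 iterations; a 16-bit counter that wraps around to 0 would run it once per 65536.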
It's far from being clear to me that making such a change would actually be a net improvement, especially when there's an opportunity to mitigate the problem by having known-infinite iterators report themselves as such.
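
As a rough sketch of what "report themselves as such" could look like, here is a pure-Python stand-in for an infinite iterator whose __length_hint__() raises. InfiniteCounter and try_to_list are hypothetical names for this example. It assumes CPython's behaviour, where list() consults the length hint before it starts iterating, and it is not how itertools.count actually behaves.

```python
class InfiniteCounter:
    """Hypothetical known-infinite iterator (a stand-in for itertools.count)."""

    def __init__(self, start=0):
        self._value = start

    def __iter__(self):
        return self

    def __next__(self):
        value = self._value
        self._value += 1
        return value

    def __length_hint__(self):
        # Must not be TypeError: the length-hint protocol treats TypeError
        # as "no hint available" and falls back to a default size.
        raise OverflowError("iterator is infinite")


def try_to_list(iterable):
    """Return list(iterable), or a short message if the iterable refuses."""
    try:
        return list(iterable)
    except OverflowError as exc:
        return f"refused: {exc}"


print(try_to_list(InfiniteCounter()))
```

Ordinary iteration with next() or a for loop is unaffected; only consumers that ask for a length hint up front fail fast instead of looping forever.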
