Re: [rust-dev] Iterator blocks (yield)

Michael Woerister Sun, 11 Aug 2013 07:43:47 -0700

On 11.08.2013 12:01, Armin Ronacher wrote:

The way "yield return" works in C# is that it rewrites the code into a state machine behind the scenes. It essentially generates a helper class that encapsulates all the state.
In Rust that's much harder to do due to the type system. Imagine you are doing a yield from a generic hash map. The code that does the rewriting would have to place the hash map itself on the helper struct that holds the state. Which means that the person writing the generator would have to put that into the return value.

I think transforming the yielding function into a state machine, as is done in C#, will be the way to go for Rust too.

Rust's type system makes this a bit more complicated than in C#. However, the necessary code transformation has many similarities with supporting closures:

* For a closure, the compiler generates a hidden environment struct, containing the captured variables, which is then implicitly passed to the closure function.
* For yield, the compiler also generates a hidden struct, implementing the state machine logic, but also 'capturing' the arguments and local variables of the yielding function.
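To make the closure half of this analogy concrete, here is a rough hand-written sketch in present-day Rust (not the 2013 dialect used elsewhere in this mail). The names `AddOffsetEnv` and `call` are made up for illustration; the type the compiler really generates for a closure is anonymous, and this only approximates the idea of an environment struct that is passed to the closure body.

```rust
// Hand-written stand-in for the hidden environment struct of a closure.
// `AddOffsetEnv` and `call` are hypothetical names; the compiler's real
// closure type is anonymous.
struct AddOffsetEnv<'a> {
    offset: &'a i32, // captured variable, borrowed from the enclosing scope
}

impl<'a> AddOffsetEnv<'a> {
    // The closure body, with the environment passed explicitly as `self`.
    fn call(&self, x: i32) -> i32 {
        x + *self.offset
    }
}

fn main() {
    let offset = 10;

    // What the programmer writes:
    let closure = |x: i32| x + offset;

    // Roughly what it corresponds to:
    let env = AddOffsetEnv { offset: &offset };

    assert_eq!(closure(5), env.call(5));
    println!("{}", env.call(5));
}
```

The lifetime parameter on the environment struct is the part that carries over to yield: whatever the hidden struct borrows limits how long the struct itself may live.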

It is really rather similar as far as the type system is concerned. If we can define sound semantics for closures, we should also be able to define sound semantics for yielding functions.

As for having to put the hashmap into the return value, I don't think this is necessary because implementation details of the function are hidden behind the std::iterator::Iterator trait.

I imagine the compiler would do a desugaring like the following:

// Original function

fn yield_some<'a>(xs: &'a HashMap<int, float>, a: int, b: int) -> Iterator<float> {
    yield return xs.get(a);
    yield return xs.get(b);
}

// Desugared version

fn yield_some<'a>(xs: &'a HashMap<int, float>, a: int, b: int) -> yield_some_iterator<'a> {
    return yield_some_iterator {
        state: 0,
        xs: xs,
        a: a,
        b: b,
    };
}

struct yield_some_iterator<'self> {
    priv state: uint,
    priv xs: &'self HashMap<int, float>,
    priv a: int,
    priv b: int
}

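impl<'self> std::iterator::Iterator<float> for yield_some_iterator<'self> {
    // next() takes &mut self because it advances the state machine
    fn next(&mut self) -> Option<float> {
        match self.state {
            0 => {
                self.state = 1;
                Some(self.xs.get(self.a))
            }
            1 => {
                self.state = 2;
                Some(self.xs.get(self.b))
            }
            // all yields are done; the iterator stays exhausted
            _ => None
        }
    }
}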

As you can see, the compiler substitutes a concrete iterator type. But this iterator type can only ever be accessed through the std::iterator::Iterator interface, which only exposes the return value of the next() method. The internal state of the iterator (such as the hash map) does not have to be considered by the user. (The lifetimes of the yielding function's arguments can, however, influence the lifetime of the resulting iterator, as is the case with closures and captured variables.)
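For readers on a current toolchain, the same desugaring can be sketched in present-day Rust. This is only an illustration under a few assumptions: `YieldSomeIterator` is a made-up name, `int`/`float` are rendered as `i32`/`f64`, indexing with `[]` stands in for the old `get` (and panics on a missing key), and `impl Iterator` in return position plays the role of "only accessible through the Iterator interface". The `+ 'a` bound is where the lifetime of the borrowed hash map shows up in the signature.

```rust
use std::collections::HashMap;

// Hand-written state machine; `YieldSomeIterator` is a hypothetical name.
struct YieldSomeIterator<'a> {
    state: u8,
    xs: &'a HashMap<i32, f64>,
    a: i32,
    b: i32,
}

impl<'a> Iterator for YieldSomeIterator<'a> {
    type Item = f64;

    fn next(&mut self) -> Option<f64> {
        match self.state {
            0 => {
                self.state = 1;
                Some(self.xs[&self.a]) // first "yield"
            }
            1 => {
                self.state = 2;
                Some(self.xs[&self.b]) // second "yield"
            }
            _ => None, // exhausted
        }
    }
}

// Callers only learn: "some iterator of f64 that borrows from `xs`".
fn yield_some<'a>(
    xs: &'a HashMap<i32, f64>,
    a: i32,
    b: i32,
) -> impl Iterator<Item = f64> + 'a {
    YieldSomeIterator { state: 0, xs, a, b }
}

fn main() {
    let mut xs = HashMap::new();
    xs.insert(1, 1.5);
    xs.insert(2, 2.5);

    let values: Vec<f64> = yield_some(&xs, 1, 2).collect();
    println!("{:?}", values);
}
```

The hash map reference lives in the hidden struct, yet nothing about that struct appears in the function's signature apart from the lifetime.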

I currently have a really hard time thinking about how the c# trick would work :-(

Maybe you do now :)


Aside from this some random notes from Python:

- generators go in both directions in Python which caused problems
  until Python 3.3 where "yield from" (your "yield ..") was introduced
  that expands into a monstrosity that forwards generators into both
  directions.

Can you elaborate on what you mean by "both directions"?

- instead of using "fn" like "def" in Python I would prefer if it was
  an explicit "yield fn" that indicates that the function generates an
  iterator.  The fact that Python reuses "def" is a source of lots of
  bugs and confusion.

I think Rust has an advantage over Python here, in that for every function the return type is explicitly declared. So, if a function returns an Iterator<T> (or better), one does not have to care about whether the function is implemented via 'yield' or if it returns a handwritten iterator. So I think "yield fn" would be redundant here. The type checker won't let through anything confused.
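A small sketch of that point in present-day Rust, with made-up names (`Countdown`, `countdown_handwritten`, `countdown_adaptor`). Since stable Rust has no 'yield', an adaptor chain stands in here for "some other way of implementing the function"; the two functions share a signature, and the caller cannot tell which strategy is behind it.

```rust
// Handwritten iterator; `Countdown` is a hypothetical name.
struct Countdown {
    remaining: u32,
}

impl Iterator for Countdown {
    type Item = u32;

    fn next(&mut self) -> Option<u32> {
        if self.remaining == 0 {
            None
        } else {
            self.remaining -= 1;
            Some(self.remaining + 1)
        }
    }
}

// Implemented with the handwritten struct above.
fn countdown_handwritten(from: u32) -> impl Iterator<Item = u32> {
    Countdown { remaining: from }
}

// Same signature, different implementation strategy.
fn countdown_adaptor(from: u32) -> impl Iterator<Item = u32> {
    (1..=from).rev()
}

fn main() {
    let a: Vec<u32> = countdown_handwritten(3).collect();
    let b: Vec<u32> = countdown_adaptor(3).collect();
    assert_eq!(a, b);
    println!("{:?}", a);
}
```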



_______________________________________________
Rust-dev mailing list
Rust-dev@mozilla.org
https://mail.mozilla.org/listinfo/rust-dev
