[rust-dev] &self/&mut self in traits considered harmful(?)

SiegeLord Wed, 11 Jun 2014 06:28:11 -0700

First, let me begin with a small discussion about C++ rvalue references.As some of you know, they were introduced to C++ in part to solveproblems like this:


Matrix m;
m.data = {1.0, 2.0, 3.0};
Matrix m2 = m * 2.0 * 5.0 * 10.0;

Before C++11, most implementations the multiplications on the third linewould create two (unnecessary) temporary copies of the Matrix, causingwidespread inefficiency if Matrix was large. By using rvalue references(see the implementation in this gist:https://gist.github.com/SiegeLord/85ced65ab220a3fdc1fc we can reduce thenumber of copies to one. What the C++ does is that the firstmultiplication (* 2.0) creates a copy of the matrix, and the remainingmultiplications move that copy around.

If you look at the implementation, you'll note how complicated the C++move semantics are compared to Rust's (you have to use std::moveeverywhere, define move-constructors and move-assignment witheasy-to-get-wrong implementations etc.). Since Rust has simpler movesemantics, can we do the same thing in Rust?

It turns out we cannot, because the operator overloading in Rust is doneby overloading a trait with a method that takes self by reference:


pub trait Mul<RHS, Result>
{
    fn mul(&self, rhs: &RHS) -> Result;
}

This means that the crucial step of moving out from the temporary cannotbe done without complicated alternatives (explained at the end of thisemail). If we define an a multiplication trait that takes self by value,however then this is possible and indeed relatively trivial (seeimplementation here:https://gist.github.com/SiegeLord/11456760237781442cfe ). This code willact just like the C++ did: it will copy during the first move_mul call,and then move the temporary around:


let m = Matrix{ data: vec![1.0f32, 2.0, 3.0] };
let m2 = (&m).move_mul(2.0).move_mul(5.0).move_mul(10.0);

So there's nothing in Rust move semantics which prevents this usefulpattern, and it'd be possible to do that with syntax sugar if theoperator overload traits did not sabotage it. Pretty much all theexisting users (e.g. num::BigInt and sebcrozet's nalgebra) of operatoroverloading traits take the inefficient route of creating a temporarycopy for each operation (seehttps://github.com/mozilla/rust/blob/master/src/libnum/bigint.rs#L283andhttps://github.com/sebcrozet/nalgebra/blob/master/src/structs/dmat.rs#L593). If the operator overloading traits do not allow you to createefficient implementations of BigNums and linear algebra operations, thetwo use cases why you'd even *have* operator overloading as a languagefeature, why even have that feature?

I think this goes beyond just operator overloading, however, as thesekinds of situations may arise in many other traits. By defining traitmethods as taking &self and &mut self, we are preventing these usefuloptimizations.

Aside from somewhat more complicated impl's, are there any downsides tonever using anything but by value 'self' in traits? If not, then I thinkthat's what they should be using to allow people to create efficientAPIs. In fact, this probably should extend to every member genericfunction argument: you should never force the user to tie their hands byusing a reference. Rust has amazing move semantics, I just don't seewhat is gained by abandoning them whenever you use most traits.

Now, I did say there are complicated alternatives to this. First, youactually *can* move out through a borrowed pointer usingRefCell<Option<T>>. You can see what this looks like here:https://gist.github.com/SiegeLord/e09c32b8cf2df72b2422 . I don't knowhow efficient that is, but it is certainly more fragile. With myby-value MoveMul implementation, the moves are checked by thecompiler... in this case, they are not. It's easy to end up with amoved-out, dangling Matrix. This is what essentially has to be done,however, if you want to preserve the general semantic of the code.

Alternatively, you can use lazy evaluation/expression templates. This isthe route I take in my linear algebra library. Essentially, eachoperation returns a struct (akin to what happens with many Iteratormethods) that stores the arguments by reference. When it comes time toperform assignment, the chained operations are performed element-wise.There are no unnecessary copies and it optimizes well. The problem isthat its a lot more complicated to implement and it pretty much forcesyou to use interior mutability (just Cell this time) if you don't want acrippled API. The latter bit introduces a whole slew of subtle bugs (inmy opinion they are less common than the ones introduced by RefCell).Also, I don't think expression templates are the correct way to wrap,e.g., a LAPACK library. I.e. they only work well when you'reimplementing the math yourself which is not ideal for the morecomplicated algorithms. Along the same lines, it is not immediatelyobvious to me how to extend this lazy evaluation idea to something likenum::BigInt. So far, it seems like lazy evaluation will force dynamicdispatch in that case which is a big shame (i.e. you'd store theoperations in one array, arguments in another and then play them back atthe assignment time).


So, I think the situation is pretty bad. What can be done to fix it?

-SL
_______________________________________________
Rust-dev mailing list
[email protected]
https://mail.mozilla.org/listinfo/rust-dev

[rust-dev] &self/&mut self in traits considered harmful(?)

Reply via email to