[rust-dev] a vision for vectors

Niko Matsakis Wed, 13 Jun 2012 16:15:38 -0700

Hello,

I wanted to check something. We are working on the Great Change to a more flexible vector system and I want to outline the design that's in my head. This has some implications for How Efficient Rust Code Is Written, so I wanted to make sure we were all on the same page.


*Implications for writing efficient Rust*

I figured I'd just start with the implications for writing Rust. Currently, to build up a vector, we rely upon an idiom like:


let mut v = [];
for some loop { v += [elt]; }

Now, often, such loops can (and should) be written using a higher-order function (e.g., map, filter). But sometimes not. In such cases, under the new regime, the recommended idiom would be:


let dv = dvec();
for some loop { dv.push(elt); }
let v = dvec::unwrap(dv); // if necessary, convert to a vector

Actually the name `dvec()` (dynamic vector) will probably change (perhaps to vecbuf? mvec? suggestions welcome), but you get the idea. The same would eventually apply to building up strings.

Basically, the idea is that we have "builder" types for vectors and strings. These builder types will overallocate and use dirty tricks to achieve reasonable performance. Using convenience operators like `+` will not do such things.
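
To make the contrast concrete, here is a rough sketch in present-day Rust syntax rather than the dialect used elsewhere in this message. It assumes the standard `Vec` can stand in for the builder role that `dvec` is meant to play; the function names `build_with_builder` and `build_by_concat` are made up for illustration and are not part of any proposed API.

```rust
/// Builder-style construction: append in place. `Vec` over-allocates as it
/// grows, so the total copying work stays roughly linear in `n`.
fn build_with_builder(n: u32) -> Vec<u32> {
    let mut buf = Vec::new();
    for i in 0..n {
        buf.push(i * i);
    }
    buf
}

/// Concatenation-style construction: every step allocates a new vector
/// sized for the old contents plus one element and copies everything over.
/// This models what a plain `+` without compiler tricks would do, and the
/// copying work grows quadratically in `n`.
fn build_by_concat(n: u32) -> Vec<u32> {
    let mut v: Vec<u32> = Vec::new();
    for i in 0..n {
        let mut next = Vec::with_capacity(v.len() + 1);
        next.extend_from_slice(&v);
        next.push(i * i);
        v = next;
    }
    v
}
```

Both functions produce the same vector; only the amount of allocation and copying differs, which is the reason for recommending the builder idiom in loops.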


*Details*

The actual implementation strategy is that the representation of vectors will stay mostly the same as it is now. However, when the compiler allocates vectors, it will always do so for precisely the size they need to be (fill == alloc, in our vector rep). There will be internal functions (vec::alloc_empty_with_capacity() or something) that allocate an empty vector but with a large capacity and unsafe functions that can be used to set the length. These can be used by dvec-like classes but also by routines like `vec::map()`. Most of this exists today. The only real thing that changes is that we take *away* the tricks the compiler does for `+=`.
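
As an illustration of the "allocate with capacity, then set the length" pattern, here is a sketch in present-day Rust. It is an analogy, not the internal functions described above: `Vec::with_capacity` and the unsafe `Vec::set_len` play the roles of the capacity-allocating function and the length-setting function, and `map_with_set_len` is a hypothetical name for a `vec::map()`-like routine.

```rust
/// A map-like routine that allocates the output once, writes each element
/// into the reserved space, and only then sets the length.
fn map_with_set_len<T, U>(input: &[T], f: impl Fn(&T) -> U) -> Vec<U> {
    // Empty vector (length 0) with room for exactly the elements we need.
    let mut out: Vec<U> = Vec::with_capacity(input.len());
    let base = out.as_mut_ptr();
    for (i, x) in input.iter().enumerate() {
        // SAFETY: i < input.len() <= out.capacity(), so the slot lies inside
        // the allocation; `write` does not read or drop the old contents.
        unsafe { base.add(i).write(f(x)) };
    }
    // SAFETY: the first input.len() slots were initialized by the loop above.
    unsafe { out.set_len(input.len()) };
    out
}
```

Ordinary callers never see the unsafe part; they just get a vector whose length matches the input, for example `map_with_set_len(&[1, 2, 3], |x| x * 2)`.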


*Motivation*

Part of the motivation for this change is that when you have task-local vectors, the tricks we play now where we treat vectors both as values and as things that can be updated in place don't work so well (this is precisely why the move was made to unique vectors in the first place, as I understand it). However, task-local vectors are good for a number of reasons (cheaper copies; easier to ensure memory safety), so I expect we'll wind up using them a fair amount. To obtain reliably good performance, then, a builder like `dvec` can be used that encapsulates the task-local vector pointer until construction is complete, making it safe to append to it in place.

Another motivation is that it is part of a general trend to push intelligence out of the compiler and into libraries where possible. We can build vector append using overloaded operators. This also ensures that end-users will be able to design efficient libraries and so forth. Moving things like `vector +` into libraries also simplifies the type checker, as we can draw on impls to handle all the various cases (@ vs ~ vectors, imm vs mut vectors, and so forth).
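
For a feel of what "vector append as a library-level overloaded operator" could look like, here is a small sketch in present-day Rust. It is only an assumption about the shape of such code: `ExactVec` is a hypothetical wrapper type, and the `std::ops::Add` trait used here is today's mechanism, not necessarily the one the design above would use.

```rust
use std::ops::Add;

/// Hypothetical wrapper used only for this illustration.
#[derive(Debug, Clone, PartialEq)]
struct ExactVec<T>(Vec<T>);

/// `a + b` is defined entirely in library code: allocate room for both
/// operands once, then move the elements of each into the result.
impl<T> Add for ExactVec<T> {
    type Output = ExactVec<T>;

    fn add(self, rhs: ExactVec<T>) -> ExactVec<T> {
        let mut out = Vec::with_capacity(self.0.len() + rhs.0.len());
        out.extend(self.0);
        out.extend(rhs.0);
        ExactVec(out)
    }
}
```

With this impl in scope, `ExactVec(vec![1, 2]) + ExactVec(vec![3])` yields `ExactVec(vec![1, 2, 3])`, and further impls could cover the other pointer and mutability combinations without any special support in the type checker.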



Niko