Re: [Haskell-cafe] Advice needed on best way to simulate an STL vector

Brian Hulley Wed, 19 Apr 2006 11:57:30 -0700

On Wednesday 19th April 2006 18:09PM Udo Stenzel wrote:

Brian Hulley wrote:

In C++, STL provides a vector class which behaves as an array except you
can insert/delete elements from it.

Though you shouldn't.  If you constantly insert and delete in the middle
of a std::vector, you're not using the right data structure.  In fact,
std::vector is almost always wrong and std::deque would probably serve
you better.

std::deque only gives fast insert/delete at the ends so for insert/delete inthe middle it is still slow, and any speedup relative to std::vector mightbe offset by extra slowness in subscripting if multiple physical blocks ofmemory are used to simulate a contiguous array. I could have used astd::list (which is doubly linked) but then I'd lose the constant timerandom element access, so in my particular case (which was a text buffer foran edit control implemented as a std::vector of lines where each linecontains some book-keeping info plus a std::vector of character info) thestd::vector seemed to work out to be the best one to use, since there aremore read operations (rendering, parsing etc) than write operations (usertyping a character).

I'm wondering what is the best Haskell
data structure to use to simulate this, either mutable or immutable.

The obvious mutable data structure is an (STRef (STArray i a)).  You can
implement std::vector in terms of that, almost literally translating
from C++.  If you want Haskell code that looks as ugly as C++, you
should do exactly that.

I'm keen to learn what the Haskell way is rather than just porting my oldC++ code directly.

Immutable array-like thing with insertion and deletion are an
ill-conceived idea, imho.  Every write operation would require a
complete copy and often a reallocation, too.

It depends how many write operations there are in practice, versus how manytimes you need to read from it using array access. A reallocation (amortizedcost O(0)) and copy (a simple memcpy) might be very fast compared to thetime it might take for generational garbage collection to deal with theproblem of cells in a previous generation referencing new cells as happensin mutable data structures. But of course it's probably not optimal.

Instead, use some functional sequence implementation, like Finger Trees.
Operations in the middle of the sequence incur a logarithmic cost, but
thats better than constantly copying the whole thing around.  Being
immutable it also results in more idiomatic code where you don't need to
drag the ST monad around everywhere.  You might also consider a
Finger Tree of smallish Arrays, that's about the closest equivalent to
std::deque you can get.

Thanks, I've downloaded a paper about them fromhttp://www.informatik.uni-bonn.de/~ralf/publications/FingerTrees.pdf so I'llsee if I can understand it! Looks interesting...

Best regards, Brian.

_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] Advice needed on best way to simulate an STL vector

Reply via email to