Improving std.algorithm.find

Andrei Alexandrescu Sat, 17 Jul 2010 16:00:35 -0700

I was thinking of improving std.find. We have this bug report:


http://d.puremagic.com/issues/show_bug.cgi?id=3923

which is pretty vague but does have a point.

For starters, it should be told that std.algorithm.find _does_ a lot,which at least partially justifies its complexity. One thing that Ihaven't seen other find()s doing is to be able to search in one pass agiven range for multiple other ranges, like this:


int[] a = [ 1, 4, 2, 3 ];
assert(find(a, 5, 4, 3) == tuple([ 4, 2, 3 ], 2));

When passed more than two arguments, find returns a tuple continuing thesearched ranged positioned on the element found and the 1-based index ofthe parameter that was found. The trick is that find() makes exactly onepass through the searched range, which is often more efficient thansearching the same range for each element in turn. Also the one-passapproach works with input ranges so it doesn't put pressure on the rangecapabilities.

However the simplified find() looks like, I'd like to keep this featureunless it brings serious aggravation. Right now it's the #1 factor thatcomplicates find's signature and implementation.

Another aspect I'd like to discuss is use of Boyer-Moore and relatedfast finding techniques. Currently the use of B-M is explicit and a bitawkward. I was thinking instead to automatically detect itsappropriatedness and use it transparently. Bearophile posted at a linkto such a blend string search routine(http://effbot.org/zone/stringlib.htm) that I think can be generalizedquite easily.


Any ideas - please chime in.


Andrei

Improving std.algorithm.find

Reply via email to