Re: Operator overloading

Andrei Alexandrescu Sat, 27 Dec 2008 10:05:18 -0800

Don wrote:

Andrei Alexandrescu wrote:
Bill Baxter wrote:
On Sat, Dec 27, 2008 at 9:42 AM, The Anh Tran <trthe...@gmail.com>wrote:
aarti_pl wrote:
Andrei Alexandrescu pisze:
 > We're trying to make that work. D is due for an operator overhaul.
 >
 > Andrei
Is there any chance that we get possibility to overload "rawoperators",like in C++? I think that they may coexist with currently definedoperatoroverloads with simple semantic rules, which will not allow them towork
together at the same time.
..........
BR
Marcin Kuszczak
(aarti_pl)
Me also have a dream :D

<Daydream mode>
class Foo
{
       auto op(++)(); // bar++
       auto op(++)(int); // ++bar

       op(cast)(uint); // cast(uint)bar // opCast
       auto op(())(int, float); // Foo(123, 123.456) // opCall

       auto op(+)(Foo rhs); // bar1 + bar2
       auto op(+=)(int); // bar += 1234;
       auto op(.)(); // bar.xyz // opDot

       Foo op([][][])(int, char, float); // bar[123]['x'][123.456]

       auto op([..])(); // i = bar2[] // opSlide
       auto op([..])(int, int); // bar[1..10]

       auto op([..]=)(float); // bar[] = 12.3 //opSlideAssign
       auto op([..]=)(int, int, float); // bar[1..3] = 123.4
}
</Dream>
When I suggested this kind of thing long ago, Walter said that it
encourages operator overload abuse, because it suggests that  + is
just a generic symbolic operator rather than something that
specifically means "addition".  That's why D uses "opAdd" instead.
It's supposed to encourage only creating overloads that follow the
original meaning of the operator closely.  That way when you see a+b
you can be reasonably sure that it means addition or something quite
like it.
I think that argument is rather weak and ought to be revisited. It'sweak to start with as if writing "+" in a D program hardly evokesanything else but "plus". What the notation effectively achieved wasput more burden on the programmer to memorize some names for thealready-known symbols. I think the entire operator overloadingbusiness, which started from a legitimate desire to improve on C++'s,ended up worse off.
I feel quite strongly that C++'s operator overloading was a failedexperiment. The original intention (AFAIK) was to allow creation ofmathematical entities which could use natural syntax. The classicexample was complex numbers, and it works reasonably well for that,although it requires you to create an absurd number of repetitivefunctions.
But for anything much more complicated, such as matrices, tensors, biginteger arithmetic, etc -- it's an abject failure. It's clumsy, andcreates masses of temporary objects, which kills performance socompletely that it's unusable. But the whole point of operatoroverloading was to allow nice notation in a performace-orientedlanguage! Expression templates are basically a hack to restoreperformance in most cases, but it comes at a massive cost in simplicity.
And the performance even then is not always optimal.
I think that Walter's idea, in tightening the semantics of overloadedoperators, is the right approach. Unfortunately, it doesn't go farenough, so we get the worst of both worlds: the C++ freedom iscurtailed, but there isn't enough power to replace it.


Very well put.

Ultimately, I think that the problem is that ideally, '+' is not simplya call to a function called 'plus()'. What you'd like an operator tocompile to, depends on the expression in which it is embedded. Formaximum performance, an expression needs to be digested before it isconverted into elementary functions.
In my 'operator overloading without temporaries' proposal in Bugzilla,
I showed that DEFINING a -= b as being identical to a = a - b, and thencreating a symmetric operation for a = b - a allows optimal codegeneration in a great many cases. It's not a complete solution, though.
In particular, irreducible temporaries need more thought. Ideally, insomething like a += b * c + d, b*c would be created in a memory pool,and deleted at the end of the expression.(By contrast, a = b*c+d, would translate to a=b*c; a+=d; so no temporaryis required).

That's an awesome proposal. I'd like to expand it to comprehend fusionas well. Consider:


A = B + C - D;

where the operands are matrices. The best hand-written implementationwould loop once through the three matrices and assign to the destinationelement-wise A[i, j] = B[i, j] + C[i, j] - D[i, j]. However, with anapproach that has only one operator application as its horizon, it isimpossible to achieve that optimization. So I wonder what abstractioncould be devised that makes it easy and natural to support such fusion.Expression templates achieve that by saving the right-hand expressiontree as a type and then using it during the assignment. This requires aconsiderable effort and has some drawbacks.

There are other, less serious problems which also need to be addressed.

Defining ++a as a+=1 is probably a mistake. It raises lots of nasty issues.
* If a is a complex number, a = a + 1 makes perfect sense. But it's notobvious that ++a is sensible.* What type is '1'? Is it an int, a uint, a long, ... You don't havethat issue with increment.


Great points!

As I see it, there are two possible strategies:
(1) Pursuing optimal performance, which requires semantic tightening,and reduced flexibility, or
(2) Pursure simplicity and semantic flexibility, sacrificing performance.

I think those two possibilities are mutually exclusive.

I tend to be more optimistic, but if asked to choose, I'd go for (1).One important lesson learned from C++'s operator overloading is thatfreedom was almost always badly used. Tellingly, whenever operatoroverloading is taught or talked about, the first caveat mentioned isthat defining inconsistent batteries of operators is exceedingly easy.


Andrei

Re: Operator overloading

Reply via email to