Don wrote:
Andrei Alexandrescu wrote:
Bill Baxter wrote:
On Sat, Dec 27, 2008 at 9:42 AM, The Anh Tran <trthe...@gmail.com>
wrote:
aarti_pl wrote:
Andrei Alexandrescu pisze:
> We're trying to make that work. D is due for an operator overhaul.
>
> Andrei
Is there any chance that we get possibility to overload "raw
operators",
like in C++? I think that they may coexist with currently defined
operator
overloads with simple semantic rules, which will not allow them to
work
together at the same time.
..........
BR
Marcin Kuszczak
(aarti_pl)
Me also have a dream :D
<Daydream mode>
class Foo
{
auto op(++)(); // bar++
auto op(++)(int); // ++bar
op(cast)(uint); // cast(uint)bar // opCast
auto op(())(int, float); // Foo(123, 123.456) // opCall
auto op(+)(Foo rhs); // bar1 + bar2
auto op(+=)(int); // bar += 1234;
auto op(.)(); // bar.xyz // opDot
Foo op([][][])(int, char, float); // bar[123]['x'][123.456]
auto op([..])(); // i = bar2[] // opSlide
auto op([..])(int, int); // bar[1..10]
auto op([..]=)(float); // bar[] = 12.3 //opSlideAssign
auto op([..]=)(int, int, float); // bar[1..3] = 123.4
}
</Dream>
When I suggested this kind of thing long ago, Walter said that it
encourages operator overload abuse, because it suggests that + is
just a generic symbolic operator rather than something that
specifically means "addition". That's why D uses "opAdd" instead.
It's supposed to encourage only creating overloads that follow the
original meaning of the operator closely. That way when you see a+b
you can be reasonably sure that it means addition or something quite
like it.
I think that argument is rather weak and ought to be revisited. It's
weak to start with as if writing "+" in a D program hardly evokes
anything else but "plus". What the notation effectively achieved was
put more burden on the programmer to memorize some names for the
already-known symbols. I think the entire operator overloading
business, which started from a legitimate desire to improve on C++'s,
ended up worse off.
I feel quite strongly that C++'s operator overloading was a failed
experiment. The original intention (AFAIK) was to allow creation of
mathematical entities which could use natural syntax. The classic
example was complex numbers, and it works reasonably well for that,
although it requires you to create an absurd number of repetitive
functions.
But for anything much more complicated, such as matrices, tensors, big
integer arithmetic, etc -- it's an abject failure. It's clumsy, and
creates masses of temporary objects, which kills performance so
completely that it's unusable. But the whole point of operator
overloading was to allow nice notation in a performace-oriented
language! Expression templates are basically a hack to restore
performance in most cases, but it comes at a massive cost in simplicity.
And the performance even then is not always optimal.
I think that Walter's idea, in tightening the semantics of overloaded
operators, is the right approach. Unfortunately, it doesn't go far
enough, so we get the worst of both worlds: the C++ freedom is
curtailed, but there isn't enough power to replace it.
Very well put.
Ultimately, I think that the problem is that ideally, '+' is not simply
a call to a function called 'plus()'. What you'd like an operator to
compile to, depends on the expression in which it is embedded. For
maximum performance, an expression needs to be digested before it is
converted into elementary functions.
In my 'operator overloading without temporaries' proposal in Bugzilla,
I showed that DEFINING a -= b as being identical to a = a - b, and then
creating a symmetric operation for a = b - a allows optimal code
generation in a great many cases. It's not a complete solution, though.
In particular, irreducible temporaries need more thought. Ideally, in
something like a += b * c + d, b*c would be created in a memory pool,
and deleted at the end of the expression.
(By contrast, a = b*c+d, would translate to a=b*c; a+=d; so no temporary
is required).
That's an awesome proposal. I'd like to expand it to comprehend fusion
as well. Consider:
A = B + C - D;
where the operands are matrices. The best hand-written implementation
would loop once through the three matrices and assign to the destination
element-wise A[i, j] = B[i, j] + C[i, j] - D[i, j]. However, with an
approach that has only one operator application as its horizon, it is
impossible to achieve that optimization. So I wonder what abstraction
could be devised that makes it easy and natural to support such fusion.
Expression templates achieve that by saving the right-hand expression
tree as a type and then using it during the assignment. This requires a
considerable effort and has some drawbacks.
There are other, less serious problems which also need to be addressed.
Defining ++a as a+=1 is probably a mistake. It raises lots of nasty issues.
* If a is a complex number, a = a + 1 makes perfect sense. But it's not
obvious that ++a is sensible.
* What type is '1'? Is it an int, a uint, a long, ... You don't have
that issue with increment.
Great points!
As I see it, there are two possible strategies:
(1) Pursuing optimal performance, which requires semantic tightening,
and reduced flexibility, or
(2) Pursure simplicity and semantic flexibility, sacrificing performance.
I think those two possibilities are mutually exclusive.
I tend to be more optimistic, but if asked to choose, I'd go for (1).
One important lesson learned from C++'s operator overloading is that
freedom was almost always badly used. Tellingly, whenever operator
overloading is taught or talked about, the first caveat mentioned is
that defining inconsistent batteries of operators is exceedingly easy.
Andrei