Re: UFCS in generic libraries, silent hijacking, and compile errors.

aliak via Digitalmars-d-learn Tue, 13 Mar 2018 16:45:32 -0700

On Sunday, 11 March 2018 at 15:24:31 UTC, Jonathan M Davis wrote:

On Sunday, March 11, 2018 08:39:54 aliak via Digitalmars-d-learn wrote:
On Saturday, 10 March 2018 at 23:00:07 UTC, Jonathan M Davis
> issue in practice. That doesn't mean that it's never a problem, but from what I've seen, it's very rarely a problem, and it's easy to work around if you run into a particular case where it is a problem.
Ya, it's easy to work around but the caveat there is you need to realize it's happening first, and add to that that it's "rarely a problem" and well ... now it seems scary enough for this to be mentioned somewhere I'd say.
You're talking about a situation where you used a function whose parameters match that of a member function exactly enough that a member function gets called instead of a free function. That _can_ happen, but in most cases, there's going to be a mismatch, and you'll get a compiler error if the type defines a member function that matches the free function. I don't think that I have ever seen that happen or ever seen anyone complain about it. The only case I recall along those lines was someone who was trying to use a free function that they'd decided to call front instead of something else, and it had parameters beyond just the input range, so that programmer got compilation errors when they tried to use it in their range-based functions.

Not saying it's common, just something to be aware of that is non-obvious (well it was not to me at least when I started getting in to D). It's _probably_ not going to be a problem, but if it ever is then it's going to be a very hard one to detect. And sure, the solution is to just not use ufcs to be certain, but ufcs is pretty damn appealing, which is probably why I didn't realize this at the beginning. As generic code bases grow, the chance of this happening is certainly not 0 though.
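A minimal sketch of the silent case described above. All names here (describe, Plain, Custom, viaUfcs, viaCall) are hypothetical and only serve to illustrate the lookup rule; this is not code from any library.

```d
// Hypothetical free function that generic code expects to reach via UFCS.
string describe(T)(T value) { return "free"; }

struct Plain {}

struct Custom
{
    // Same name, compatible parameters: the member wins over the free function.
    string describe() { return "member"; }
}

// Generic code written with UFCS: which describe runs depends on T.
string viaUfcs(T)(T value) { return value.describe(); }

// Generic code written as a plain call: always the free function.
string viaCall(T)(T value) { return describe(value); }

unittest
{
    assert(viaUfcs(Plain()) == "free");
    assert(viaUfcs(Custom()) == "member"); // no error, just a different function
    assert(viaCall(Custom()) == "free");
}
```

Nothing fails to compile here, which is the point: the only way to notice is to know that Custom defines its own describe.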

Essentially yes, though you're passing too many arguments to put. There are cases where put(output, foo) will compile while output.put(foo) will not. In particular, std.range.primitives.put will accept both individual elements to be written to the output range and ranges of elements to be written, whereas typically, an output range will be written to only accept an element at a time. It's even more extreme with output ranges of characters, because the free function put will accept different string types and convert them, and even if the programmer who designed the output range added various overloads to put for completeness, it's enough extra work to deal with all of the various character types that they probably didn't. And put also works with stuff like delegates (most frequently used with a toString that accepts an output range), which don't have member functions. So, if you write your generic code to use the member function put, it's only going to work with user-defined types that define the particular overload(s) of put that you're using in your function, whereas if you use the free function, you have more variety in the types of output ranges that your code works with, and you have more ways that you can call put (e.g. passing a range of elements instead of a single element).
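A small sketch of that difference, assuming a hypothetical output range IntSink that only defines put(int). Only std.range.primitives.put is a real library function here.

```d
import std.range.primitives : put;

// Hypothetical output range that accepts one int at a time.
struct IntSink
{
    int[] data;
    void put(int x) { data ~= x; }
}

unittest
{
    IntSink sink;
    sink.put(1);        // member call: single element only
    put(sink, 2);       // free function forwards to the member
    put(sink, [3, 4]);  // free function also accepts a range of elements

    // The member has no overload taking int[], so this does not compile.
    static assert(!__traits(compiles, sink.put([5, 6])));

    assert(sink.data == [1, 2, 3, 4]);
}
```

Compiling with -unittest -main should be enough to run it.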


Ooh ouch, well that's certainly good to know about.

Basically I don't see a reason why we wouldn't want the following to work:
struct S { void f() {} }
void f(S s, int i) {}
S().f(3); // error
So, are you complaining that it's an error, or you want it to be an error? As it stands, it's an error, because as far as the compiler is concerned, you tried to call a member function with an argument that it doesn't accept.
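For reference, a sketch of the same snippet with the current behaviour spelled out as compile-time checks. S and f are the names from the example above; nothing else is assumed.

```d
struct S { void f() {} }
void f(S s, int i) {}

unittest
{
    // UFCS is not tried: S has a member named f, and it does not take an int.
    static assert(!__traits(compiles, S().f(3)));

    // Calling the free function without UFCS still works.
    static assert(__traits(compiles, f(S(), 3)));
    f(S(), 3);
}
```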

Complaining that it is an error :) well, not complaining, more trying to understand why really. And I appreciate you taking the time to explain. There're a lot of points in there so here we go...

If you want that code to work, then it would have to add the free function to the overload set while somehow leaving out the overloads that match the member function, which isn't how D deals with overloading at this point.

Yeah, I'd say that's an implementation detail, but the main idea would be to treat an overload set that completely fails as an undefined function so that ufcs would kick in. Your problems with put would also go away then and implementing an output range would be less of a hassle.

But if it did, then you have problems as soon as the type adds another member function overload.

I'm not sure I see how. The member function would win out. This is the situation now anyway, with the added (IMO) disadvantage of ufcs being unusable then.

Also, if you have a free function that matches the name of a member function but where their parameters don't match, wouldn't they be unrelated functions?

Well, maybe. The free function takes T as the first parameter so it's certainly related to the type. I suppose they are unrelated in the same way that:


struct S { void f() {} }
void g(S s) {}

g and f are unrelated.

At that point, if you wrote code that accidentally matched the free function instead of the member function, you end up with code hijacking.

I'm not sure if code hijacking is the correct term here. This is a programmer error. It's exactly the same as if you have f(int) and f(long) and you call f(3) expecting to call f(long). Or if you have f(int, int) and f(int) and you accidentally type f(1) instead of f(1, 1).

Just because you made a mistake when typing the code, you called entirely the wrong function, and it's very hard to see, because the function names match. Hopefully, testing will catch it (and there's a decent chance that it will), but essentially, the member function has been hijacked by the free function.

The exact same arguments can be made against function overloading here. This is as much a hijack as calling the wrong overload.

D's overload rules were written with a strong bias towards preventing function hijacking. To an extent, that's impossible once UFCS comes into play, and Walter went with the choice that hijacked the least and was the simplest to deal with.

Ya, I can understand it's a hard problem. So as it stands now, a member function can hijack an intended ufcs call of a free function. The case you've mentioned above though I'm not sure qualifies as hijacking. In the above case where a programmer accidentally types a name wrong, or parameters wrong, they've made a mistake. They wanted to call function f but they typed it wrong so they're calling function g. In this other case where a member function hijacks a ufcs call, the programmer intended to call f, typed f, but is somehow calling g.

Basically, once UFCS comes into play, you have these options:

1. Put all of the functions in the overload set.
2. The member function wins.
3. The free function wins.
4. Have a pseudo-overload set where when there's a conflict between a member function and a free function, the member function wins, but free functions that don't match can be called as well.
5. Have a pseudo-overload set where when there's a conflict between a member function and a free function, the free function wins, but member functions that don't match can be called as well.

If it's ever the case that the free function wins, then you can't call the member function if the free function is available, which definitely causes problems, so #3 and #5 are out. If all of the functions are in the overload set, then you're in basically the same boat, because you can't call the member function if there's a conflict. It's just that the free function results in a compilation error as well without using an alias or the full import path or some other trick to get at the free function. So, #1 is out.


3, 5 and 1, yes, all out, completely agree here.

That leaves #2 and #4. And as I said, aside from the fact that #4 doesn't fit with how D does overloads in general, you run the risk of the free function hijacking the member function whenever there's a mistake, and you have problems whenever the member functions are altered, making it so that which function gets called can change silently.

I understand that #4 does not fit with how D currently does overloads in general. And you make a good point about getting a silent ufcs call if you alter a member function after the fact. That would certainly be unwanted.

Hmm... ok touche on that part. I think I may agree with the current D implementation just because of that last point of yours now. I'm not entirely sure yet, need to think about it.

Now I'm thinking that if you really want to write a utility function that acts on generic code, but you also want to allow specialization by a type, then this (not sure it works, not tested):

void util(T, U)(T t, U u) if (hasMember!(T, "util") && is(typeof(t.util(u)))) {
    t.util(u);
}

void util(T t, int a) // int case
void util(T t, string a) // string case
void util(T, U)(T t, U u) {
    // generic case, probably needs constraints I can't think of though.
}

And then later:

void g(T)(T t) {
  util(t, 3);
}

Now you get all your cases handled and no compilation error if T implements one of the cases of util but not the others (I wonder if free function put does this?)
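One way the idea could be made to compile, as a sketch only. It swaps the separate overloads for a single template that picks a branch with static if, which sidesteps ambiguity between two templates with the same parameter list. Special, Plain and the returned strings are hypothetical and exist only so the branch taken is observable; hasMember is the real std.traits template.

```d
import std.traits : hasMember;

// Free function util: forwards to a matching member, else falls back.
string util(T, U)(T t, U u)
{
    // hasMember is checked first so the member call is only tested
    // when T really has a member named util.
    static if (hasMember!(T, "util") && is(typeof(t.util(u))))
        return t.util(u);   // type-specific case supplied by T
    else
        return "generic";   // generic fallback
}

// Hypothetical type that specializes only the int case.
struct Special
{
    string util(int a) { return "member"; }
}

struct Plain {}

string g(T)(T t) { return util(t, 3); }

unittest
{
    assert(g(Special()) == "member");
    assert(g(Plain()) == "generic");
    // Special has no string overload, so the fallback is used, not an error.
    assert(util(Special(), "text") == "generic");
}
```

Note that g calls util(t, 3) as a plain function call; writing t.util(3) would go straight to the member and bring back the original compile error for any type that defines util with other parameters.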

So, that leaves #2, which is what we have.
Basically, D's overload rules are designed to favor compilation errors over the risk of calling the wrong function, and while its import system provides ways to differentiate between free functions, it really doesn't provide a way to differentiate between a member function and a free function except via whether you use UFCS or not. And when those facts are taken into account, it makes the most sense for member functions to just win whenever a free function and a member function have the same name. It also has the bonus that it reduces compilation times, because if a free function could ever trump a member function or was in any fashion included in its overload set, then the compiler would have to check all of the available functions when UFCS is used instead of looking at the member functions and then only looking at free functions if there was no member function with that name.
- Jonathan M Davis

I'm not giving you the compilation times bonus point :p Yes I do agree it saves time but I doubt this would be an issue that would stop implementation if the things above were not an issue.


Cheers, thanks again for taking the time.
- Ali
