Re: A Discussion of Tuple Syntax

Meta Mon, 19 Aug 2013 11:46:35 -0700

On Monday, 19 August 2013 at 16:53:06 UTC, Wyatt wrote:

To be clear, I'm not talking about braces, {}; I'm talkingabout parentheses, (). I read over that whole DIP32 thread acouple times, and didn't see any rationale offered for why thelikely "cleanest" version "can't be used". It wasn't evenbrought up (unless I've missed something subtle). In thesecond thread, linked in the OP here, they were glossed overagain. Now, I fully believe there's a very good reason that'sbeen written somewhere, but I _would_ like to know what thatis, preferably documented somewhere less ephemeral anddifficult to search than the newsgroup (such as in DIP32). Theclosest I've seen so far is the pull request where Walter andAndrei expressed that it should be considered further.

I could very well be wrong, but I would bet that one of thereasons is that (a, b, c) expressions already have well-definedsemantics in D (as well as (2, "a", func()). Example:


void main()
{
        import std.stdio;
        
        //Prints "a"
        writeln((true, false, "a"));
}

Making this a tuple literal would be a change in semantics, whichI don't think would go over well and would break code. Anotherexample:


void main()
{
        int a, b;
        (a, b) = (3, 4);
        assert(a == 0 && b == 4);
}

Of course, for the second case, Kenji's proposed syntax used"auto (a, b) = ...", which would disambiguate it, but it couldconfuse people as to whether the first syntax is somehow relatedto the second.

The octothorpe _is_ much better than the t simply in terms ofreadability, though, even more than q{} or t{}, I have concernsabout its ability to be found with an ordinary search engine byan ordinary user. Have you tried looking for documentation onweird operators with a search engine lately? They don'texactly take to it well. :/ (cf. Perl's <=>)

I'm not sure how much of a problem that would be. There's onlyone other syntactic form that uses # in D, but you're right, itmay cause some difficulty trying to search "d programming #".

Addressing the other suggestion I saw that cropped up, Ipersonally find the two-character "bananas" to be impressivelyugly. I considered suggesting some permutation on that sameidea, but after toying with a few examples I find it ends uplooking awful and I think it's honestly annoying to type themin any form. I even don't like how the unicode version of thatone looks; for doubling up, I think ⟦ ⟧ or ⟪ ⟫ or are easier onthe eyes.

My browser can't even display the second set of characters. Dseems to have generally shied away from using any unicodeoperators (for a good reason. Who the hell has Σ on theirkeyboard?)

I feel weird admitting this, but if we can't use some manner ofbare brace, I think I'd rather have tup(), tup[], tup{} (oreven tuple() et al) as a prefix over any single character.

It's not terrible, but it's rather wordy, especially if tuplesbegin to be used a lot in code.

Can't make it a single underscore? Question mark works bestthen, IMO. It isn't as burdened with meanings elsewhere (surethere's ternary and possibly-match in regex, but...have Iforgotten something?)

It *could* be an underscore; the only thing is that theunderscore is a valid variable name, so the above expressionwould actually be binding two variables, which might surprisesomeone who was expecting otherwise. I don't really care all thatmuch, but it's something to think about.

#(a, ...) looks like to me like it would make a 2-tuplecontaining a and a tuple of "everything else", because of theellipsis' use in templated code. I think this is a littleunclear, so instead I'd prefer #(a, ? ...) (or whatever ends upused for the discard character) to make it explicit.

To be clear, what I have in mind is that this would be "a, plus(none/one?) or more things that can either be elements or nestedtuples". Then, in a construction such as #(head, rest...), restwould be exactly as you describe: a tuple consisting ofeverything after head. The semantics could get tricky, maybe thisneeds more thought.

As a bonus, explicit discard means a simple comma omission isless likely to completely change the meaning of the statement.Compare:
#(a, b, ...)   //bind the first two elements, discard the rest.
#(a, b ...) //bind the first element to a and everythingelse to b
#(a, b, ? ...) //same as the first
#(a, b ? ...)  //syntax error

Granted, there's this case:
#(a, ?, ...)
...but that seems like it would be less common just based onhow people conventionally order their data structures.

That's true. Something to think about. Maybe combine the questionmark and ellipsis like so:


#(a, b, ?..)

Thought: Is there sufficient worth in having different tokensfor discarding a single element vs. a range? e.g.#(a, ?, c, * ...) //bind first and third elements; discard therest
// I'm not attached to the asterisk there.
// +, #, or @ would also make some amount of sense to me.


Not sure. I need to think about it.

- Concatenating tuples with ~. This is nice to have, but notparticularly important.
What does concatenating a tuple actually do?  That is:
auto a = #(1,2) ~ 3; //Result: a == #(1,2,3), right?
auto b = a ~ #(4,5); //Is b == #(1,2,3,#(4,5)) or is b ==#(1,2,3,4,5)?


I think it should work the same as with arrays. So:

auto a = #(1, 2) ~ 3; //Error: 3 is not a tuple
auto a = #(1, 2) ~ #(3); //Result: #(1, 2, 3), just like an array

auto b = a ~ #(4, 5); //Result: #(1, 2, 3, 4, 5). Again, likearrays.

I think keeping the same semantics as arrays would be the bestway to do it. I think it nicely follows the principle of leastastonishment. If you wanted to explicitly append a tuple and haveit nested, you'd need to do:


auto b = a ~ #(#(4, 5));

Which is messy, but at least it's explicit about what is going on.

Great! After this, let's fix properties. ;)


Oh boy, no need to start *another* flame war.

Re: A Discussion of Tuple Syntax

Reply via email to