Re: A Discussion of Tuple Syntax

Wyatt Mon, 19 Aug 2013 09:56:08 -0700

Note: I'm leading off with a reply to bearophile transplantedhere to stop making OT noise in John's thread about TypeTuple.


On Friday, 16 August 2013 at 23:23:59 UTC, bearophile wrote:


It's short, clear, has a precedent with q{}.

Wait, what is q{}? That's something in D? What does that evendo? I can infer that q{} is probably some manner of scoping orgrouping _something_ somehow, but I have to dig into lexical andmanually search for q{ to find out it's [neither of the things Iexpected]. In my view, this right here is really just afundamental problem with single-character prefixes and I feelthat's something we should endeavour to avoid, if possible.

I don't like it a lot, but it's way better than not having
language support for tuples.

On this, I think we all agree.

I'd prefer just using parentheses, but I think there werereadability problems that caused the DIP to end up with:
More than just readability problems. They were discussed whenKenji presented the DIP 32 in this forum. Timon found asignificant problem with the {} syntax.

To be clear, I'm not talking about braces, {}; I'm talking aboutparentheses, (). I read over that whole DIP32 thread a coupletimes, and didn't see any rationale offered for why the likely"cleanest" version "can't be used". It wasn't even brought up(unless I've missed something subtle). In the second thread,linked in the OP here, they were glossed over again. Now, Ifully believe there's a very good reason that's been writtensomewhere, but I _would_ like to know what that is, preferablydocumented somewhere less ephemeral and difficult to search thanthe newsgroup (such as in DIP32). The closest I've seen so faris the pull request where Walter and Andrei expressed that itshould be considered further.


On Friday, 16 August 2013 at 21:07:52 UTC, Meta wrote:

- #(a, b) is unambiguous and would probably be the easiestoption. I don't think it looks too bad, but some people mightfind it ugly and noisy

The octothorpe _is_ much better than the t simply in terms ofreadability, though, even more than q{} or t{}, I have concernsabout its ability to be found with an ordinary search engine byan ordinary user. Have you tried looking for documentation onweird operators with a search engine lately? They don't exactlytake to it well. :/ (cf. Perl's <=>)

Addressing the other suggestion I saw that cropped up, Ipersonally find the two-character "bananas" to be impressivelyugly. I considered suggesting some permutation on that sameidea, but after toying with a few examples I find it ends uplooking awful and I think it's honestly annoying to type them inany form. I even don't like how the unicode version of that onelooks; for doubling up, I think ⟦ ⟧ or ⟪ ⟫ or are easier on theeyes.

It's times like these that I wish the standard PC keyboard hadsomething like guillemets « », or corner brackets ｢｣ (big fan ofthese) in addition to everything else. (Or even that we could use< > for bracing, though at this point I don't think I couldeasily condone that move for D).

I feel weird admitting this, but if we can't use some manner ofbare brace, I think I'd rather have tup(), tup[], tup{} (or eventuple() et al) as a prefix over any single character.

Another stray thought: is there room for a little box of syntaxchocolate so that e.g. tuple(), [||], and ⟦ ⟧ are all valid? Idon't know if we have a precedent like that off the top of myhead and I'm pretty sure I don't like it, but I thought I'd atleast mention it.

- There was no consensus on the pattern matching syntax forunpacking. For example, #(a, _) = #(1, 2) only introduces onebinding, "a", into the surrounding scope. The question is, whatcharacter should go in the place of "_" to signify that a valueshould not be bound? Some suggestions were #(a, $), #(a, @),#(a, ?). I personally think #(a, ?) or #(a, *) would be best,but all that's really necessary is a symbol that cannot alsobe an identifier.

Can't make it a single underscore? Question mark works best then,IMO. It isn't as burdened with meanings elsewhere (sure there'sternary and possibly-match in regex, but...have I forgottensomething?)

Also up for debate was nested patterns, e.g., #(1, 2, #(3,4, #(5, 6))). I don't think there was a consensus on unpackingand pattern matching for this situation. One idea I saw thatlooked good:

Ah, I was wondering about the case of a tuple of tuples. It'snot mentioned in the DIP that I saw, so I assumed it was allowed,but explicit mention is probably warranted.

* Use "..." to pattern match on the tail of anexpressions, so take the above tuple. The pattern #(1, ?, ...)would match the two nested sub-tuples. Or, say, #(1, 2, 3)could be matched by #(1, 2, 3), #(1, ?, 3), #(1, ...), etc. Youobviously can't refer to "..." as a variable, so it alsobecomes a useful way of saying "don't care" for multiple items,e.g., #(a, ...) -> only bind the first item in the tuple. We

#(a, ...) looks like to me like it would make a 2-tuplecontaining a and a tuple of "everything else", because of theellipsis' use in templated code. I think this is a littleunclear, so instead I'd prefer #(a, ? ...) (or whatever ends upused for the discard character) to make it explicit.

Assuming the "..." syntax for unpacking, it would be useful toname the captured tail. For example, you could unpack #(1, 3,#(4, 6)) into #(a, b, x...), where a = 1, b = 3, x = #(4, 6).Similarly, #(head, rest...) results in head = 1, rest = #(2,#(4, 6)). I think this would be very useful.

As a bonus, explicit discard means a simple comma omission isless likely to completely change the meaning of the statement.Compare:

#(a, b, ...)   //bind the first two elements, discard the rest.

#(a, b ...) //bind the first element to a and everything elseto b

#(a, b, ? ...) //same as the first
#(a, b ? ...)  //syntax error

Granted, there's this case:
#(a, ?, ...)

...but that seems like it would be less common just based on howpeople conventionally order their data structures.

Thought: Is there sufficient worth in having different tokens fordiscarding a single element vs. a range? e.g.#(a, ?, c, * ...) //bind first and third elements; discard therest

// I'm not attached to the asterisk there.
// +, #, or @ would also make some amount of sense to me.

- Concatenating tuples with ~. This is nice to have, but notparticularly important.

What does concatenating a tuple actually do?  That is:
auto a = #(1,2) ~ 3; //Result: a == #(1,2,3), right?

auto b = a ~ #(4,5); //Is b == #(1,2,3,#(4,5)) or is b ==#(1,2,3,4,5)?

This is the third or fourth time that I know of that tuplesyntax has come up, and as of yet, nothing has been done aboutit. I'd really like to get the ball rolling on this, as I thinka good syntax for these tuple operations would do D a world ofgood. I'm not a compiler hacker, unfortunately, so I can'timplement it myself as proof of concept... However, I hope thatdiscussing it and working out all the kinks will help pave theway for an actual implementation.


Great! After this, let's fix properties. ;)

-Wyatt

Re: A Discussion of Tuple Syntax

Reply via email to