Re: Unicode block for programming related symbols and codepoints?

Jean-François Colson Sun, 08 Feb 2015 14:48:15 -0800


Le 08/02/15 23:07, Alfred Zett a écrit :

Hi Jean-Francois Colson,
I hope this doesn't mess up the mailing list.
- Indentation codepoint, with no fixed defined graphicalrepresentation. For indentation based programming languages.
That wouldn’t be compliant with existing languages and futurelanguages might use any existing character.
This was for new languages. Creators of future languages mostly orienton whatever is available and make sense, so I may make this proposalas well, so they don't have to choose the half-assed workarounds theyuse now.


I need a few tens of characters for a conlang I’m developping. ☺

The problem is that Unicode only encodes characters which areeffectively used today or which have been used in the past. It doesn’tencode characters which could perhaps be used in a hypothetical newprograming language in the future.

Also, as long as there is stuff likehttps://github.com/sferik/active_emoji it still makes more sense.
Because:
-- specific clients may want to show it different (for example asarrows, lines etc., using another color):
Can’t good editors display tabs in a different color when required ?
Not as reliable and customizable as a special codepoint. For example
--- browsers could let the web page creator let decide the visualrepresentation (character and size) via CSS
can't be done and on-the-fly copy and paste conversion with JavaScriptis horrid and broken for security reasons.But it's an issue even in good editors as well. You need a lexingplugin that may work or not. And the size and other factors are stillfixed. After all, tabs have whitespace semantics that may appeareverywhere in the text.
--- the same with editors, independent from the actual font
--- in case of visual impairment, the user could even change theaccoustical representation if the editor allows it-- unlike a space symbol, it wouldn't need more than one characterper indentation
-- unlike tabs or space, it wouldn't be whitespace
-- unlike normal arrow characters, one could customize the length inan editor and wouldn't have to insert extra spaces for a bettervisual imagery
- A codepoint for string literal quotes, that would spare one theescaping.
I rarely escape quotes.
In a text, I use ’ (U+2019) as an apostrophe and «»“”‘’ as quotes, soI don’t need to escape them.When I use PHP to generate some HTML code, I try to alternate simpleand double quotes as much as possible. That way I rarely need toescape them.
OK, but that's just your scenario. With a language design from thepast. With probably an editor from the past that allows non-unicodeencodings. In a better world, manual code point inserting was a lastresort.
Imagine someone wants to make his text look like written with atypewriter. Or something else.
- A statement separator symbol.
To replace the semicolon in C and the languages based on its syntax?
Again, for future uses. To be honest, this might sound questionable,but this could blur the line between visual line breaks and visualcharacters like semicolons.
Line-break ended comments are separator ended comments.
Of course, that's the least required part of those three proposedcharacters, but I thought for the sake and completeness that shouldn'tmiss.
Come to think of it, two sets of opening and closing block symbolscouldn't harm either. And a continue-after-linebreak symbol as well.
- Other ideas?
Aren’t you trying to reinvent APL?
No. APL places a lot of alien-looking, annoying characters to anyoneexcept mathematicians into your code that are hard to input. Inparticular from the context.
My proposal on the other hand - if implemented right - introduces somereally intuitive looking and easy to input characters, because a boldarrow at the left doesn't need further explanation and your IDE of thefuture can easily place them when pressing tab in the right position.
_______________________________________________
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode


_______________________________________________
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode

Re: Unicode block for programming related symbols and codepoints?

Reply via email to