Re: [Semi OT] The programming language wars

Joakim via Digitalmars-d Sat, 21 Mar 2015 12:15:57 -0700

On Saturday, 21 March 2015 at 14:07:28 UTC, FG wrote:

On 2015-03-21 at 06:30, H. S. Teoh via Digitalmars-d wrote:
On Sat, Mar 21, 2015 at 04:17:00AM +0000, Joakim viaDigitalmars-d wrote:
[...]
What I was going to say too, neither CLI or GUI will win,speechrecognition will replace them both, by providing the best ofboth.Rather than writing a script to scrape several shoppingwebsites forthe price of a Galaxy S6, I'll simply tell the intelligentagent on mycomputer "Find me the best deal on a S6" and it will go findit.
I dunno, I find that I can express myself far more preciselyandconcisely on the keyboard than I can verbally. Maybe foreveryday taskslike shopping for the best deals voice recognition is GoodEnough(tm),but for more complex tasks, I have yet to find something moreexpressive
than the keyboard.
"Find me the best deal on a S6" is only a little more complexthan "make me a cup of coffee." Fine for doing predefined tasksbut questionable as an ubiquitous input method. It's hardenough for mathematicians to dictate a theorem without usingany symbolic notation. There is too much ambiguity and room forinterpretation in speech to make it a reliable and easy inputmethod for all tasks. Even in your example:
You say: "Find me the best deal on a S6."
I hear: "Fine me the best teal on A.S. six."
Computer: "Are you looking for steel?"

Just tried it on google's voice search, it thought I said "Findme the best deal on a last sex" the first time I tried. After3-4 more tries- "a sex," "nsx," etc- it finally got it right.But it never messed up anything before "on," only theintentionally difficult S6, which requires context to understand.Ask that question to the wrong person and they'd have no ideawhat you meant by S6 either.

My point is that the currently deployed, state-of-the-art systemsare already much better than what you'd hear or what you thinkthe computer would guess, and soon they will get that last bitright too.

Now imagine the extra trouble if you mix languages. Also, howdo you include meta-text control sequences in a message? Byraising your voice or tilting your head when you say the magicwords? Cf.:
"There was this famous quote QUOTE to be or not to be END QUOTEon page six END PARAGRAPH..."

Just read that out normally and it'll be smart enough to knowthat the upper-case terms you highlighted are punctuation marksand not part of the sentence, by using various grammar and wordfrequency heuristics. In the rare occurrence of real ambiguity,you'll be able to step down to a lower-level editing mode andcorrect it.

Mixing languages is already hellish with keyboards and will be alot easier with speech recognition.

Very awkward, if talking to oneself wasn't awkward already.

Put a headset on and speak a bit lower and nobody watching willknow what you're saying or who you're saying it to.

Therefore I just cannot imagine voice being used anywhere whereexact representation is required, especially in programming:
"Define M1 as a function that takes in two arguments. The stateof the machine labelled ES and an integer number in rangebetween two and six inclusive labelled X. The result of M1 is aboolean. M1 shall return true if and only if the ES memberlabelled squat THATS SQUAT WITH A T AT THE END is equal to zeromodulo B. OH SHIT IT WAS NOT B BUT X. SCRATCH EVERYTHING."

As Paulo alludes to, the current textual representation ofprogramming languages is optimized for keyboard entry.Programming languages themselves will change to allow fluidspeech input.


On Saturday, 21 March 2015 at 15:13:13 UTC, Piotrek wrote:

Just for fun. A visualization of the problem from 2007 (I doubtthere was breakthrough meanwhile)
https://www.youtube.com/watch?v=MzJ0CytAsec

Got a couple minutes into that before I knew current speechrecognition is much better, as it has progressed by leaps andbounds over the intervening eight years. Doesn't mean it's goodenough to throw away your keyboard yet, but it's nowhere nearthat bad anymore.


On Saturday, 21 March 2015 at 15:47:14 UTC, H. S. Teoh wrote:

It's about the ability to abstract, that's
currently missing from today's ubiquitous GUIs. I wouldwillingly leavemy text-based interfaces behind if you could show me a GUI thatgives methe same (or better) abstraction power as the expressiveness ofa CLIscript, for example. Contemporary GUIs fail me on the followingcounts:
1) Expressiveness: there is no simple way of conveying complex

--snip--

5) Precision: Even when working with graphical data, I prefertext-basedinterfaces where practical, not because text is the best way toworkwith them -- it's quite inefficient, in fact -- but because Icanspecify the exact coordinates of object X and the exactdisplacement(s)I desire, rather than fight with the inherently imprecise mousemovement
and getting myself a wrist aneurysm trying to position object X
precisely in a GUI. I have yet to see a GUI that allows you tospecify
things in a precise way without essentially dropping back to a
text-based interface (e.g., an input field that requires you totype innumbers... which is actually not a bad solution; many GUIsdon't evenprovide that, but instead give you the dreaded slider controlwhich isinherently imprecise and extremely cumbersome to use. Or worse,the textbox with the inconveniently-small 5-pixel up/down arrows thatchangesthe value by 0.1 per mouse click, thereby requiring animpracticalnumber of clicks to get you to the right value -- if you'rereallyunlucky, you can't even type in an explicit number but can onlyuse
those microscopic arrows to change it).

A lot of this is simply that you are a different kind of computeruser than the vast majority of computer users. You want to drivea Mustang with a manual transmission and a beast of an engine,whereas most computer users are perfectly happy with their Tauruswith automatic transmission. A touch screen or WIMP GUI suitstheir mundane tasks best, while you need more expressiveness andcontrol so you use the CLI.

The great promise of voice interfaces is that they will _both_ besimple enough for casual users and expressive enough for powerusers, while being very efficient and powerful for both. Westill have some work to do to get these speech recognitionengines there, but once we do, the entire visual interface toyour computer will have to be redone to best suit voice input andnobody will use touch, mice, _or_ keyboards after that.

Re: [Semi OT] The programming language wars

Reply via email to