need some guidance on Python syntax smart editor for use with speech recognition

Eric S. Johansson Sun, 04 Jan 2015 23:52:01 -0800

Some of you will recognize me as someone who pops up occasionally askingquestions as I grope my way to a usable speech driven programmingenvironment. My last set of experiments with a technique calledtogglename and speech driven template notation hit a pretty nasty wallof usability because of a fundamental incompatibility between GUIs andspeech recognition and the lack of support Nuance gives to disabledusers in general.

Before anybody suggests it, yes I know about that guy who gave a talk ata python convention and uses what we call the burp, belch, and fartschool of speech recognition engine abuse. yes that is actually anaffectionate description. :-) what he did is impressive but it's notwhere I'm going

I think the techniques I was experimenting with are good ones becausethey do make it easier to speak code. the problem comes about because ofthe irreversibility of the transformation making editing code asdifficult as it was before.

A little background. Today, Python is an amazingly speech recognitionfriendly programming language (especially if you ignore pep-8). Usingsimple macros, you can pretty much noodle along and write coderelatively easily. A few more specialized pieces and it's almost easy torip, shred, and tear code into new shapes as you realize you went downthe wrong path but still have lots of good idioms.

However, as easy as it is to noodle along, creating code I find myselfsomewhere around 0.8 as effective as I was with my hands and in editingcode, I'm around 0.5 or less. My goal is to make speech drivenprogramming at least on a parity with someone who has useful hands andhopefully 3 to 5 times faster.

a few years ago, a disabled friend of mine pointed out that the hardproblem was not the creation of code but the editing of code. I took hisobservations to heart and have been working on trying to create a speechfriendly environment that that can transform from the speech notation tothe code notation and back again and still remain functionallyidentical. I have some ideas but I need some outside perspective frompeople who know Python better than I do.

The core of the idea is an editor which can present code in two forms.The first form is what you guys all know in love but is horrible tospeak. The second form is something that is easy to speak, and as I saidabove, functionally identical to the code form. An ideal solution wouldgive me the ability to toggle back and forth between these tworepresentations. An experiment would be to play with is displaying bothrepresentations at the same time so you can see what you speak in nearreal-time.

The speech environment lends itself to speaking the broad intent andthen answering questions to fill in the detail to create somethingconcrete. For example, in one of my prototypes (shown below), I statethat I want a class. Then I fill a detail like an initializationfunction, inheriting from a parent, copying in all the arguments etc.and I end up with a full class definition much more quickly than I couldeven type it with good hands. This is what I meant above by 3 to 5 timesfaster than hand generated code.

But with every experimental success, there is usually more than oneproblem. In this case is that I lose all the meta-information when Icreate the instance of the intent plus detail. I can't go back to thatabstract form.

The obvious answer is saving that meta-information in conjunction withthe code but when working in a team environment, that information isgoing to drive you handies up the wall because it's going to visuallyoverwhelm the actual code. Serving the meta-information separately willmean it's even harder to recover a speech friendly version of the codeafter it's been touched.

Another thought experiment has been with always generating syntacticallycorrect code and basing various code generation and navigationconstructs around that.


So the questions I have right now are, or

what's a good open editor ( preferably multiplatform) that actuallydecomposes Python code into fundamental components such as class,expression, etc. and, lets you operate on those components? this is incontrast to editors such as Emacs which give you some fundamental piecesyou can operate on but it's really character oriented and all of thesyntax smartness not really available for coupling to speech recognitionenvironment. it would be great if it was in Python so I don't have tolearn yet another fricking language.

What would be the best way to store meta-information necessary tore-create the speech friendly presentation of code? I don't know if thisis possible but I would like to be able to let handy programmers makechanges that will be propagated automatically into the speech friendlycode presentation without forcing them to learn the new notation.

An example of this is the definition of the class. In my world, a classdefinition looks like this:


uses name:sta
uses init:yes
uses parent:dict
uses arg_list:magic dictonary, long sting, nuance sucks
uses super_arg_list:$arg_list
template class

Note: yes, I speak every single character or type it but with a smarteditor, there's a bunch of optimizations one can use in data entry giventhe context. also, since I wrote this example, I realize that the usesstatement is superfluous and I could just use template: <name> As thetrigger for creating the instance of the template.


going from speech notation to code notation, I generate this:

class simple_class (super_nasty_class):
    def __init__(self, magic dictonary, long sting, nuance sucks):

super(simple_class,self).__init__(magic dictonary, long sting,nuance sucks)

Note: there is a mix of, what I call, codenames and string names inthese examples. The togglename process would transform all string namesinto codenames at some later point in the user experience.

To elaborate on an earlier question, if someone put a doc string intothe class definition I would need to be able to recognize it and put itback into the speech friendly form. Something like this:


class simple_class (super_nasty_class):
    """this is a real simple class to identify problems in the
       speech user interface
   """
    def __init__(self, magic dictonary, long sting, nuance sucks):

super(simple_class,self).__init__(magic dictonary, long sting,nuance sucks)


when transformed back into speech friendly form, it should look like:
uses name:sta
uses init:yes
uses parent:dict
uses arg_list:magic dictonary, long sting, nuance sucks
uses doc_string:

this is a real simple class to identify problems in the speech userinterface

uses super_arg_list:$arg_list
template class

Speech driven programming is a hard problem. So thoughts, ideas would bewelcome. Don't worry about giving me old ideas that have been looked atand rejected because you may have a take on it that I haven't seenconsidered and it's worth trying.

Thank you for reading this far. I know it's a long message and on anunfamiliar topic so I appreciate your attention.


--- eric



--
https://mail.python.org/mailman/listinfo/python-list

need some guidance on Python syntax smart editor for use with speech recognition

Reply via email to