Re: [agi] A question on the symbol-system hypothesis

Mark Waser Thu, 30 Nov 2006 05:43:06 -0800

Bah! I hate it when I rush and get stupid. This is why I'm not apolitician (and why I think the Republican tirade/crusade against"flip-flopping" is so damaging/dangerous). But, it does serve to illustratea number of useful points so I'll just go with it . . . .

I'm writing this e-mail without any additional external information thanI had last night (though I expect to shortly be reading *several* e-mailsfrom the list telling me what an idiot I am :-). However, my subconsciousknowledge-retrieval processes have finally seen fit to provide me with anumber of "You know . . . . "s. I think that observations of this type arevery important to make when considering building an AI. Not allobservations will be compiled into knowledge and not all knowledge will beimmediately accessible to a system even if the system has what it needs toretrieve/derive the knowledge. Designs that assume total knowledgeintegrity and retrieval are exactly as bad as designs that assume infiniteprocessing power and memory.

Clearly, Philip is referring to the analogies questions of the SAT (notthe synonym questions that I got stuck on last night). Clearly, vectorshave direction in addition to distance. And, clearly, Philip is referringto the fact that the directions/vectors that the system generates are not inhuman-readable form . . . . (though I would argue that they are easilyhuman-comprehensible if you write a translator).

<I'm tempted to make a digression into how much commonknowledge/world-modeling we assume/rely upon -- knowledge that my brain wasnot coming up with last night and replacing with a poor substitute instead>

So let me extend and refine my stupid answer (because the core *is*still fundamentally correct) . . . .

Training SVDs on a given corpus produces a database that is alwaysfundamentally isomorphic to pairs of word-pairs and their similaritydistances (normally expressed as the number and frequency ofdimensions/common-usages they have in common) through a very simplealgorithm that compares how they are used in sentences. There are, ofcourse, also various representations that appear more vector-like but thefundamental isomorphism remains.

With the simplest SVD algorithms and most obvious cases, thesedirections can often be easily translated into human terms. For example,hat/head and hands/gloves both have dimensionalities of wore and wear.(Note, however, that if you wrote the SAT test specifically to confuse thistype of system without messing with humans, you could have examples likeyarmulke/temple (dimensions wear-in and wear-to) include possiblysystem-acceptable answers like hole/sock to distract from tuxedo/dance).

With many SVD systems, however, the representation is more vector-likeand *not* conducive to easy translation to human terms. I have two answersto these cases. Answer 1 is that it is still easy for a human to look atthe closest matches to a particular word pair and figure out what they havein common. Answer 2 is that I still contend that this is a major designflaw (which can also be rectified by taking the time to write a translator).You really, really, *really* don't want to create an intelligence that maybe both smarter/faster than you and seriously flawed -- and statisticalknowledge is very, very shallow; very prone to certain types of error; and*not* particularly conducive to being built upon (unless, of course, you useit merely as a subsystem and you're packing up it's results and sending themto an entirely different type of system). You clearly do *not* want asystem of this type at the core of your AI's reasoning processes --particularly since, I contend, this type of system is frequently (and insome classes of systems which are well behaved, always) isomorphic to asystem that *is* easily human-comprehensible. (Note that neural networks,in particular, are a class of system that are *not* well behaved because theinternal data structures formed by the neural network algorithms that weknow most frequently do not correspond to the real-world simplestexplanation unless you get really, really lucky in choosing your number ofnodes and your connections. Nature has clearly found a way around thisproblem but we do not know this solution yet.)


       Mark (going off to be plastered by replies to last night's message)

----- Original Message -----From: "Mark Waser" <[EMAIL PROTECTED]>

To: <agi@v2.listbox.com>
Sent: Wednesday, November 29, 2006 6:21 PM
Subject: Re: [agi] A question on the symbol-system hypothesis

Yes, it was insulting.  I am sorry.  However, I don't think this
conversation is going anywhere.  There are many, many examples just of
the use of SVD and PCI that I think meet your criteria.  The one I
mentioned earlier, to you, that uses SVD on word-pair similarities,
and scores at human-level on the SAT, is an example.  There are
thousands of examples.
Hmmm. We're definitely in very different realms and are currently talkingpast each other. I guess that I'm having trouble seeing how you wouldthink that SVD is at all human-incomprehensible. Training SVD on a givencorpus produces a set of word pairs and their similarity distances througha very simple algorithm. SVD taking the SAT simply involves four or fivedatabase lookups (assuming that there are four or five answers) for eachquestion and taking the answer with the smallest distance. What isincomprehensible about that? Why *can't* I debug a wrong answer (assumingthat I have access to the training corpus)?
----- Original Message -----From: "Philip Goetz" <[EMAIL PROTECTED]>
To: <agi@v2.listbox.com>
Sent: Wednesday, November 29, 2006 5:17 PM
Subject: Re: [agi] A question on the symbol-system hypothesis
On 11/29/06, Mark Waser <[EMAIL PROTECTED]> wrote:
> If you look into the literature of the past 20 years, you will easily
> find several thousand examples.
I'm sorry but either you didn't understand my point or you don'tknowwhat you are talking about (and the constant terseness of your repliesgivesme absolutely no traction on assisting you). If you would provide justoneexample and state why you believe it refutes my point, then you'll givemesomething to answer -- as it is, you're making a meaningless assertionof novalue that I can't even begin to respond to (not to mention the pointthatcontending/assuming that I've overlooked several thousand examples ispretty
insulting).
Yes, it was insulting.  I am sorry.  However, I don't think this
conversation is going anywhere.  There are many, many examples just of
the use of SVD and PCI that I think meet your criteria.  The one I
mentioned earlier, to you, that uses SVD on word-pair similarities,
and scores at human-level on the SAT, is an example.  There are
thousands of examples.

-----
This list is sponsored by AGIRI: http://www.agiri.org/email
To unsubscribe or change your options, please go to:
http://v2.listbox.com/member/?list_id=303



-----
This list is sponsored by AGIRI: http://www.agiri.org/email
To unsubscribe or change your options, please go to:
http://v2.listbox.com/member/?list_id=303

Re: [agi] A question on the symbol-system hypothesis

Reply via email to