Re: [agi] rule-based NL system

Matt Mahoney Tue, 01 May 2007 09:55:11 -0700

--- Mark Waser <[EMAIL PROTECTED]> wrote:

> > we want computers to
> > understand natural language because we think: if you know the syntax,
> > the semantics follow easily
> 
> Huh?  "We" don't think anything of the sort.  Syntax is relatively easy. 
> Semantics are AGI.


Not really.  Semantics is an easier problem.  Information retrieval and text
classification systems work pretty well by ignoring word order.  (I know there
are exceptions where word order is important, like "Bob hit Alice").  On the
other hand, parsing natural language is an unsolved problem.

Word order is extremely important in artificial languages such as C.  Almost
any rearrangement of the words in a program will change its meaning.  When you
write a compiler, you develop it in this order: lexical, syntax, semantics. 
This approach does not work for natural language.  Syntax depends on
semantics.  The parse of "I ate pizza with X" depends on the meaning of X. 
Children learn language in the order: lexical, semantics, syntax.

Artificial languages were designed to be processed efficiently on fast,
sequential computers in a clean environment.  Thus we have strictly
unambiguous parsing, a stack-based short term memory (context free grammar),
long chains of sequential dependencies, and a lack of error recovery
capabilities.  On the other hand, natural language evolved to be processed on
massively parallel computers with slow, unreliable components in a noisy
environment with a small short term memory (about 100 bits), and a slow long
term learning rate (a few bits per second).  Thus, we have noisy, ambiguous
messages whose resolution requires integrating thousands of learned lexical,
semantic, syntactic, and pragmatic constraints simultaneously, and a language
structure that allows for gradual but efficient learning and updating of these
constraints in the course of normal communication.

This is not to say that a neural architecture like the brain is the best way
to process natural language.  Rather, natural language is the most efficient
language for communication between the neural computers we already have.


-- Matt Mahoney, [EMAIL PROTECTED]

-----
This list is sponsored by AGIRI: http://www.agiri.org/email
To unsubscribe or change your options, please go to:
http://v2.listbox.com/member/?member_id=231415&user_secret=fabd7936

Re: [agi] rule-based NL system

Reply via email to