Re: Neural language models (was Re: [singularity] Help get the 400k SIAI matching challenge on DIGG's front page)

Richard Loosemore Thu, 17 May 2007 10:55:29 -0700

Matt Mahoney wrote:

--- Richard Loosemore <[EMAIL PROTECTED]> wrote:
Matt Mahoney wrote:
I doubt you could model sentence structure usefully with a neural network
capable of only a 200 word vocabulary.  By the time children learn to use
complete sentences they already know thousands of words after exposure to
hundreds of megabytes of language. The problem seems to be about O(n^2).
As
you double the training set size, you also need to double the number of
connections to represent what you learned.


-- Matt Mahoney, [EMAIL PROTECTED]
The problem does not need to be O(n^2).
And remember: I used a 200 word vocabulary in a program I wrote 16years ago, on a machine with only one thousandth of today's power.
And besides, solving the problem of understanding sentences could easilybe done in principle with even a vocabulary as small as 200 words.
Richard Loosemore.
What did your simulation actually accomplish?  What were the results?  What do
you think you could achieve on a modern computer?

Oh, I hope there's no misunderstanding: I did not build networks to doany kind of syntactic learning, they just learned relationships betweenphonemic representations and graphemes. (They learned to spell). Whatthey showed was something already known for the learning ofpronunciation: that the system first learns spellings by rote, thenincreases its level of accuracy and at the same time starts to pick upregularities in the mapping. Then it starts to "regularize" thespellings. For example: having learned to spell "height" correctly inthe early stages, it would then start to spell it incorrectly as "hite"because it had learned many other words in which the spelling of thephoneme sequence in "height" would involve "-ite". Then in the laststages it would learn the correct spellings again.


Simple results (there were a few more tentative ideas, but not much).

My goal has always been to understand exactly what to put into thosekinds of mechanisms to get semantics and syntax to be learned in thesame powerful way. That is a task that has occupied me for 20 years(since before those spelling networks, in fact).

What could I do today? Ask me in a year or so. My guess, given all theexperience I have had writing systems and thinking about the issuesinvolved, is that many of the puzzles involved in building systems thatlearn in a powerful way are actually *much* easier than people thinkthey are, but to solve those puzzles we need to shake off a certain wayof thinking. The solutions are just around the corner, but ain't nobodygonna see them if they won't actually believe me enough to go around thecorner and look.

I do my best to shake people out of that way of thinking, but I feellike Stanislaw Lem's Ijon Tichy character, in the Seventh Voyage (in thebook 'Star Diaries') where he tries to shake past versions of himselfawake, but meets the stubborn resistance of people who really want to beleft alone to do some more sleeping.


That's how it feels to me, from my POV.



Richard Loosemore

-----
This list is sponsored by AGIRI: http://www.agiri.org/email
To unsubscribe or change your options, please go to:
http://v2.listbox.com/member/?member_id=4007604&user_secret=8eb45b07

Re: Neural language models (was Re: [singularity] Help get the 400k SIAI matching challenge on DIGG's front page)

Reply via email to