Keith Elis wrote:

Answer me this, if you dare: Do you believe it's possible to design an
artificial intelligence that won't wipe out humanity?

Yes, most certainly I do.

I can hardly stress this enough.

Did you read my previous post on the subject of motivation systems? It contained much of the idea. I am getting close to the point where I might have enough spare time to write it up in proper form and put it online, but in the meantime it does exist in the Singularity list archive:

http://www.mail-archive.com/singularity@v2.listbox.com/msg00316.html

The core of the argument for safety is that the types of future AGI system being discussed now are based on an extrapolation of the "canonical" AI design of today ... including the egregious flaws in that canonical design, which are the same flaws that are preventing us from actually building a generally intelligent system.

The flaw I have in mind (there are many, but this is one of the biggest) involves the mechanism that drives the system to do what it does. Currently, this mechanism is assumed to be an extrapolation of the "goal stack" idea, in which goals are represented as explicit statements in some logical language. For this to work, the system has to interpret the statements: it has to know what the terms in the statements mean, and it has to understand how those terms combine to yield the meaning of the statement. This works (kind of) for narrow AI, but it is worse than useless for a real AGI: how the system interprets the meaning of an abstract statement is completely outside the researcher's control, and while the system is growing up (as a real AGI must do, whereas narrow AIs never do this), it cannot use sophisticated concepts before it has learned them, so it has to make do with only those concepts interpretable by a baby.... clearly a ridiculous situation.
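
To make the problem concrete, here is a deliberately crude sketch in Python of what a goal-stack driver amounts to. The names (GoalStack, interpret, plan) are invented for the example; this is nobody's actual architecture, just an illustration of the single interpretation step that everything hangs on:

    # Illustrative sketch only: 'GoalStack', 'interpret' and 'plan' are
    # invented names for this example, not part of any real AGI design.

    class GoalStack:
        def __init__(self, top_level_goal: str):
            # The supergoal is an explicit statement in some logical language.
            self.stack = [top_level_goal]

        def next_action(self, interpret, plan):
            # Everything hinges on one interpretation step. The meaning of the
            # statement is whatever the system has learned it to be, not what
            # the designer intended, and every action follows from that one
            # reading.
            goal = self.stack[-1]
            meaning = interpret(goal)   # learned by the system, not controlled
            return plan(meaning)        # all behaviour flows from this reading

    driver = GoalStack("be friendly to all humans and help them get what they want")

If the learned interpretation drifts, the whole motivational system drifts with it, and nothing else in the design pushes back.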

All in all, there is a need for a more sophisticated motivational system. When you look into how to do that, it becomes clear that you should never try to control an AGI with a single statement (like "Be Friendly to all humans and try to help them get what they want"), because that is single-point-of-failure control .... it's a joke. It can never be made stable.

Instead, what you do is build the motivational system in such a way that it must always operate from a massive base of thousands of small constraints. A system that is constrained in a thousand different directions simply cannot fail in the way that a system constrained by a single supergoal is almost guaranteed to fail.
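
Again as a purely illustrative sketch (the constraint set and the veto rule here are made up for the example, not a description of my actual design), the contrast is roughly this: instead of one interpretation step that can silently go wrong, every candidate action has to clear a very large number of small, independent checks, so no single misreading can redirect the system on its own:

    # Illustrative sketch only: the constraints and the veto threshold are
    # invented for this example.

    from typing import Callable, Iterable

    Constraint = Callable[[object], bool]   # True means the action is acceptable

    def acceptable(action, constraints: Iterable[Constraint],
                   max_violations: int = 0) -> bool:
        # An action goes ahead only if (almost) every one of the thousands of
        # small constraints agrees. A single corrupted or misinterpreted
        # constraint cannot, by itself, push the system into bizarre behaviour;
        # with a single supergoal, one misinterpretation is all it takes.
        violations = sum(1 for c in constraints if not c(action))
        return violations <= max_violations

    # e.g. many tiny checks, each one easy to state and easy to audit:
    constraints = [lambda a: a != "deceive", lambda a: a != "coerce"]
    print(acceptable("help", constraints))   # True

The point of the sketch is only the shape of the thing: the behaviour is an aggregate of many small pressures, not the consequence of one statement.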

There is more to the argument than that, of course, but the bottom line is that when I see arguments about what an AGI "would" do, I am beside myself with frustration: these are almost always based on the assumption that the AGI is governed by a crude goal-stack motivational system, which (a) will probably never yield an AGI that is smart enough to be a threat (that is why narrow AI is so stupidly useless), and (b) is so wildly unstable that nobody would try to use it anyway, because they will use a broad-based massive-constraint system instead.

The massive-constraint system can be designed, I believe, in such a way that the probability of it going AWOL could be made so low as to be negligible..... we are talking about a system with about as much likelihood of doing something outside its initial (friendly) motivation as the likelihood of the sun suddenly quantum tunneling to the vicinity of Betelgeuse. If that isn't stable enough for people, I don't know what would be.

The stupid part of this is that, as you probably know, I tried to get this argument discussed on SL4 and ran straight into Yudkowsky's gang. Then I tried to get it discussed here, and there was a bit of talk, but nothing much. Amazing, for a topic that generates so much angst.

I sometimes think people actually would prefer there to be a doomsday scenario, because they like to be scared, or they want to always believe the worst. Solutions to scary problems seem ... boring?



Richard Loosemore
