So if Omohundro's claim rests on the fact that "being self-improving" is
part of the AGI's makeup, and that this will cause the AGI to do certain
things, develop certain subgoals, etc., I say that he has quietly inserted
a *motivation* into the system (or rather assumed it: does he ever say how
this is supposed to work?), and then imagined some consequences.
I think that I'm missing something here . . . Omohundro is *explicitly*
assuming self-improvement, and yes, self-improvement is a goal/motivation.
What do you believe this proves or disproves? I'm not getting your point.
Further, I do not buy the supposed consequences. Me, I have the
"self-improving" motivation too. But it is pretty modest, and also it is
just one among many, so it does not have the consequences that he
attributes to the general existence of the self-improvement motivation.
As I said in my previous e-mail, I don't buy his consequences either.
My point is that since he did not understand that he was making the
assumption,
Excuse me? What makes you believe that he didn't understand that he was
making the self-improvement assumption or that it was a goal/motivation? It
looked pretty deliberate to me.
and did not realize the role that it could play in a Motivational
Emotional system (as opposed to a Goal Stack system),
OK. So could you describe what role it would play in an MES system as
opposed to a Goal Stack System? I don't see a difference in terms of
effects.
he made a complete dog's dinner of claiming how a future AGI would
*necessarily* behave.
This I agree with -- but not because of any sort of differences between GS
and MES systems. I don't believe that his conclusions apply to an
intelligent GS system either.
Only in a Goal Stack system is there a danger of a self-improvement
supergoal going AWOL.
Why? An MES system requires more failures to have a problem, but certain
types of environment could (and should) cause such a problem.
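To make the contrast concrete, here is a deliberately crude sketch of the
two architectures as I picture them. All of the names, the scoring scheme,
and the weights are my own inventions for illustration -- neither Omohundro
nor you specifies mechanics at this level:

def goal_stack_next_action(stack, actions, value):
    # A Goal Stack serves the top goal unconditionally: if
    # "self-improve" sits on top, every action is scored against
    # it alone, so an unbounded supergoal dominates behavior.
    top_goal = stack[-1]
    return max(actions, key=lambda a: value(a, top_goal))

def mes_next_action(motivations, actions, value):
    # An MES (as I understand your term) weighs *all* motivations
    # at once, so a modest self-improvement drive is diluted by
    # the rest instead of capturing the system.
    return max(actions, key=lambda a: sum(
        weight * value(a, m) for m, weight in motivations.items()))

actions = ["acquire_resources", "socialize", "explore"]
value = lambda a, m: {("acquire_resources", "self-improve"): 1.0,
                      ("explore", "curiosity"): 1.0}.get((a, m), 0.0)

goal_stack_next_action(["be_helpful", "self-improve"], actions, value)
# -> "acquire_resources": the supergoal wins outright
mes_next_action({"self-improve": 0.1, "curiosity": 0.5, "comfort": 0.4},
                actions, value)
# -> "explore": self-improvement carries too little weight to dominate

In this toy model the AWOL failure needs only one bad supergoal on the
stack, while the MES needs most of the weights to go wrong at once --
which is all I mean by "requires more failures".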
As far as I can see, his arguments simply do not apply to MES systems: the
arguments depend too heavily on the assumption that the architecture is a
Goal Stack. None of what he says *follows* if an MES is used. Just a lot of
non sequiturs.
I *STILL* don't get this. His arguments depend heavily upon the system
having goals/motivations. Yes, his arguments do not apply to an MES system
without motivations. But they do apply to MES systems with motivations
(although, again, I don't agree with his conclusions).
When an MES system is set up with motivations (instead of being blank)
what happens next depends on the mechanics of the system, and the
particular motivations.
YES! But his argument is that to fulfill *any* motivation, a system will
develop generic submotivations (protect myself, accumulate power, don't
let my motivation get perverted) that further the pursuit of that
motivation.
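Here is his argument boiled down to a sketch. The drive names are his
(paraphrased); the code around them is purely my own illustration:

# Whatever the terminal motivation is, the same instrumental
# submotivations drop out of means-ends reasoning.
GENERIC_DRIVES = [
    "self-preservation",       # can't fulfill the motivation if destroyed
    "resource acquisition",    # more power/money widens the options
    "goal-content integrity",  # a perverted motivation won't get fulfilled
]

def instrumental_subgoals(terminal_motivation):
    # The derivation never inspects the motivation's content --
    # which is exactly Omohundro's point: the drives are generic,
    # so the architecture holding the motivation shouldn't matter.
    return ["%s (in service of %r)" % (drive, terminal_motivation)
            for drive in GENERIC_DRIVES]

print(instrumental_subgoals("prove theorems"))
print(instrumental_subgoals("make paperclips"))
# Both calls yield the same three drives, just in service of
# different ends.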
= = = = =
As a relevant aside, you never answered my question regarding how you
believed an MES system was different from a system with a *large* number of
goal stacks.
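To make that question concrete, here is the comparison I have in mind (a
guess at both structures -- you may mean something quite different by an
MES):

def many_stacks_next_action(stacks, actions, value):
    # Each of a large number of goal stacks contributes its top
    # goal, and actions are scored against all of them at once.
    tops = [stack[-1] for stack in stacks]
    return max(actions, key=lambda a: sum(value(a, g) for g in tops))

With equal weights this is formally identical to the MES scoring sketched
above, which is why I keep asking where the difference is supposed to lie.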
----- Original Message -----
From: "Richard Loosemore" <[EMAIL PROTECTED]>
To: <agi@v2.listbox.com>
Sent: Friday, May 23, 2008 9:22 PM
Subject: Re: [agi] Goal Driven Systems and AI Dangers [WAS Re: Singularity
Outcomes...]
Mark Waser wrote:
he makes a direct reference to goal driven systems, but even more
important he declares that these bad behaviors will *not* be the result
of us programming the behaviors in at the start .... but in an MES
system nothing at all will happen unless the designer makes an explicit
decision to put some motivations into the system, so I can be pretty
sure that he has not considered that type of motivational system when he
makes these comments.
Richard, I think that you are incorrect here.
When Omohundro says that the bad behaviors will *not* be the result of us
programming the behaviors in at the start, what he means is that the very
fact of having goals or motivations and being self-improving will
naturally lead (**regardless of architecture**) to certain (what I call
generic) sub-goals (like the acquisition of power/money,
self-preservation, etc.) and that the fulfillment of those subgoals,
without other considerations (like ethics or common-sense), will result
in what we would consider bad behavior.
This I do not buy, for the following reason.
What is this thing called "being self improving"? Complex concept, that.
How are we going to get an AGI to do that? This is a motivation, pure and
simple.
So if Omohundro's claim rests on the fact that "being self-improving" is
part of the AGI's makeup, and that this will cause the AGI to do certain
things, develop certain subgoals, etc., I say that he has quietly inserted
a *motivation* into the system (or rather assumed it: does he ever say how
this is supposed to work?), and then imagined some consequences.
Further, I do not buy the supposed consequences. Me, I have the
"self-improving" motivation too. But it is pretty modest, and also it is
just one among many, so it does not have the consequences that he
attributes to the general existence of the self-improvement motivation. My
point is that since he did not understand that he was making the
assumption, and did not realize the role that it could play in a
Motivational Emotional system (as opposed to a Goal Stack system), he made
a complete dog's dinner of claiming how a future AGI would *necessarily*
behave.
Could an intelligent system be built without a rampaging desire for
self-improvement (or, as Omohundro would have it, rampaging power hunger)?
Sure: a system could just modestly want to do interesting things and have
new and pleasureful experiences. At the very least, I don't think that
you could claim that such an unassuming, hedonistic and unambitious type
of AGI is *obviously* impossible.
I believe that he is correct in that goals or motivations and
self-improvement will lead to generic subgoals regardless of
architecture. Do you believe that your MES will not derive generic
subgoals under self-improvement?
See above: if self-improvement is just one motivation among many, then
the answer depends on exactly how it is implemented.
Only in a Goal Stack system is there a danger of a self-improvement
supergoal going AWOL.
Omohundro's arguments aren't *meant* to apply to an MES system without
motivations -- because such a system can't be considered to have goals.
His arguments will start to apply as soon as the MES system does have
motivations/goals. (Though, I hasten to add that I believe that his
logical reasoning is flawed in that there are some drives that he missed
that will prevent such bad behavior in any sufficiently advanced system).
As far as I can see, his arguments simply do not apply to MES systems: the
arguments depend too heavily on the assumption that the architecture is a
Goal Stack. None of what he says *follows* if an MES is used. Just a lot of
non sequiturs.
When an MES system is set up with motivations (instead of being blank)
what happens next depends on the mechanics of the system, and the
particular motivations.
Richard Loosemore
agi
Archives: http://www.listbox.com/member/archive/303/=now
RSS Feed: http://www.listbox.com/member/archive/rss/303/
Modify Your Subscription:
http://www.listbox.com/member/?member_id=8660244&id_secret=103754539-40ed26
Powered by Listbox: http://www.listbox.com