--- rg <[EMAIL PROTECTED]> wrote:
Matt: Why will an AGI be friendly?

The question only makes sense if you can define friendliness, which we can't.

Why Matt, thank you for such a wonderful opening . . . .  :-)

Friendliness *CAN* be defined. Furthermore, it is my contention that Friendliness can be implemented reasonably easily ASSUMING an AGI platform (i.e. it is just as easy to implement a Friendly AGI as it is to implement an Unfriendly AGI).

I have a formal paper that I'm just finishing that presents my definition of Friendliness and attempts to prove the above contention (and several others), but I would like to do a preliminary acid test by presenting the core ideas via several e-mails that I'll be posting over the next few days (i.e. y'all are my lucky initial guinea-pig audience :-). Assuming that the ideas survive the acid test, I'll post the (probably heavily revised :-) formal paper a couple of days later.

= = = = = = = = = =
PART 1.

The obvious starting point is to explicitly recognize that the point of Friendliness is that we wish to prevent the extinction of the *human race* and/or to prevent many other horrible, nasty things that would make *us* unhappy. After all, this is why we believe Friendliness is so important. Unfortunately, the problem with this starting point is that it biases the search for Friendliness in a direction towards a specific type of Unfriendliness. In particular, in a later e-mail, I will show that several prominent features of Eliezer Yudkowsky's vision of Friendliness are actually distinctly Unfriendly and will directly lead to a system/situation that is less safe for humans.

One of the critically important advantages of my proposed definition/vision of Friendliness is that it is an attractor in state space. If a system finds itself outside of (but still reasonably close to) an optimally Friendly state, it will actually DESIRE to reach or return to that state (and yes, I *know* that I'm going to have to prove that contention). While Eli's vision of Friendliness is certainly stable (i.e. the system won't intentionally become unfriendly), there is no "force" or desire helping it return to Friendliness if it deviates somehow due to an error or outside influence. I believe that this is a *serious* shortcoming in his vision of the extrapolation of the collective volition (and yes, this does mean both that I believe Friendliness is CEV and that I, personally, (and shortly, we collectively) can define a stable path to an attractor CEV that is provably sufficient, arguably optimal, and which should hold up under all future evolution).
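
To make the attractor-vs.-merely-stable distinction concrete, here's a quick toy sketch in Python (purely illustrative; nothing in it is a real Friendliness implementation, and x = 0 just stands in for the optimally Friendly state, with the perturbation standing in for an error or outside influence):

# Toy illustration only: an "attractor" Friendly state vs. a merely
# "stable" one, modeled as trivial 1-D dynamical systems.

def simulate(restoring_force, steps=100, perturbation=0.5, perturb_at=50):
    """Integrate x' = restoring_force(x) with a one-time perturbation."""
    x, dt = 0.0, 0.1
    for t in range(steps):
        if t == perturb_at:
            x += perturbation            # error / outside influence
        x += restoring_force(x) * dt     # the system's own dynamics
    return x

# Attractor: any deviation creates a restoring "desire" to return.
attractor_final = simulate(lambda x: -1.0 * x)

# Merely stable: the system won't push itself further away, but nothing
# pulls it back either; it stays wherever the perturbation left it.
stable_final = simulate(lambda x: 0.0)

print(f"attractor ends near the Friendly state: x = {attractor_final:.4f}")
print(f"merely stable ends displaced:           x = {stable_final:.4f}")

The attractor run ends back at (essentially) x = 0, while the merely-stable run sits wherever the perturbation left it. That restoring pull, and not mere non-deviation, is the whole point of the attractor framing.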

TAKE-AWAY:  Friendliness is (and needs to be) an attractor CEV

PART 2 will describe how to create an attractor CEV and make it more obvious why you want such a thing.


!! Let the flames begin !! :-)
