Mark Waser wrote:
If the motives depend on "satisficing", and the quest for
unlimited fulfillment is avoided, then this limits the danger. The
universe won't be converted into toothpicks, if a part of setting the
goal for "toothpicks!" is limiting the quantity of toothpicks.
(Limiting it reasonably might almost be a definition of friendliness
... or at least neutral behavior.)
You have a good point. Goals should be fulfilled after satisficing
except when the goals are of the form "as <goal> as possible"
(hereafter referred to as "unbounded" goals). Unbounded-goal-entities
*are* particularly dangerous (although being aware of the danger
should mitigate it to some degree).
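The distinction between satisficing and "as <goal> as possible" goals can be made concrete with a toy utility function. This is only an illustrative sketch (the function names and numbers are made up, not anyone's actual proposal): a bounded goal stops paying off once its target is met, while an unbounded goal rewards more production forever.

```python
def bounded_goal_value(quantity, target):
    """Utility of a satisficing goal: flat once the target is met."""
    return min(quantity, target)

def unbounded_goal_value(quantity):
    """Utility of an 'as much as possible' goal: grows without limit."""
    return quantity

# A bounded toothpick-maker gains nothing past its target...
assert bounded_goal_value(1_000_000, target=100) == bounded_goal_value(100, target=100)
# ...while an unbounded one always prefers more, however much it has.
assert unbounded_goal_value(1_000_000) > unbounded_goal_value(100)
```

On this picture, converting the universe into toothpicks is only ever instrumentally attractive to the second kind of goal.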
My Friendliness basically works by limiting the amount of interference
with others' goals (under the theory that doing so will prevent
others from interfering with your goals). Stupid entities that can't
see the self-interest in the parenthetical point are not inclined to
be Friendly. Stupid unbounded-goal-entities are Eliezer's
paperclip-producing nightmare.
And, though I'm not clear on how this should be set up, this
"limitation" should be a built-in primitive, i.e. not something
subject to removal, but only to strengthening or weakening via
learning. It should ante-date the recognition of visual images. But
it needs to have a slightly stronger residual limitation than it does
in people. Or perhaps its initial appearance needs to come during
the formation of the statement of the problem, i.e., a solution to a
problem can't be sought without knowing the limits. People seem to
manage that via a dynamic sensing approach, which sometimes suffers
from inadequate feedback mechanisms (for saying "Enough!").
The limitation is "Don't stomp on other people's goals unless it is
truly necessary" *and* "It is very rarely truly necessary".
(It's not clear to me that it differs from what you are saying, but
it does seem to address a part of what you were addressing, and I
wasn't really clear about how you intended the satisfaction of goals
to be limited.)
As far as my theory/vision goes, I was pretty much counting on the
fact that we are multi-goal systems and that our other goals will
generally limit any single goal from getting out of hand. Further, if
that doesn't do it, the proclamation of not stepping on others' goals
unless absolutely necessary should help handle the problem . . . . but
. . . . actually you do have a very good point. My theory/vision
*does* have a vulnerability toward single-unbounded-goal entities in
that my Friendly attractor has no benefit for such a system (unless,
of course, its goal is Friendliness or it is forced to have a
secondary goal of Friendliness).
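The point that "our other goals will generally limit any single goal" can be sketched numerically. Assuming (purely for illustration; these names and figures are hypothetical) a fixed effort budget and satisficing goals, effort poured into one already-satisfied goal is effort taken from the others, so the remaining goals naturally punish obsession with any single one:

```python
def total_value(allocation, targets):
    """Sum of satisficing utilities: each goal's value caps at its target."""
    return sum(min(effort, target) for effort, target in zip(allocation, targets))

targets = [3, 3, 3]      # three goals, each satisfied by 3 units of effort
balanced = [3, 3, 3]     # a budget of 9 spread across all goals
obsessive = [9, 0, 0]    # the whole budget spent on one goal

assert total_value(balanced, targets) == 9
assert total_value(obsessive, targets) == 3  # overshooting goal 1 buys nothing
```

A single-unbounded-goal entity is exactly the case where this internal check disappears, since nothing caps the one term in the sum.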
The trouble with "not stepping on others' goals unless absolutely
necessary" is that it relies on mind-reading. The goals of others are
often opaque, and not easily verbalized even when they think to try.
Then there's the question of "unless absolutely necessary": how and
why should I decide that their goals are more important than mine? One
needs to know not only how important their goals are to them, but also
how important my conflicting goals are to me. And, of course, whether
there's a means for mutual satisfaction that isn't too expensive. (And
just try to define that "too".)
For some reason I'm reminded of the story about the peasant, his son,
and the donkey carrying a load of sponges. I'd just as soon nobody ends
up in the creek. ("Please all, please none.")