Mark Waser wrote:
And apart from the global differences between the two types of AGI, it would be no good to try to guarantee friendliness using the kind of conventional AI system that Novamente is, because insofar as general goals are encoded in such a system, they are explicitly coded as "statements" which are then interpreted by something else. To put it crudely (and to oversimplify slightly), if the goal "Be empathic to the needs of human beings" were represented just like that, as some kind of proposition stored at a particular location, it wouldn't take much for a hacker to get inside and change the statement to "Make [hacker's name] rich and sacrifice as much of humanity as necessary". If that were to become the AGI's top-level goal, we would be in deep doodoo. In the system I propose, such events could not happen.

I think that this focuses on the wrong aspect. It is not the fact that the goal is explicitly encoded as a statement that is the problem -- it is the fact that it is in only one place that is dangerous. My assumption is that your system basically builds its base constraints from a huge number of examples, and that it is distributed enough that it would be difficult, if not impossible, to maliciously change enough of it to cause a problem. The fact that you're envisioning your system as not having easy-to-read statements is really orthogonal to your argument: a system that explicitly codes all of its constraints as readable statements, but still builds its base constraints from a huge number of examples, should be virtually as incorruptible as your system (the only difference being security by obscurity -- which is not a good thing to rely upon, and which also means that your system is less comprehensible).

Mark,

You have put your finger on one aspect of the proposal that came up, in a slightly different way, when Jef Allbright started talking about pragmatics: the "semantics" of the system. This is the hardest feature to explain in a short space.

I really did consciously mean both things: not just distributed representation of the constraints, but also distributed semantics of the system as a whole. This distributed, semi-opaque semantics is what I meant above by the propositions not being explicitly encoded, and what I was also referring to in my comment to Jef.

If the basic knowledge units ("atoms") of the system develop as a result of learning mechanisms plus real-world interaction (which together make them grounded), then the meaning of any given atom is encoded in the whole web of connections between it and the other atoms, and also by the mechanisms that browse (use, modify) these atoms. It is not easy to point to an atom and say exactly what it does.
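To make that concrete, here is a deliberately crude toy sketch (purely illustrative: the names Atom, connect and so on are invented for this example and have nothing to do with the real architecture, or with Novamente). It contrasts a goal stored as one editable proposition with a disposition that exists only as a pattern of weights spread across many atoms, none of which can be read off or edited in isolation:

    # Toy illustration only -- not Novamente, and not the proposed system.
    # Contrast: a goal held as one explicit statement versus a disposition
    # that exists only as weights distributed across many atoms.
    import random

    # Conventional encoding: one proposition at one address.
    goals = {"top_goal": "Be empathic to the needs of human beings"}
    # A hacker only has to change a single string:
    #   goals["top_goal"] = "Make [hacker's name] rich ..."

    # Distributed encoding: meaning lives in the web of connections.
    class Atom:
        def __init__(self, name):
            self.name = name      # a label for our convenience only
            self.links = {}       # other Atom -> connection weight

    def connect(a, b, w):
        a.links[b] = w
        b.links[a] = w

    random.seed(0)
    atoms = [Atom("a%d" % i) for i in range(200)]
    for a in atoms:
        for b in random.sample(atoms, 8):
            if b is not a:
                connect(a, b, random.uniform(-1.0, 1.0))

    # There is no atom you can point to and read off "the goal": each
    # atom's role is defined only by its weights to all the others,
    # which in the real case would be shaped by learning and by
    # interaction with the world.
    print(sum(len(a.links) for a in atoms), "connections, no goal string anywhere")

The point of the sketch is only that in the second representation there is no single location whose contents correspond to the goal, so there is nothing for a clean edit to target.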

This is not an optional part of the framework: it is crucial. It is the main reason why the system has some complexity. It is also the reason why the system can be properly grounded and is scalable (something that cannot be achieved with an ordinary, conventional AI system, because of the complex systems problem).

In a sense the system is less comprehensible, but this is only a matter of degree. I don't think it makes any practical difference to our attempts to govern its behavior. It is going to be comprehensible enough that we can put hooks in for monitoring purposes.

The great benefit of this way of doing things is that, once the system has matured to adulthood, it cannot be hacked: you cannot just write a worm that goes around hunting for constraints and modifying them in a regular way (as you might be able to do with ordinary distributed constraints, where the semantics of each individual atom is well enough defined that you can make a clean edit), because if you tried to do this you would destabilize the whole thing and turn it into a gibbering wreck. It would stop working, and the effect would be so dramatic that we (and it) could easily set up automatic shutdown mechanisms to intervene in such a case.
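As a hedged sketch of what that could look like (again purely illustrative; the "coherence" measure below is a crude stand-in I have invented for whatever battery of internal consistency checks a real system would expose through its monitoring hooks):

    # Toy watchdog sketch, purely illustrative.  A global "coherence"
    # score stands in for the internal health signals a mature system
    # would expose; a blunt worm edit is modelled as overwriting a block
    # of weights in a regular way.
    import random

    random.seed(0)
    N = 200
    # Start from a mutually consistent (here: symmetric) web of weights,
    # standing in for constraints shaped by years of grounded learning.
    weights = [[0.0] * N for _ in range(N)]
    for i in range(N):
        for j in range(i + 1, N):
            w = random.gauss(0.0, 1.0)
            weights[i][j] = weights[j][i] = w

    def coherence():
        # Crude proxy for internal consistency: symmetry of the web.
        asym = sum(abs(weights[i][j] - weights[j][i])
                   for i in range(N) for j in range(N))
        return -asym / (N * N)    # 0.0 = fully coherent

    THRESHOLD = -0.05             # tolerance chosen for the toy example

    def watchdog():
        if coherence() < THRESHOLD:
            raise SystemExit("coherence collapsed: automatic shutdown")

    watchdog()                    # the healthy system passes

    # A worm tries to rewrite constraints "in a regular way": it
    # overwrites every weight touching the first 20 atoms.
    for i in range(20):
        for j in range(N):
            weights[i][j] = 5.0

    watchdog()                    # global consistency is wrecked first

The edit destroys the mutual consistency of the web long before it produces any coherent change in behavior, which is exactly the kind of dramatic, easily detected failure that an automatic shutdown mechanism can be keyed to.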




Richard Loosemore


