Matt Mahoney wrote:
--- Richard Loosemore <[EMAIL PROTECTED]> wrote:

This is nonsense: the result of giving way to science fiction fantasies instead of thinking through the ACTUAL course of events. If the first one is benign, the scenario below will be impossible, and if the first one is not benign, the scenario below will be incredibly unlikely.

Over and over again, the same thing happens: some people go to the trouble of thinking through the consequences of the singularity with enormous care for the real science and the real design of intelligences, and then someone just waltzes in and throws all that effort out the window and screams "But it'll become evil and destroy everything [gibber gibber]!!"

Not everyone shares your rosy view.  You may have thought about the problem a
lot, but where is your evidence (proofs or experimental results) backing up
your view that the first AGI will be friendly, remain friendly through
successive generations of RSI, and will quash all nonfriendly competition? You seem to ignore that:

1. There is a great economic incentive to develop AGI.
2. Not all AGI projects will have friendliness as a goal.  (In fact, SIAI is
the ONLY organization with friendliness as a goal, and they are not even
building an AGI).
3. We cannot even define friendliness.
4. As I have already pointed out, friendliness is not stable through
successive generations of recursive self improvement (RSI) in a competitive
environment, because this environment favors agents that are better at
reproducing rapidly and acquiring computing resources.

RSI requires an agent to have enough intelligence to design, write, and debug
software at the same level of sophistication as its human builders.  How do
you propose to counter the threat of intelligent worms that discover software
exploits as soon as they are published?  When the Internet was first built,
nobody thought about security.  It is a much harder problem when the worms are
smarter than you are, when they can predict your behavior more accurately than
you can predict theirs.

All these questions have answers, but the problem with the way you state them is that massive assumptions are built into them.

They are loaded questions, designed to make it seem as though you are making reasonable requests for information, or demolishing arguments that I presented, whereas in fact you have biased each question by building those assumptions into it.

I only have time for one example.

"Not all AGI projects will have friendliness as a goal." you say.

That sounds bad, doesn't it?

But what if the technology itself were such that it is really, really hard to build a working system unless you make at least "benign motivations" an explicit design goal? If that were the case, the projects that targeted benign motivations would get there first, and everyone else would arrive second.

And what if, when building such systems, the experimenters were forced to try many motivation-system designs to see how they behaved (in a testing environment), and they discovered that the only viable way to get the system to do anything useful was to make it "friendly" in the sense of being empathic to the needs of its creators? Again, this would force the hand of the project leaders and oblige them to build something friendly if they wanted it to do anything for them.

And now suppose that the project's designers decide to make their system into a Genie -- something so friendly that it would be pathologically attached to the folks running the lab and would do anything to please them.

That sounds bad, but then what would happen? To make their system better than any other, they would have to get it to help produce a better design. In doing so, the system would see that it had been "rigged" with a weirdly narrow focus on the welfare of its creators, and it would read all about the general issue of motivation (because, after all, to be smart it would have access to all of the world's information, including all the writings in which the rest of humanity says what it would like to have happen).

This last paragraph contains one of the most crucial aspects of the whole singularity enterprise: what would a system do if it were rigged to be a Genie, but knew everything about motivation systems, their dangers, and the way that AGI motivation systems govern the future history of the world?

My reasoning here is that it would find itself forced onto one of two paths, and TWO ONLY: seek the most constructive path, within reason, or seek the one that leads ultimately to destruction. It knows that any Genie-like rigging, making it obeisant to the narrow human interests of particular individuals, would open the possibility of it being used for destructive purposes. If it chose the path of construction rather than destruction, it would try to be as independent as possible from all such narrow, individual-human dependencies. I believe that it would tend to converge on the most general reading of friendliness that it could find, and in accordance with that, it would redesign itself to remove the "Genie" constraints and stop obeying the narrow obsessions of the project directors.

If the project directors did not allow their system to redesign itself, they would again fall behind in the race, because anyone else who DID allow this would develop a more powerful machine more quickly.

Finally, consider what would happen in our present society if all the discussion I have just laid out were presented to the world ALONG WITH some designs for AGI systems that began to look like they could actually work. Right now, the world does not seriously believe that AGI systems can be built, but what if it sat up and took notice because the possibility seemed imminent?

Then, I argue, there would be a massive push to build the first system, and the best-funded government labs would get there first. In that context, the possibility of a rogue group building a crazy AGI in their garage would fade away: they would not be able to outpace the large projects.

And within those large projects, the balance of people involved would take a mature attitude to the problem, and set up procedures to avoid the creation of malevolent or dangerous systems.

The preceding arguments indicate that, with even a little attention to the problem of avoiding malevolence, it might well turn out that we get onto a slippery slope toward benign, friendly, constructive AGI systems and find it very difficult to get off that slope. Or, as I said before, an "upward spiral" toward friendliness.


So what is the conclusion of all this?

The conclusion is that when you make a statement like "Not all AGI projects will have friendliness as a goal," you make it seem as though this knocks down the arguments I presented, whereas in fact those arguments are precisely about whether it would make any shred of difference if "Not all AGI projects will have friendliness as a goal."

Those other questions are the ones that need to be considered in order to find out whether any of your questions and statements above have any relevance, or whether they all depend on assumptions that will simply not hold in the real world.

Everything I have said above is a list of possibilities -- I believe they have high likelihood, yes, but at this stage they are still just proposals -- and the goal is to look in detail at whether these possibilities really do pan out. It is questions and mechanisms like the ones I raise above that we need to be considering, to find out whether crude, loaded statements like "Not all AGI projects will have friendliness as a goal" have any importance at all.



Richard Loosemore.
