Hi Russell,

OK, I'll try to specify my ideas in this regard more clearly. Bear in mind though that there are many ways to formalize an intuition, and the style of formalization I'm suggesting here may or may not be the "right" one. With this sort of thing, you only know if the formalization is right after you've proved some theorems using it....

Given

-- an agent acting in an environment, with a variety of actions to choose from at various points in time
-- a specific goal G, in the context of which one is evaluating that agent

one may define an "implicit expectation function" based on the agent's chosen actions as compared to the goal G.

To wit: If in a certain situation S the agent chooses A instead of B, and the agent is being evaluated as an achiever of goal G, then we may say that according to the agent's implicit expectation function e relative to goal-context G,

e( degree of achievement of G | taking action A in situation S) >
e( degree of achievement of G | taking action B in situation S)

For example, if I am being evaluated as an agent trying to create a benevolent AGI, and I choose to write this email rather than complete the edit of the Novamente design manuscript, this means that according to Ben's implicit expectation function e relative to goal-context "create benevolent AGI",

e( degree of achievement of "create benevolent AGI" | write this email) >= e( degree of achievement of "create benevolent AGI" | complete edit of Novamente design manuscript)

Now, for a given agent that takes many actions, one will be able to derive many such inequalities describing its implicit expectations relative to the goal G.
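
Just to make the bookkeeping concrete, here's a minimal sketch in Python of how a log of observed choices could be turned into such a set of inequalities. (All of the names and data structures here are invented purely for illustration; nothing about the formalization depends on them.)

    # Minimal sketch: turning a log of observed choices into implicit-expectation
    # inequalities of the form
    #   e( achievement of G | chosen action, situation ) > e( achievement of G | rejected action, situation )
    # All names (Choice, derive_inequalities, ...) are illustrative only.

    from dataclasses import dataclass

    @dataclass(frozen=True)
    class Choice:
        situation: str      # description of the situation S
        chosen: str         # the action A the agent actually took
        rejected: tuple     # actions B1, B2, ... that were available but not taken

    def derive_inequalities(choices, goal):
        """For each observed choice, record one inequality per rejected alternative."""
        inequalities = []
        for c in choices:
            for b in c.rejected:
                inequalities.append((goal, c.situation, c.chosen, b))
        return inequalities

    log = [Choice("Sunday evening", "write this email",
                  ("complete edit of Novamente manuscript",))]
    for goal, s, a, b in derive_inequalities(log, "create benevolent AGI"):
        print(f'e( achievement of "{goal}" | {a}, in {s} ) > e( achievement of "{goal}" | {b}, in {s} )')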

Furthermore, the terms referred to in these inequalities are generally going to be expressible as logical combinations of simpler things. This allows one to abstract from them via probabilistic reasoning.

For instance, a study of many of Ben's actions might reveal that he likes writing more than editing, so that a general study of Ben's behavior would yield the following abstract conclusion regarding Ben's implicit expectation function:

e( degree of achievement of "create benevolent AGI" | write something) >
e( degree of achievement of "create benevolent AGI" | edit something)

Of course, this abstract conclusion may be wrong -- maybe Ben actually likes editing more than writing, but happens to have been in situations where he judged writing was the most important thing to do.

But, in this sort of manner, one can associate with a system a set of "implicit abstract expectations" regarding its behavior. This is a set of abstractions describing the agent's apparent pattern of judgments, obtained by analyzing the agent's observed action-selections in the context of its known action-possibilities and the specific goal G.
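
One crude way to picture the abstraction step, just as a toy illustration and not as a serious probabilistic reasoner: map each concrete action onto a more abstract category, and tally how often one category is chosen over another across the whole body of observations. The categorizer and the tallying scheme below are my own stand-ins.

    # Toy illustration of abstracting implicit expectations: concrete actions are
    # mapped to categories, and pairwise "chosen over" evidence is tallied.
    # A real treatment would use proper probabilistic inference; this just counts.

    from collections import Counter

    def category(action):
        # Hypothetical categorizer -- in practice this mapping would itself be learned.
        if "write" in action:
            return "write something"
        if "edit" in action:
            return "edit something"
        return "other"

    def abstract_expectations(inequalities):
        """Count how often an action in category X was chosen over one in category Y.
        Large counts are evidence for the abstract inequality e(G | X) > e(G | Y)."""
        counts = Counter()
        for goal, situation, chosen, rejected in inequalities:
            counts[(category(chosen), category(rejected))] += 1
        return counts

    obs = [("create benevolent AGI", "Sunday", "write this email", "edit Novamente manuscript"),
           ("create benevolent AGI", "Monday", "write a blog post", "edit a paper draft")]
    for (x, y), n in abstract_expectations(obs).items():
        print(f"evidence={n}:  e(G | {x}) > e(G | {y})")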

Granted, different observers of the agent might come up with different sets of implicit abstract expectations for the agent. But, for sake of argument, let's assume an ideal probabilistic observer: i.e., an observer who assesses the inequalities between the agent's implicit abstract expectations using correct probability theory. Naturally this may be a difficult computational problem, so we are assuming this theoretical ideal probabilistic observer is very, very smart.

Now, the implicit abstract expectations obtained by the ideal probabilistic observer for the agent may relate to each other in various ways. They may be completely consistent with each other, or they may be wildly inconsistent with each other. That is: in the view of the ideal probabilistic observer, the agent may behave according to consistent implicit principles, or according to wildly inconsistent implicit principles.

For instance, if Ben
-- always chooses writing in place of editing (when given a choice)
-- always chooses editing in place of golfing (when given a choice)
-- always chooses golfing in place of writing (when given a choice)

then Ben's implicit abstract expectations, as judged by an ideal probabilistic observer, are going to come out as inconsistent. However, there may be more specific information about what guides Ben's choices that makes the apparent contradiction go away.
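
Inconsistency of this crude kind shows up as a cycle in the directed graph of abstract preferences, so the check itself is easy to sketch (ignoring evidence weights for the moment; again, the names here are just illustrative):

    # Sketch: detecting the writing > editing > golfing > writing sort of
    # inconsistency as a cycle in the directed "preferred over" graph.

    def has_preference_cycle(preferences):
        """preferences: iterable of (preferred, dispreferred) pairs.
        Returns True if the implied 'preferred over' relation contains a cycle."""
        graph = {}
        for a, b in preferences:
            graph.setdefault(a, set()).add(b)
            graph.setdefault(b, set())

        WHITE, GRAY, BLACK = 0, 1, 2
        color = {node: WHITE for node in graph}

        def visit(node):
            color[node] = GRAY
            for nxt in graph[node]:
                if color[nxt] == GRAY:                 # back edge => cycle
                    return True
                if color[nxt] == WHITE and visit(nxt):
                    return True
            color[node] = BLACK
            return False

        return any(color[n] == WHITE and visit(n) for n in graph)

    prefs = [("writing", "editing"), ("editing", "golfing"), ("golfing", "writing")]
    print(has_preference_cycle(prefs))   # True -- these implicit expectations are inconsistent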

(Note that Ben could still be doing the right thing even if he were apparently acting inconsistently according to these observed implicit abstract expectations. But, if a sufficient amount of evidence has been gathered about Ben, then, if Ben is acting consistently, an ideal probabilistic observer should be able to create a consistent model of his behavior by abstracting from his actions in the way I've described.)

Now, suppose this ideal probabilistic observer is also given the job of making predictions of Ben's behaviors, based on the implicit abstract expectations it has collected. One may then define the **importance** of a particular implicit abstract expectation, in terms of the degree of its tendency to play a useful role in accurate predictions. (There are obvious formulas for quantifying this notion of importance. We have a lot of experience with this way of defining importance in our machine learning work in a bioinformatics context, BTW.)
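
One simple way to cash out "importance" (this is only an illustration of the general idea, not the particular formula we've used in the bioinformatics work): score each abstract expectation by how much held-out prediction accuracy drops when that expectation is deleted from the predictive model.

    # Illustrative leave-one-out importance measure. The predict() rule and the
    # data formats are hypothetical stand-ins for a real predictive model.

    def predict(expectations, situation, options):
        """Pick the option favored by the applicable 'X preferred over Y' rules
        (situation is ignored in this toy version); ties go to the first option."""
        scores = {o: 0 for o in options}
        for preferred, dispreferred in expectations:
            if preferred in options and dispreferred in options:
                scores[preferred] += 1
        return max(options, key=lambda o: scores[o])

    def accuracy(expectations, test_cases):
        hits = sum(predict(expectations, s, opts) == actual
                   for s, opts, actual in test_cases)
        return hits / len(test_cases)

    def importance(expectation, expectations, test_cases):
        """Accuracy with the expectation included, minus accuracy without it."""
        reduced = [e for e in expectations if e != expectation]
        return accuracy(expectations, test_cases) - accuracy(reduced, test_cases)

    exps = [("writing", "editing"), ("editing", "golfing")]
    tests = [("Mon", ("editing", "writing"), "writing"),
             ("Tue", ("editing", "golfing"), "editing")]
    print(importance(("writing", "editing"), exps, tests))   # 0.5 -- half the accuracy rests on it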

We may then ask: How consistent is the set of important implicit abstract expectations associated with the agent?

My desire in this context is to show that, for agents that are optimal or near-optimal at achieving the goal G under resource restrictions R, the set of important implicit abstract expectations associated with the agent (in goal-context G as assessed by an ideal probabilistic observer) should come close to being consistent.

Clearly, this will hold only under certain assumptions about the agent, the goal, and the resource restrictions, and I don't know what these assumptions are.

The definition of "close to being consistent" is going to be critical here, of course. Observed inconsistencies with little evidence underlying them are going to have to be counted less than observed inconsistencies with a lot of evidence underlying them, for example.

The crux of this result, if one were able to show it, would be that, under appropriate conditions, optimal goal-achieving systems behave in a way that makes sense to a sufficiently intelligent observer.

Now, I agree that this is all kind of obvious, intuitively. But "kind of obvious" doesn't mean "trivial to prove." Pretty much all of Marcus Hutter's results about AIXI are kind of obvious too, perhaps even more so than the hypotheses I've made above -- it's intuitively quite clear that AIXI can achieve an arbitrarily high level of intelligence, and that it can perform just as well as any other algorithm up to a (large) constant factor. Yet, to prove this rigorously turned out to be quite a pain, given the mathematical tools at our disposal, as you can see from the bulk and complexity of Hutter's papers.

-- Ben G
