Re: Questions on Russell's "Why Occam" paper

2005-06-12 Thread Russell Standish
I'm not sure I follow your reasoning, but it wouldn't surprise me if
the Turing subset of my world has additional constraints - namely the
worlds seen by observers whose O(x)'s are prefix machines, not just maps.

On Fri, Jun 10, 2005 at 05:56:50PM +0200, Bruno Marchal wrote:
> 
> Le 10-juin-05, ? 14:59, Patrick Leahy a ?crit :
> 
> >>Russell Standish:
> >>If the AP applies to the Sims Mark VII, then their reality will be a 
> >>description containing a "body" corresponding to their intelligences. 
> >>They will not be aware of the PC that their description is being 
> >>generated on. We, who inhabit the world with the PC will not be aware 
> >>of the countless other PCs, Macs, Xboxes, Eniacs, Turing machines, 
> >>pebbles in Zen monasteries etc running Sims Mark VII. So the PC 
> >>itself is actually irrelevant from the internal perspective of the 
> >>Sims.
> >
> >Well at least we agree on that.  No strange loops in this picture, so 
> >it is unlike the picture you outline in your paper.
> >
> 
> 
> Aargh  Bad luck! A point where I disagree with both Schmidhuber 
> *and* Standish, at least here apparently.
> 
> To explain I must assume comp and ... (for one) explicitly the *result* 
> of my thesis. In a nutshell: it is that, if comp is assumed, then the 
> correct law of physics are derivable from comp. (it makes comp 
> testable: derive physics from comp and compare with empirical physics).
> I will call the physics derived from comp: the comp-physics.
> 
> Please admit this if only for the sake of the argument.
> 
> Suppose I build a simulated city with some self-aware entities evolving 
> in that simulated environment. Then
> 
> Either I simulate the correct comp physics, then apparently the 
> simulated entity cannot know they are simulated by me, but actually 
> this sentence has no meaning, because they are simulated by 2^aleph_0 
> immaterial stories (constituting arithmetical truth), so it is only in 
> a weak sense that they are failed. (actually it is not even possible to 
> simulate comp physics except in the "ridiculous" sense of running 
> (really) the universal dovetailer.
> 
> Or I simulate incorrect comp physics, then the only way we could say 
> the simulated entity are failed is
> 1) either by killing them (in some absolute way) when they discover 
> discrepancies between the comp-physics they can find by herself and 
> their fake environment. But in that case their story is finite and its 
> measure can be shown equal to 0. Or
> 2) eithert I keep up correcting the simulation, but then in the limit I 
> don't fail them. Or I limit the cognitive ability of the entities, but 
> then either I will failed to genuinely fail them, or I will make them 
> inconsistent (and here too the measure can be shown equal to zero, and 
> that is related to the non-cul-sac phenomenon).
> 
> 
> It is an amazing positive consequence of machine's incompleteness that 
> you cannot genuinely failed any (relatively simulated or not) machine 
> having enough introspection power for a "very long time".
> 
> Apparently, In machine's platonia, all lies leads soon or later to a 
> (recognizable) catastroph.
> 
> Tp prevent falling into an inconsistency, this last conclusion follows 
> from comp, and remember that if comp is correct we cannot "know" it is 
> correct, and we cannot probably know that all lies leads soon or later 
> to a (recognizable) catastroph. But if you *bet* on comp, you can bet 
> on it!
> 
> Bruno
> 
> 
> http://iridia.ulb.ac.be/~marchal/

-- 
*PS: A number of people ask me about the attachment to my email, which
is of type "application/pgp-signature". Don't worry, it is not a
virus. It is an electronic signature, that may be used to verify this
email came from me if you have PGP or GPG installed. Otherwise, you
may safely ignore this attachment.


A/Prof Russell Standish  Phone 8308 3119 (mobile)
Mathematics0425 253119 (")
UNSW SYDNEY 2052 [EMAIL PROTECTED] 
Australiahttp://parallel.hpc.unsw.edu.au/rks
International prefix  +612, Interstate prefix 02



pgpREQze8gtH3.pgp
Description: PGP signature


Re: Questions on Russell's "Why Occam" paper

2005-06-11 Thread Russell Standish
On Fri, Jun 10, 2005 at 01:59:16PM +0100, Patrick Leahy wrote:
> 
> On Thu, 9 Jun 2005, Russell Standish wrote:
> 
> >Yes, if you think there is a concrete reality in which everything exists 
> >(your question of where does the observer live?), then the AP is a 
> >tautology.
> 
> What I meant by "where does the observer live", in more formal language, 
> is "how do you account for the (apparent) sense data we have?". 

Perhaps you ask too much. Your question is something along the lines
of "what explains qualia", or "what breathes fire into the
equations". 

I could state baldly that descriptions can conscious. This is no more
preposterous than Tegmarks "mathematical systems can be
conscious". Maybe that satisfies you, maybe not. What can be stated
more convincingly is that all experience (including internal conscious
experience) is a description, a bunch of data interpreted by an
observer. Consequently, the set of all descriptions will contain a
description of your current conscious experience. Hence we can make
strong claims about appearances - the phenomenon. As to whether there
is a Noumenon - I take Laplace's rather agnostic point of view - I have
no need of that hypothesis. Perhaps you do.

> I also 
> have a strong preference for an account where the description of at least 
> our "world" doesn't privilege one particular observer. In particular, this 
> is hard to square with your insistence that "the observer provides the 
> interpretation" of each of your bitstring universes.
> 

The situation is symmetric with respect to all observers. That is
hardly priveleging  an observer.

> As both Hal and I keep trying to emphasise, we are interested in how, or 
> whether, your theory can account for our own existence and the reality (or 
> appearance, if you prefer) that we see around us. So the case of 
> disembodied intelligences is a total a red herring. I don't really care 
> whether these feature in your theory or not, but I do care whether you can 
> account for (apparently) embodied intelligences.
> 

Apparently embodied intelligences are part of the space of all
descriptions by definition. The anthropic principle accounts for the
fact that we observer them, and not (say) flying white rabbits.

> >
> >Instead, one can take the Anthropic Principle as an assertion of the 
> >reality we inhabit...
> 
> Again, you are using a private language... the AP is not regarded as any 
> such assertion by anyone else I've ever heard of. Most people regard their 
> existence as proved by their own subjective experience, not some invented 
> principle. 

Of course - Decartes and all that. The AP applies to what we see in
the world around us, not a proof or otherwise of our own existence.

> 
> > ... and experimentally test it. In all such cases is has been shown to 
> >be true, sometimes spectacularly.
> 
> If we know "experimentally" "the reality we inhabit" (?!), which I guess 
> I've just claimed that we do, why do we need a principle to assert it?
> 
> Likely you mean something completely different, in which case please 
> explain (with examples of said experiments!).

The experiments are the usual suspects showing fine tuning of physical
parameters in the universe. Tegmark's paper is a good review of the
topic. As is Barrow and Tipler's book.

> 
>  Quoting me:
> >>
> >>Then you are implying that the observer can, in a finite time, read and 
> >>attach meaning to a full (space-time) description of itself, including 
> >>the act of reading this description and so on recursively.
> >>
> >
> >Not at all. Consistency is the only requirement. If the observer goes 
> >looking for erself, then e will find erself in the description. It 
> >doesn't imply the observer is doing this all the time.
> 
> I think here we have run into the same inconsistency that you admitted in 
> your discussion with Hal. In your first reply to Hal you assert that the 
> observer O(x) attaches a unique meaning to the description string. Which 
> would imply processing all bits of the string up to the start of the 
> "don't care" region. A later reply suggest that we should in different 
> contexts assume (a) this and (b) what your paper actually says, i.e. the 
> meanings are updated as further bits are read.  Now you have changed this 
> again, and the observer is not (modelled by) a simple mapping but is a 
> free agent who can choose to apply mappings to different "regions" of the 
> bitstring at will.

Yes, as I've further clarified to Hal, O(x) is actually a function of
time. One imagines that in general the region of sigificant bits of x
expands as a function of (psychological) time. 

In my defence, the paper was written over a period of 4 years, and the
O(x) was a later addition to try to clarify points in section 2. I
didn't realise at the time that it introduced some ambiguities into
section 3.

> 
> And even that doesn't actually answer my problem: let's assume the 
> observer *does* "go looking for erself". You claim he will fi

Re: Questions on Russell's "Why Occam" paper

2005-06-10 Thread Bruno Marchal


Le 10-juin-05, à 14:59, Patrick Leahy a écrit :


Russell Standish:
If the AP applies to the Sims Mark VII, then their reality will be a 
description containing a "body" corresponding to their intelligences. 
They will not be aware of the PC that their description is being 
generated on. We, who inhabit the world with the PC will not be aware 
of the countless other PCs, Macs, Xboxes, Eniacs, Turing machines, 
pebbles in Zen monasteries etc running Sims Mark VII. So the PC 
itself is actually irrelevant from the internal perspective of the 
Sims.


Well at least we agree on that.  No strange loops in this picture, so 
it is unlike the picture you outline in your paper.





Aargh  Bad luck! A point where I disagree with both Schmidhuber 
*and* Standish, at least here apparently.


To explain I must assume comp and ... (for one) explicitly the *result* 
of my thesis. In a nutshell: it is that, if comp is assumed, then the 
correct law of physics are derivable from comp. (it makes comp 
testable: derive physics from comp and compare with empirical physics).

I will call the physics derived from comp: the comp-physics.

Please admit this if only for the sake of the argument.

Suppose I build a simulated city with some self-aware entities evolving 
in that simulated environment. Then


Either I simulate the correct comp physics, then apparently the 
simulated entity cannot know they are simulated by me, but actually 
this sentence has no meaning, because they are simulated by 2^aleph_0 
immaterial stories (constituting arithmetical truth), so it is only in 
a weak sense that they are failed. (actually it is not even possible to 
simulate comp physics except in the "ridiculous" sense of running 
(really) the universal dovetailer.


Or I simulate incorrect comp physics, then the only way we could say 
the simulated entity are failed is
1) either by killing them (in some absolute way) when they discover 
discrepancies between the comp-physics they can find by herself and 
their fake environment. But in that case their story is finite and its 
measure can be shown equal to 0. Or
2) eithert I keep up correcting the simulation, but then in the limit I 
don't fail them. Or I limit the cognitive ability of the entities, but 
then either I will failed to genuinely fail them, or I will make them 
inconsistent (and here too the measure can be shown equal to zero, and 
that is related to the non-cul-sac phenomenon).



It is an amazing positive consequence of machine's incompleteness that 
you cannot genuinely failed any (relatively simulated or not) machine 
having enough introspection power for a "very long time".


Apparently, In machine's platonia, all lies leads soon or later to a 
(recognizable) catastroph.


Tp prevent falling into an inconsistency, this last conclusion follows 
from comp, and remember that if comp is correct we cannot "know" it is 
correct, and we cannot probably know that all lies leads soon or later 
to a (recognizable) catastroph. But if you *bet* on comp, you can bet 
on it!


Bruno


http://iridia.ulb.ac.be/~marchal/




Re: Questions on Russell's "Why Occam" paper

2005-06-10 Thread Patrick Leahy


On Thu, 9 Jun 2005, Russell Standish wrote:

Yes, if you think there is a concrete reality in which everything exists 
(your question of where does the observer live?), then the AP is a 
tautology.


What I meant by "where does the observer live", in more formal language, 
is "how do you account for the (apparent) sense data we have?". I also 
have a strong preference for an account where the description of at least 
our "world" doesn't privilege one particular observer. In particular, this 
is hard to square with your insistence that "the observer provides the 
interpretation" of each of your bitstring universes.




However, if you are prepared to allow for the possibility that observers 
exist "nowhere", then things are not quite so simple. One can always 
imagine being the brain-in-the-vat observer a reality which does not 
contain a body, or a brain, in a vat or anywhere else. Usually in this 
scenario, the observer will conclude that there must be a body somewhere 
else, and so concludes that it is inhabiting some kind of virtual 
reality. However, this implicitly assumes there has to a brain 
somewhere, and so implies a reality somewhere else for the brain to 
inhabit. But what if the brain is not required?


Obviously, the last conclusion is full blown solipsism, but that is 
hardly a knock down argument.


As both Hal and I keep trying to emphasise, we are interested in how, or 
whether, your theory can account for our own existence and the reality (or 
appearance, if you prefer) that we see around us. So the case of 
disembodied intelligences is a total a red herring. I don't really care 
whether these feature in your theory or not, but I do care whether you can 
account for (apparently) embodied intelligences.




Instead, one can take the Anthropic Principle as an assertion of the 
reality we inhabit...


Again, you are using a private language... the AP is not regarded as any 
such assertion by anyone else I've ever heard of. Most people regard their 
existence as proved by their own subjective experience, not some invented 
principle. If you don't agree I think we are just arguing about the 
meaning of "exist"... e.g. if we happen to be living in a computer 
simulation, or are just features of the solution of some set of equations, 
I would still say that we (really) exist.



 ... and experimentally test it. In all such cases is has been shown to 
be true, sometimes spectacularly.


If we know "experimentally" "the reality we inhabit" (?!), which I guess 
I've just claimed that we do, why do we need a principle to assert it?


Likely you mean something completely different, in which case please 
explain (with examples of said experiments!).


 Quoting me:


Then you are implying that the observer can, in a finite time, read and 
attach meaning to a full (space-time) description of itself, including 
the act of reading this description and so on recursively.




Not at all. Consistency is the only requirement. If the observer goes 
looking for erself, then e will find erself in the description. It 
doesn't imply the observer is doing this all the time.


I think here we have run into the same inconsistency that you admitted in 
your discussion with Hal. In your first reply to Hal you assert that the 
observer O(x) attaches a unique meaning to the description string. Which 
would imply processing all bits of the string up to the start of the 
"don't care" region. A later reply suggest that we should in different 
contexts assume (a) this and (b) what your paper actually says, i.e. the 
meanings are updated as further bits are read.  Now you have changed this 
again, and the observer is not (modelled by) a simple mapping but is a 
free agent who can choose to apply mappings to different "regions" of the 
bitstring at will.


And even that doesn't actually answer my problem: let's assume the 
observer *does* "go looking for erself". You claim he will find himself, 
but if the description is *complete* my original problem remains: he will 
never finish reading his own description. Consequently the description 
will remain uninterpreted. In particular, he will never get to the part 
which would be interpreted as "himself now". So is there any sense in 
which *himself now* exists?


This assumes that the string description contained a complete definition 
of the observer, which is a natural interpretation of your phrase:


"Since some of these descriptions describe self aware substructures,..."

But maybe you just meant that the string contains references to (tokens 
of) the observer. This would be consistent with your comment a couple of 
posts ago that both "observers" and "descriptions" are primary.  This also 
seems to be consistent with your other recent post in response to Hal, in 
which the bitstrings are treated not so much as universes in the usual 
sense but as either the stream of sense data entering the observer's 
conciousness, or as a continuously-updated description of that 
concio

Re: Questions on Russell's "Why Occam" paper

2005-06-08 Thread Russell Standish
On Thu, Jun 09, 2005 at 01:55:32AM +0100, Patrick Leahy wrote:
> 
> [Russell Standish wrote]:
> 
> >The AP is a statement that observed reality must be consistent with
> >the observer being part of that reality.
> 
> Famously, this can be interpreted as either a trivial tautology (Brandon 
> Carter's original intention, I think), or an almost-obviously false 
> principle of necessity (Barrow & Tipler's SAP). If you think there's a 
> mystery here it suggests you go for the necessity version, but given your 
> infinite ensemble the tautology would suffice perfectly well.

Yes, if you think there is a concrete reality in which everything
exists (your question of where does the observer live?), then the AP
is a tautology. 

However, if you are prepared to allow for the possibility that
observers exist "nowhere", then things are not quite so simple. One
can always imagine being the brain-in-the-vat observer a reality which
does not contain a body, or a brain, in a vat or anywhere
else. Usually in this scenario, the observer will conclude that there
must be a body somewhere else, and so concludes that it is inhabiting
some kind of virtual reality. However, this implicitly assumes there
has to a brain somewhere, and so implies a reality somewhere else for
the brain to inhabit. But what if the brain is not required?

Obviously, the last conclusion is full blown solipsism, but that is
hardly a knock down argument.

Instead, one can take the Anthropic Principle as an assertion of the
reality we inhabit, and experimentally test it. In all such cases is
has been shown to be true, sometimes spectacularly. With the AP, one
recovers some of the properties of a concrete reality, without all of
it. In particular, Marchal's "shared dreaming" follows as a
consequence, and it contradicts solipsism.

> 
> You also said:
> 
> >>>The observer _is_ the interpreter. There may well be more than one 
> >>>observer in the picture, but they'd better agree!
> >>
> >>Why does this follow? 
> >
> >It follows from the Anthropic Principle. If O_1 is consistent with its 
> >observed reality, and O_2 is consistent with its observed reality, and 
> >O_1 observes O_2 in its reality, then O_1 and O_2 must be consistent 
> >with each other (at least with respect to their observed realities).
> 
> Ah. Just to be sure, do you mean that the string the observer "attaches 
> meaning to" is the one which describes the very same observer? This seems 
> to be implied by your comment above; but you don't say it or clearly 
> imply it in your paper.
> 
> Then you are implying that the observer can, in a finite time, read and 
> attach meaning to a full (space-time) description of itself, including the 
> act of reading this description and so on recursively.
> 

Not at all. Consistency is the only requirement. If the observer goes
looking for erself, then e will find erself in the description. It
doesn't imply the observer is doing this all the time.

> Which is impossible, of course.
> 

Of course.

> You also said:
> 
> >I'm not entirely sure I distinguish your difference between "external
> >world" and "internal representation". We're talking about observations
> >here, not models.
> 
> I'm sure you can distinguish *my* mental representation of the world from 
> your own. Hence if we share a world, and you can't distinguish between 
> that world and your internal representation, then you are not granting 
> equal status to other observers such as me.
> 

I'm not sure that is the case. I have a theory of your mind. I get it
most economically by observing my own mind, hence I'm self-aware. My
theory of the mind says that you are doing the same thing. Isn't this
symmetric? 

> You also said (quoting me):
> 
> >>My problem is that you are trying to make your observers work at two 
> >>different levels: as structures within the universes generated 
> >>(somehow!) by your bitstrings, but also as an interpretive principle 
> >>for producing meaning by operating *on* the bitstrings.  It's a bit 
> >>like claiming that PCs are built by "The Sims".
> >
> >Yes it is a bit like that. Obviously, the Anthropic Principle (or its 
> >equivalent) does not work with "The Sims".
> 
> Actually I don't see why not. The existence of The Sims implies a universe 
> compatible with the existence of Sims. But granting this is not so for the 
> sake of the argument, presumably the AP *will* apply to the Sims Mark VII 
> which will be fully self-aware artificial intelligences. 

If the AP applies to the Sims Mark VII, then their reality will be a
description containing a "body" corresponding to their
intelligences. They will not be aware of the PC that their description
is being generated on. We, who inhabit the world with the PC will not
be aware of the countless other PCs, Macs, Xboxes, Eniacs, Turing
machines, pebbles in Zen monasteries etc running Sims Mark VII. So the
PC itself is actually irrelevant from the internal perspective of the Sims.

> But it will still 
> be absu

Re: Questions on Russell's "Why Occam" paper

2005-06-08 Thread Russell Standish
If we're allowing ourselves a little informality, then I'd appeal to
the notion of observer moment. Within any observer moment, a finite
number of bits of the bitstrings has been read, and processed by the
observer. Since only a finite number of bits have been processed to
determine the meaning of reality at that moment, the
observer map O(x) is a prefix map. Hence at any point in time the
arguments in section 2 of the paper hold.

The meaning O(x) could also be called the "observer moment". If
observer moments are enumerable, one can inject OMs into the set of
natural numbers.

Observers find themselves embedded in a psychological time. I have not
been explicit about exactly what this time is, however I envisage it
to probably be what mathematicians call a "time scale", which is a
closed subset of the real numbers. Time could be continuous, or it
could be discrete (eg the set of natural numbers). It could be
something else, eg rational numbers or the Cantor set. All of these
are example time scales. The exact nature of time is something to be
settle later (if possible), but if you are more comfortable witrh
discrete time (as many are on this list), then you are welcome to use integers.

How this feeds back to our original observer map is that we'd expect
the map O(x) to be dependent on time, ie O(t,x). This is consistent
with time being "psychological". The description or "universe" x is
independent of time. It would correspond to what David Deutsch calls a
block universe.

Now perhaps section 3 makes some sense. What I call "robustness" of
the observer, ie that observers will not be fooled by a little noise
on the line - lions in camouflage are still observed to be lions for
instance constrains the form of time evolution of O(t,x). I haven't
formalised exactly what this constraint is, but it is something along
the lines of continuity of |O^{-1}(t,O(t,x))|, or continuity of the
observed complexity of the world. 

On Wed, Jun 08, 2005 at 09:09:04AM -0700, "Hal Finney" wrote:
> Russell Standish writes:
> > On Mon, Jun 06, 2005 at 01:51:36PM -0700, "Hal Finney" wrote:
> > > In particular, if "an observer attaches sequences of meanings to sequences
> > > of prefixes of one of these strings", then it seems that he must have a
> > > domain which does allow some inputs to be prefixes of others.  Isn't that
> > > what "sequences of prefixes" would mean?  That is, if the infinite string
> > > is 01011011100101110111..., then a sequence of prefixes might be 0, 01,
> > > 010, 0101, 01011,   Does O() apply to this sequence of prefixes?  If
> > > so then I don't think it is a prefix map.
> >
> > Yes I agree this is vague, and seemingly contradictory. I'm not sure
> > how to make this more precise, but one way to read the paper is to
> > treat observers as prefix maps for section 2 (Occam's razor), and then
> > for section 3 (White Rabbit problem) ignore the prefix property.
> >
> > It could be that the way of making this more precise is to assume
> > observers have some internal state that is constantly updated (a time
> > counter perhaps), so actually going through a sequence of prefix maps
> > in (psychological) time, but at this stage I don't have an answer.
> 
> Unfortunately I still don't understand this.  You agree that it is a
> seeming contradiction but that doesn't help me to see how to interpret it.
> 
> Here's an idea.  Would it be possible for you to explain how this
> page is meant to be understood, in an INformal way?  Often when people
> present concepts they do a formal writeup, but if they give a seminar
> or explanation they will depart from the formalism and explain what is
> really going on behind the scenes.  That's the kind of explanation I think
> I need.
> 
> Could you explain how these concepts relate to the actual experiences
> we have as human observers?  What are "descriptions" and "meanings"
> in terms of our sensory and mental experiences?  Which "descriptions"
> does an observer observe?  What are the "sequences of prefixes" and
> how do they relate to our day to day lives?  What is the point of the
> equivalence classes and what does that have to do with what we observe?
> 
> I think an informal explanation of these topics would help me, and
> perhaps Paddy, to better understand the structure that you formally
> describe.  At this point I am still failing to see how it all relates
> to my experience of the world as an observer.
> 
> Thanks -
> 
> Hal Finney

-- 
*PS: A number of people ask me about the attachment to my email, which
is of type "application/pgp-signature". Don't worry, it is not a
virus. It is an electronic signature, that may be used to verify this
email came from me if you have PGP or GPG installed. Otherwise, you
may safely ignore this attachment.


A/Prof Russell Standish  Phone 8308 3119 (mobile)
Mathematics0425 253119 (")
UNSW SYDNEY 20

Re: Questions on Russell's "Why Occam" paper

2005-06-08 Thread Patrick Leahy


[Russell Standish wrote]:


The AP is a statement that observed reality must be consistent with
the observer being part of that reality.


Famously, this can be interpreted as either a trivial tautology (Brandon 
Carter's original intention, I think), or an almost-obviously false 
principle of necessity (Barrow & Tipler's SAP). If you think there's a 
mystery here it suggests you go for the necessity version, but given your 
infinite ensemble the tautology would suffice perfectly well.


You also said:

>The observer _is_ the interpreter. There may well be more than one 
>observer in the picture, but they'd better agree!


Why does this follow? 


It follows from the Anthropic Principle. If O_1 is consistent with its 
observed reality, and O_2 is consistent with its observed reality, and 
O_1 observes O_2 in its reality, then O_1 and O_2 must be consistent 
with each other (at least with respect to their observed realities).


Ah. Just to be sure, do you mean that the string the observer "attaches 
meaning to" is the one which describes the very same observer? This seems 
to be implied by your comment above; but you don't say it or clearly 
imply it in your paper.


Then you are implying that the observer can, in a finite time, read and 
attach meaning to a full (space-time) description of itself, including the 
act of reading this description and so on recursively.


Which is impossible, of course.

You also said:


I'm not entirely sure I distinguish your difference between "external
world" and "internal representation". We're talking about observations
here, not models.


I'm sure you can distinguish *my* mental representation of the world from 
your own. Hence if we share a world, and you can't distinguish between 
that world and your internal representation, then you are not granting 
equal status to other observers such as me.


You also said (quoting me):

My problem is that you are trying to make your observers work at two 
different levels: as structures within the universes generated 
(somehow!) by your bitstrings, but also as an interpretive principle 
for producing meaning by operating *on* the bitstrings.  It's a bit 
like claiming that PCs are built by "The Sims".


Yes it is a bit like that. Obviously, the Anthropic Principle (or its 
equivalent) does not work with "The Sims".


Actually I don't see why not. The existence of The Sims implies a universe 
compatible with the existence of Sims. But granting this is not so for the 
sake of the argument, presumably the AP *will* apply to the Sims Mark VII 
which will be fully self-aware artificial intelligences. But it will still 
be absurd to claim that the Sims are responsible for construction of PCs 
(assuming they are not connected to robot arms etc, for which no analogs 
exist in your theory). Let alone for them to construct the actual PC on 
which they are running, as apparently implied by your last message... even 
robot arms wouldn't help there.


Paddy Leahy

==
Dr J. P. Leahy, University of Manchester,
Jodrell Bank Observatory, School of Physics & Astronomy,
Macclesfield, Cheshire SK11 9DL, UK
Tel - +44 1477 572636, Fax - +44 1477 571618



Re: Questions on Russell's "Why Occam" paper

2005-06-08 Thread "Hal Finney"
Russell Standish writes:
> On Mon, Jun 06, 2005 at 01:51:36PM -0700, "Hal Finney" wrote:
> > In particular, if "an observer attaches sequences of meanings to sequences
> > of prefixes of one of these strings", then it seems that he must have a
> > domain which does allow some inputs to be prefixes of others.  Isn't that
> > what "sequences of prefixes" would mean?  That is, if the infinite string
> > is 01011011100101110111..., then a sequence of prefixes might be 0, 01,
> > 010, 0101, 01011,   Does O() apply to this sequence of prefixes?  If
> > so then I don't think it is a prefix map.
>
> Yes I agree this is vague, and seemingly contradictory. I'm not sure
> how to make this more precise, but one way to read the paper is to
> treat observers as prefix maps for section 2 (Occam's razor), and then
> for section 3 (White Rabbit problem) ignore the prefix property.
>
> It could be that the way of making this more precise is to assume
> observers have some internal state that is constantly updated (a time
> counter perhaps), so actually going through a sequence of prefix maps
> in (psychological) time, but at this stage I don't have an answer.

Unfortunately I still don't understand this.  You agree that it is a
seeming contradiction but that doesn't help me to see how to interpret it.

Here's an idea.  Would it be possible for you to explain how this
page is meant to be understood, in an INformal way?  Often when people
present concepts they do a formal writeup, but if they give a seminar
or explanation they will depart from the formalism and explain what is
really going on behind the scenes.  That's the kind of explanation I think
I need.

Could you explain how these concepts relate to the actual experiences
we have as human observers?  What are "descriptions" and "meanings"
in terms of our sensory and mental experiences?  Which "descriptions"
does an observer observe?  What are the "sequences of prefixes" and
how do they relate to our day to day lives?  What is the point of the
equivalence classes and what does that have to do with what we observe?

I think an informal explanation of these topics would help me, and
perhaps Paddy, to better understand the structure that you formally
describe.  At this point I am still failing to see how it all relates
to my experience of the world as an observer.

Thanks -

Hal Finney



Re: Questions on Russell's "Why Occam" paper

2005-06-07 Thread Russell Standish
On Tue, Jun 07, 2005 at 10:15:03PM +0100, Patrick Leahy wrote:

> >>>Now an observer will expect to find a SAS in one of the descriptions
> >>>as a corrolory of the anthropic principle, which is explicitly stated
> >>>as one of the assumptions in this work. I make no bones about this - I
> >>>consider the anthropic principle a mystery, not self-evident like
> >>>many people.
> >>
> >>Very few supporters of the AP would "expect to find a SAS" in a bitstring.
> >>Until you *specify* a way of interpreting the string, it contains nothing
> >>but bits.
> >
> >The observer specifies the interpretation.
> 
> But the observer is *generated* by the interpretation! Until you have an 
> interpretation, you have no observers. And until you have an observer, you 
> have no interpretation (at least that's how I read the sentence quoted 
> above).

No the observer is somehow primary. As are the descriptions. If it
weren't for the anthropic principle, there would be no connection
between the two, and we'd have a genuine "brain-in-the-vat".

> 
> How can structures which exist as some sort of pattern inside a bit string 
> (or am I supposed to say in the "meaning" integer output by some (other) 
> O(x)?) read a separate bitstring which exists as a parallel universe in 
> Platonia?
> 

That is what they do, by definition. Observers observe. Platonia is a
collection of all possible observations, hence what observers observe
is in Platonia.

> 
> So you find it a mystery that you have a face, eye etc??  If so, what does 
> the AP have to do with this mystery? Actually, maybe it would clarify 
> things if you said what you mean by the AP; it certainly doesn't seem to 
> be very like the AP that I know about.
> 

The AP is a statement that observed reality must be consistent with
the observer being part of that reality.


> I'm not sure whether your "my reality" refers to the external world or 
> your internal representation of it. I guess the latter, otherwise your 
> body would be you, not a token representing you.
> 

I'm not entirely sure I distinguish your difference between "external
world" and "internal representation". We're talking about observations
here, not models.

> >>In particular, any bitstring can be "interpreted" as any other bitstring
> >>by an appropriate map. Hence until you specify an interpreter you are
> >>simply not proposing a theory at all.
> >>
> >
> >The observer _is_ the interpreter. There may well be more than one
> >observer in the picture, but they'd better agree!
> 
> Why does this follow? Your "observers" are maps O(x) from prefix strings 
> to the integers. Why can't you have two inconsistent maps... or rather, 
> how can you possibly avoid such?  And since two different maps don't 
> interact at all (how can a mapping interact with another mapping?) each of 
> your observers seems to be sealed in his own little universe. In which 
> case having >1 observer appears to be an unverifiable speculation, which 
> is why I say it seems like solipsism.
> 

It follows from the Anthropic Principle. If O_1 is consistent with its
observed reality, and O_2 is consistent with its observed reality,
and O_1 observes O_2 in its reality, then O_1 and O_2 must be
consistent with each other (at least with respect to their observed realities).


> >>Most readers of your paper would take it that you are making a strong
> >>ontological proposition, i.e. that the basis of reality is your set of
> >>bitstrings.
> >
> >This is the case.
> 
> Well, if you are making an ontological proposition, you are ipso facto not 
> just explaining appearances. In your model your "bitstrings" *are* the 
> noumenon (in Kant's terminology). 

Sorry, I don't see this. The bitstrings are phenomena. No noumenon
appears. It is possible we are arguing semantic differences only though.

> >
> >>In this case "where they live" is crucial because it
> >>defines the environment the SAS find themselves in.
> >
> >Why?
> 
> An intelligent system is "intelligent" by virtue of the way it interacts 
> with its environment. Think about the Turing test again: we conclude that 
> the computer is (not) intelligent because of the way it interacts with us. 
> To put it another way, you define these things as "observers". This 
> implies something observed. Obviously your model had better account for 
> people (observers) like us observing something like the world we see 
> ("where we live"), and preferably interacting with other people who are 
> granted equal status your ontology.

Does it not? OK, it is not exactly explicit about it, but this
situation should appear somewhere in Platonia.

> 
> >It is not solipsism, if only for the reason that multiple observers
> >exist in our observed reality. They are all as real as our own 
> >consciousness.
> >
> >Bruno Marchal calls this "shared dreaming". It seems apt.
> 
> If that's the *only* reason it's not solipsism, then I would say you just 
> don't have the courage of your convictions. Bruno's "shared dre

Re: Questions on Russell's "Why Occam" paper

2005-06-07 Thread Russell Standish
On Tue, Jun 07, 2005 at 05:57:17PM +0200, Bruno Marchal wrote:
> 
> Le 07-juin-05, ? 12:28, Russell Standish a ?crit :
> 
> >On Tue, Jun 07, 2005 at 10:37:10AM +0200, Bruno Marchal wrote:
> >>
> >>OK. it seems to me that (equation 14 at
> >>http://parallel.hpc.unsw.edu.au/rks/docs/occam/node4.html  )
> >>
> >>?
> >>
> >
> >In LaTeX, this equation is
> >
> >\frac {d\psi}{d t}={\cal H}(\psi)
> >
> >It supposes time, but not space (TIME postulate). Moreover, it
> >supposes continuous time,
> 
> 
> Yes but that is a lot of assumptions. Why a linear time capable of 
> being represented by the very special line with the usual topology of 
> the reals? I can imagine many topology on the reals.
> 

I thought the definition of the reals defined its topology? Perhaps
you're using topology somewhat differently.

It is a good question as to why time should have a topological
dimension of 1, and I admit to not having a good answer to that. All I
can say is that computationalism also introduces a time with
topological dimension of 1. Tegmark has some arguments as to why the
topological dimension is 1, and not any other number, but these are
not overly persuasive (the argument depends on properties of 2nd order
PDEs, which already carries a greater amount of baggage)

I only assume continuity to make contact with standard QM. It is an
arbitrary assumption in my opinion, and should rightly be viewed with
suspicion. 

> 
> 
> >but I do suggest in the paper how it might
> >be generalised to other possible timescales.
> 
> 
> yes but if you pretend to derive your equation, I don't understand what 
> you mean by generalizing your conclusion (if only by: I have not derive 
> it and it remains some work to do).
> 

Of course there remains some work to be done.

> 
> 
> >Perhaps it also supposes
> >continuity in time for \psi, although this probably flows from
> >assuming continuity of time.
> 
> 
> Why should a function be continuous just because it is defined on a 
> topological space (which is what I assume you are saying when you say 
> continuity of time).
> 

Because then it wouldn't be an evolution. For a state \psi(t') to
depend on the state \psi(t), there must be a corresponding limit
\psi(t')->\psi(t) as t'->t. Of course if t were not continuous, then
this condition is no longer necessary.

> 
> 
> >I do not think time is necessarily
> >continuous - I think it is interesting to explore alternative QMs
> >without this assumption.
> 
> 
> Sure. But again how to talk on derivation then. I mean if someone 
> pretend to derive B from A, then if someone else derive something m?ore 
> general than B from A, it is a critic of the assertion that B has been 
> derived from A. If from facts I can derive the murderer is among John 
> and Charles, I am not so interested in  knowing the derivation can be 
> generalized into leading that the murderer is among John, Charles, Lee, 
> Bruno and Nicole!
> 

To use your analogy, the situation is more like:

Assuming the murderer is John, Charles, Fred or Diana, I have shown
that the murderer must be one of John or Charles. However, if we also
consider Lee, Bruno and Nicole ... there is more work to be done.

The assumption of J,C,F or D corresponds by analogy to the continuity
assumption. 

L,B and N are not continuous - does that sound right? :)

> 
> 
> >
> >The question is whether this is the most general evolution equation
> >for continuous time, or whether there is some more general
> >equation.
> 
> 
> Absolutely.
> 

I think it is, for the reason above. I willing to stand corrected
should that not be the case.

> 
> 
> >Remember, we do have already that \psi is a member of a
> >Hilbert space, so we can write things like:
> 
> 
> OK, but you assume Set theory (that by itself is huge in our context). 
> I show only that you have a preHilbetian space (why should "cauchy 
> sequence of vectors converges).
> 
> 

Yes I do assume set theory. That is stated.

I prove Cauchy sequences converge in the paper. It requires the use of
Kolmogorov axiom no. 6. Hence the space of states is a Hilbert space.

> 
> >
> >\psi(t')-\psi(t) = ...
> >
> >What do you mean by derivability notion for H, and topological notion?
> 
> 
> Topological notion are needed for talking of continuity (a continuous 
> function is just a function from topological space into a topological 
> space such that the inverse image of open set is an open set).. You 
> "assume" the familiar topology of reals, complex number, etc.

Ah yes, you're talking about topological spaces. I just did a quick
refresher course on these using Wikipedia. Indeed, when I make the
arbitrary continuity assumption of time, it is an assumption that time
is a subset of the reals, which has the usual metric and topology
defined. When you say you can imagine many topologies on the reals,
what you are really saying is that you can imagine sets isomorphic to
the reals (1-1 corrspendence), that have many different topologies, eg
the trivial or the discrete

Re: Questions on Russell's "Why Occam" paper

2005-06-07 Thread Patrick Leahy


On Tue, 7 Jun 2005, Russell Standish wrote:


Hal dealt with this one already, I notice. 2^\aleph_0 = c. \aleph_1 is
something else entirely.


d'oh!







Now an observer will expect to find a SAS in one of the descriptions
as a corrolory of the anthropic principle, which is explicitly stated
as one of the assumptions in this work. I make no bones about this - I
consider the anthropic principle a mystery, not self-evident like
many people.


Very few supporters of the AP would "expect to find a SAS" in a bitstring.
Until you *specify* a way of interpreting the string, it contains nothing
but bits.


The observer specifies the interpretation.


But the observer is *generated* by the interpretation! Until you have an 
interpretation, you have no observers. And until you have an observer, you 
have no interpretation (at least that's how I read the sentence quoted 
above).


How can structures which exist as some sort of pattern inside a bit string 
(or am I supposed to say in the "meaning" integer output by some (other) 
O(x)?) read a separate bitstring which exists as a parallel universe in 
Platonia?







Why should an observer expect to see a token of erself
embedded in reality? That is the mystery of the AP.


What ARE you talking about?  Observer's don't see tokens of
themselves...


I can see that I have a body - if I look in the mirror I can see a
face, eye etc, all of which appear to be under my control. This is a
token embedded in my reality that represents me.


So you find it a mystery that you have a face, eye etc??  If so, what does 
the AP have to do with this mystery? Actually, maybe it would clarify 
things if you said what you mean by the AP; it certainly doesn't seem to 
be very like the AP that I know about.


I'm not sure whether your "my reality" refers to the external world or 
your internal representation of it. I guess the latter, otherwise your 
body would be you, not a token representing you.



In particular, any bitstring can be "interpreted" as any other bitstring
by an appropriate map. Hence until you specify an interpreter you are
simply not proposing a theory at all.



The observer _is_ the interpreter. There may well be more than one
observer in the picture, but they'd better agree!


Why does this follow? Your "observers" are maps O(x) from prefix strings 
to the integers. Why can't you have two inconsistent maps... or rather, 
how can you possibly avoid such?  And since two different maps don't 
interact at all (how can a mapping interact with another mapping?) each of 
your observers seems to be sealed in his own little universe. In which 
case having >1 observer appears to be an unverifiable speculation, which 
is why I say it seems like solipsism.








All that is discussed in this paper is appearances - we only try to
explain the phenomenon (things as they appear). No attempt is made to
explain the noumenon (things as they are), nor do we need to assume
that there is a noumenon.


Most readers of your paper would take it that you are making a strong
ontological proposition, i.e. that the basis of reality is your set of
bitstrings.


This is the case.


Well, if you are making an ontological proposition, you are ipso facto not 
just explaining appearances. In your model your "bitstrings" *are* the 
noumenon (in Kant's terminology).  Kant's point was that you can't infer 
the nature of "things in themselves" from observation. He didn't say that 
you can't speculate about their nature, and even guess right, by chance. 
In effect, this mailing list discusses nothing but the nature of the 
noumenon. (Kant would probably say this is a waste of time, of course).




I think either your terminology or you model has now got very confused.
Are your "observer" TMs the observers (SAS) whose experiences your theory
is trying to explain?


Yes.


In this case "where they live" is crucial because it
defines the environment the SAS find themselves in.


Why?


An intelligent system is "intelligent" by virtue of the way it interacts 
with its environment. Think about the Turing test again: we conclude that 
the computer is (not) intelligent because of the way it interacts with us. 
To put it another way, you define these things as "observers". This 
implies something observed. Obviously your model had better account for 
people (observers) like us observing something like the world we see 
("where we live"), and preferably interacting with other people who are 
granted equal status your ontology.



It is not solipsism, if only for the reason that multiple observers
exist in our observed reality. They are all as real as our own consciousness.

Bruno Marchal calls this "shared dreaming". It seems apt.


If that's the *only* reason it's not solipsism, then I would say you just 
don't have the courage of your convictions. Bruno's "shared dreaming" 
sounds very like Leibniz's pre-established harmony, but that only works if 
you believe in a provident deity (if it ever worked for an

Re: Questions on Russell's "Why Occam" paper

2005-06-07 Thread Bruno Marchal


Le 07-juin-05, à 12:28, Russell Standish a écrit :


On Tue, Jun 07, 2005 at 10:37:10AM +0200, Bruno Marchal wrote:


OK. it seems to me that (equation 14 at
http://parallel.hpc.unsw.edu.au/rks/docs/occam/node4.html  )

?



In LaTeX, this equation is

\frac {d\psi}{d t}={\cal H}(\psi)

It supposes time, but not space (TIME postulate). Moreover, it
supposes continuous time,



Yes but that is a lot of assumptions. Why a linear time capable of 
being represented by the very special line with the usual topology of 
the reals? I can imagine many topology on the reals.





but I do suggest in the paper how it might
be generalised to other possible timescales.



yes but if you pretend to derive your equation, I don't understand what 
you mean by generalizing your conclusion (if only by: I have not derive 
it and it remains some work to do).





Perhaps it also supposes
continuity in time for \psi, although this probably flows from
assuming continuity of time.



Why should a function be continuous just because it is defined on a 
topological space (which is what I assume you are saying when you say 
continuity of time).





I do not think time is necessarily
continuous - I think it is interesting to explore alternative QMs
without this assumption.



Sure. But again how to talk on derivation then. I mean if someone 
pretend to derive B from A, then if someone else derive something mùore 
general than B from A, it is a critic of the assertion that B has been 
derived from A. If from facts I can derive the murderer is among John 
and Charles, I am not so interested in  knowing the derivation can be 
generalized into leading that the murderer is among John, Charles, Lee, 
Bruno and Nicole!






The question is whether this is the most general evolution equation
for continuous time, or whether there is some more general
equation.



Absolutely.




Remember, we do have already that \psi is a member of a
Hilbert space, so we can write things like:



OK, but you assume Set theory (that by itself is huge in our context). 
I show only that you have a preHilbetian space (why should "cauchy 
sequence of vectors converges).






\psi(t')-\psi(t) = ...

What do you mean by derivability notion for H, and topological notion?



Topological notion are needed for talking of continuity (a continuous 
function is just a function from topological space into a topological 
space such that the inverse image of open set is an open set).. You 
"assume" the familiar topology of reals, complex number, etc.
Derivability is a stronger requirement (although some algebraist would 
introduce many nuances). Someday I will show you make also assumption 
on consciousness, but that is more subtle, and then all physicist if 
not almost scientist are doing them when they pretend to solve the 
consciousness problem like Dennett, or when they put it under the rug 
(a little bit like Lee in his last posts, I would say).


Look Russell, as I said I appreciate your attempt, it is just that, as 
Hal and Paddy mentionned, there remains quite a lot of work to make it 
thoroughly communicable. You should really put more clearly your 
assumptions. You assume a vast part of mathematics, and I would say of 
physics, mainly with your time postulate and your equation. Compare 
your work with those I have mentionned (I will give the reference for 
those you don't have yet). Don't compare it to quickly  to mine where 
the assumptions are made still at a much more basic (logical and 
arithmetical) level. I assume less than Peano arithmetic. I know I 
could seem a little bit presomptuous, but nuance would make the post 
more long and more boring. Hope you don't mind, (actually I would be 
glad someone criticize the most severely possible my work),


Bruno

http://iridia.ulb.ac.be/~marchal/




Re: Questions on Russell's "Why Occam" paper

2005-06-07 Thread Russell Standish
On Tue, Jun 07, 2005 at 10:37:10AM +0200, Bruno Marchal wrote:
> 
> OK. it seems to me that (equation 14 at 
> http://parallel.hpc.unsw.edu.au/rks/docs/occam/node4.html  ) 
> 
> ?
> 

In LaTeX, this equation is

\frac {d\psi}{d t}={\cal H}(\psi)

It supposes time, but not space (TIME postulate). Moreover, it
supposes continuous time, but I do suggest in the paper how it might
be generalised to other possible timescales. Perhaps it also supposes
continuity in time for \psi, although this probably flows from
assuming continuity of time. I do not think time is necessarily
continuous - I think it is interesting to explore alternative QMs
without this assumption.

The question is whether this is the most general evolution equation
for continuous time, or whether there is some more general
equation. Remember, we do have already that \psi is a member of a
Hilbert space, so we can write things like:

\psi(t')-\psi(t) = ...

What do you mean by derivability notion for H, and topological notion?

>  
> 
> is really presupposing a lot. Where does that come from? It presupposes 
> a space/time geometry, continuity, derivability notion for H, 
> topological notion, etc. 
> 
> To begin with. 
> 
> Bruno 
> 
> http://iridia.ulb.ac.be/~marchal/ 
> 
> 


-- 
*PS: A number of people ask me about the attachment to my email, which
is of type "application/pgp-signature". Don't worry, it is not a
virus. It is an electronic signature, that may be used to verify this
email came from me if you have PGP or GPG installed. Otherwise, you
may safely ignore this attachment.


A/Prof Russell Standish  Phone 8308 3119 (mobile)
Mathematics0425 253119 (")
UNSW SYDNEY 2052 [EMAIL PROTECTED] 
Australiahttp://parallel.hpc.unsw.edu.au/rks
International prefix  +612, Interstate prefix 02



pgpzEx7MGUiyd.pgp
Description: PGP signature


Re: Questions on Russell's "Why Occam" paper

2005-06-07 Thread Bruno Marchal

Le 07-juin-05, à 09:20, Russell Standish a écrit :

On Tue, Jun 07, 2005 at 08:29:57AM +0200, Bruno Marchal wrote:
Le 06-juin-05, ? 22:51, Hal Finney a ?crit :

I share most of Paddy Leahy's concerns and areas of confusion with
regard to the "Why Occam" discussion so far.  I really don't understand
what it means to explain appearances rather than reality.


Well this I understand. I would even argue that Everett gives an 
example by providing an explanation of the appearance of a wave 
collapse from the SWE (Schroedinger Wave equation) and this without any 
*real*collapse.
And I pretend at least that if comp is correct, then the SWE as an 
*appearance* emerges statistically from the "interference" of all 
computations as seen from some inner point of view of the mean 
universal machine.
But, as I pointed a long time ago Russell is hiding (de facto, not 
intentionally I guess :) many assumptions.

It would be nice to expose these "hidden" assumptions. As far as I'm
aware, all my assumptions are exposed and upfront, where at least you
as a reader can decide if you agree, but there is always the
possibility of some that I've missed.


OK. it seems to me that (equation 14 at
http://parallel.hpc.unsw.edu.au/rks/docs/occam/node4.html  )

 <>

is really presupposing a lot. Where does that come from? It presupposes a space/time geometry, continuity, derivability notion for H, topological notion, etc.

To begin with.

Bruno

http://iridia.ulb.ac.be/~marchal/


Re: Questions on Russell's "Why Occam" paper

2005-06-07 Thread Russell Standish
On Tue, Jun 07, 2005 at 08:29:57AM +0200, Bruno Marchal wrote:
> 
> Le 06-juin-05, ? 22:51, Hal Finney a ?crit :
> 
> >I share most of Paddy Leahy's concerns and areas of confusion with
> >regard to the "Why Occam" discussion so far.  I really don't understand
> >what it means to explain appearances rather than reality.
> 
> 
> Well this I understand. I would even argue that Everett gives an 
> example by providing an explanation of the appearance of a wave 
> collapse from the SWE (Schroedinger Wave equation) and this without any 
> *real*collapse.
> And I pretend at least that if comp is correct, then the SWE as an 
> *appearance* emerges statistically from the "interference" of all 
> computations as seen from some inner point of view of the mean 
> universal machine.
> But, as I pointed a long time ago Russell is hiding (de facto, not 
> intentionally I guess :) many assumptions.

It would be nice to expose these "hidden" assumptions. As far as I'm
aware, all my assumptions are exposed and upfront, where at least you
as a reader can decide if you agree, but there is always the
possibility of some that I've missed.

> There are a lot of "derivation" of the SWE in the literature, it would 
> be interesting that Russell compares them with its own. My favorite one 
> is the one by Henry and another one by Hardy.

The only thing I was aware of by Henry was a derivation of the
correspondence principle from gauge invariance in a paper you sent me,
something I think that Stenger does better in his book (which is
almost published now!).

And as for Hardy, I never found his axioms terribly "reasonable",
unfortunately. 

> Note the incredible derivation of QM from just 5 experiments + a 
> natural principle of simplicity by Julian Swinger in his QM course 
> (taken again by Towsend in its QM textbook). I will give reference once 
> less busy.
>

Sure - I'm not aware of that.
 
> I agree with Hal and Paddy about the lack of clarity in many passages.
> Note that my result is infinitely more modest (despite the 
> appearance!). 

Hardly infinitely more modest. You start from a slightly different
basis (COMP thesis vs all descriptions ensemble), derive the
existence of what I assume, and end up not quite where I end
up. Perhaps if you adopted Kolmogorov probability axioms, you could
get the full QM theory to result. The other things I assume tend to be
assumed by you also - COMP => TIME, and I think you assume PROJ. Not
sure where your work stand with the Anthropic Principle.

> I just prove that if comp is assumed to be correct then a 
> derivation of the SWE *must* exist, without providing it. Well, in the 
> interview of the Lobian machine I do extract some 'quantum logic' from 
> comp, but it is too early to judge if the SWE can be extracted from it. 
> But it should be, in principle, if comp is true. Advantage: I just 
> assume natural numbers and classical logic, I don't assume any geometry 
> or temporality, which for me are really the miraculous things in need 
> to be explained.
> 
> Bruno
> 

-- 
*PS: A number of people ask me about the attachment to my email, which
is of type "application/pgp-signature". Don't worry, it is not a
virus. It is an electronic signature, that may be used to verify this
email came from me if you have PGP or GPG installed. Otherwise, you
may safely ignore this attachment.


A/Prof Russell Standish  Phone 8308 3119 (mobile)
Mathematics0425 253119 (")
UNSW SYDNEY 2052 [EMAIL PROTECTED] 
Australiahttp://parallel.hpc.unsw.edu.au/rks
International prefix  +612, Interstate prefix 02



pgpQV02YpvmwP.pgp
Description: PGP signature


Re: Questions on Russell's "Why Occam" paper

2005-06-06 Thread Bruno Marchal


Le 06-juin-05, à 22:51, Hal Finney a écrit :


I share most of Paddy Leahy's concerns and areas of confusion with
regard to the "Why Occam" discussion so far.  I really don't understand
what it means to explain appearances rather than reality.



Well this I understand. I would even argue that Everett gives an 
example by providing an explanation of the appearance of a wave 
collapse from the SWE (Schroedinger Wave equation) and this without any 
*real*collapse.
And I pretend at least that if comp is correct, then the SWE as an 
*appearance* emerges statistically from the "interference" of all 
computations as seen from some inner point of view of the mean 
universal machine.
But, as I pointed a long time ago Russell is hiding (de facto, not 
intentionally I guess :) many assumptions.
There are a lot of "derivation" of the SWE in the literature, it would 
be interesting that Russell compares them with its own. My favorite one 
is the one by Henry and another one by Hardy.
Note the incredible derivation of QM from just 5 experiments + a 
natural principle of simplicity by Julian Swinger in his QM course 
(taken again by Towsend in its QM textbook). I will give reference once 
less busy.


I agree with Hal and Paddy about the lack of clarity in many passages.
Note that my result is infinitely more modest (despite the 
appearance!). I just prove that if comp is assumed to be correct then a 
derivation of the SWE *must* exist, without providing it. Well, in the 
interview of the Lobian machine I do extract some 'quantum logic' from 
comp, but it is too early to judge if the SWE can be extracted from it. 
But it should be, in principle, if comp is true. Advantage: I just 
assume natural numbers and classical logic, I don't assume any geometry 
or temporality, which for me are really the miraculous things in need 
to be explained.


Bruno



It's hard to
get my mind around this kind of explanation and what to expect from it.
Also the way the Anthropic Principle applies to infinite strings seems
extremely vague until we have a clearer picture of how those strings
relate to reality.

One area I differ:

Paddy Leahy writes, quoting Russell:

However, as the cardinality of "my" ensemble is actually "c"
(cardinality of the real numbers), it is quite probably a completely
different beast.


There you go again with your radical compression. Without the reading 
I've
been doing in the last two weeks, I wouldn't have been able to decode 
this

statement as meaning:

2^\aleph_0 = \aleph_1 (by definition)

To assume c = \aleph_1 is the Continuum Hypothesis, which is 
unprovable

(within standard arithmetic).


Actually Russell did not bring aleph_1 into the picture at all.  All 
that

he referred to was aleph_0 and c which by definition is 2^aleph_0.
c is the cardinality of the reals and of infinite bit strings.  This is
all just definitional.  Whether c is the "next" infinite cardinal after
aleph_0 is the Continuum Hypothesis, but that is not relevant here.

Another area I had trouble with in Russell's answer was the concept of
a prefix map.  I understand that a prefix map is defined as a mapping
whose domain is finite bit strings such that none of them are a prefix
of any other.  But I'm not sure how to relate this to the infinite bit
strings that are "descriptions".

In particular, if "an observer attaches sequences of meanings to 
sequences

of prefixes of one of these strings", then it seems that he must have a
domain which does allow some inputs to be prefixes of others.  Isn't 
that
what "sequences of prefixes" would mean?  That is, if the infinite 
string

is 01011011100101110111..., then a sequence of prefixes might be 0, 01,
010, 0101, 01011,   Does O() apply to this sequence of prefixes?  
If

so then I don't think it is a prefix map.

I want to make it clear by the way that my somewhat pedantic and 
labored

examination of this page is not an attempt to be difficult or stubborn.
Rather, I find that by the third page, I don't understand what is going
on at all!  Even the very first sentence, "In the previous sections, I
demonstrate that formal mathematical systems are the most compressible,
and have highest measure amongst all members of the Schmidhuber 
ensemble,"

has me looking to see if I skipped a page!  I don't see where this is
discussed in any way.  So I hope that by pinning down and crystalizing
exactly what the first page is claiming, it will help me to see what
the more interesting third page is actually able to establish.  I think
Paddy is in much the same situation.

Hal Finney



http://iridia.ulb.ac.be/~marchal/




Re: Questions on Russell's "Why Occam" paper

2005-06-06 Thread Russell Standish
On Mon, Jun 06, 2005 at 01:51:36PM -0700, "Hal Finney" wrote:
> 
> Another area I had trouble with in Russell's answer was the concept of
> a prefix map.  I understand that a prefix map is defined as a mapping
> whose domain is finite bit strings such that none of them are a prefix
> of any other.  But I'm not sure how to relate this to the infinite bit
> strings that are "descriptions".

A prefix map attaches the same output to all strings that share a
common finite length prefix.

> 
> In particular, if "an observer attaches sequences of meanings to sequences
> of prefixes of one of these strings", then it seems that he must have a
> domain which does allow some inputs to be prefixes of others.  Isn't that
> what "sequences of prefixes" would mean?  That is, if the infinite string
> is 01011011100101110111..., then a sequence of prefixes might be 0, 01,
> 010, 0101, 01011,   Does O() apply to this sequence of prefixes?  If
> so then I don't think it is a prefix map.
> 

Yes I agree this is vague, and seemingly contradictory. I'm not sure
how to make this more precise, but one way to read the paper is to
treat observers as prefix maps for section 2 (Occam's razor), and then
for section 3 (White Rabbit problem) ignore the prefix property.

It could be that the way of making this more precise is to assume
observers have some internal state that is constantly updated (a time
counter perhaps), so actually going through a sequence of prefix maps
in (psychological) time, but at this stage I don't have an answer.


> I want to make it clear by the way that my somewhat pedantic and labored
> examination of this page is not an attempt to be difficult or
> stubborn.

Even being difficult and stubborn has its place (to help winkle out
subtle errors of logic eg), so long as you relax enough at other times to
obtain understanding. I appreciate the effort in any case.

> Rather, I find that by the third page, I don't understand what is going
> on at all!  Even the very first sentence, "In the previous sections, I
> demonstrate that formal mathematical systems are the most compressible,
> and have highest measure amongst all members of the Schmidhuber ensemble,"
> has me looking to see if I skipped a page!  I don't see where this is
> discussed in any way. 

This is pretty much a tautology. Formal mathematical systems are a
means of compressing data in the form of facts about numbers. If one
were to include all such possible compression schemes, rather than
just the systems studies by mathematicians to date, one would end up
with the set of Turing machines, or equivalently of computable
functions. The Occam's razor result clearly relates measure to the
amount of compressibility in the description.

Perhaps such a view of mathematics is strange. Certainly I find it
strange when Stephen Wolfram says mathematics is incapable of
understanding complex phenomena, and one should cellular automata
instead. To me, cellular automata are just another example of a
mathematical system.

> So I hope that by pinning down and crystalizing
> exactly what the first page is claiming, it will help me to see what
> the more interesting third page is actually able to establish.  I think
> Paddy is in much the same situation.
> 
> Hal Finney

I hope so too.

-- 
*PS: A number of people ask me about the attachment to my email, which
is of type "application/pgp-signature". Don't worry, it is not a
virus. It is an electronic signature, that may be used to verify this
email came from me if you have PGP or GPG installed. Otherwise, you
may safely ignore this attachment.


A/Prof Russell Standish  Phone 8308 3119 (mobile)
Mathematics0425 253119 (")
UNSW SYDNEY 2052 [EMAIL PROTECTED] 
Australiahttp://parallel.hpc.unsw.edu.au/rks
International prefix  +612, Interstate prefix 02



pgpaBNDuiwTTe.pgp
Description: PGP signature


Re: Questions on Russell's "Why Occam" paper

2005-06-06 Thread Russell Standish
On Mon, Jun 06, 2005 at 12:06:06PM +0100, Patrick Leahy wrote:
> 
> 
> On Mon, 6 Jun 2005, Russell Standish wrote:
> 
> >I am beginning to regret calling the all descriptions ensemble with
> >uniform measure a Schmidhuber ensemble. I think what I meant was that
> >it could be generated by a standard dovetailer algorithm, running for
> >2^\aleph_0 timesteps.
> 
> It can't! Timesteps are denumerable, hence this statement is just a 
> contradiction in terms. You better postulate your ensemble without 
> reference to any algorithm to generate it.

Indeed I do. Only Schmidhuber uses the dovetailer. Hence my "regret".

...

> 
> 2^\aleph_0 = \aleph_1 (by definition)
> 

Hal dealt with this one already, I notice. 2^\aleph_0 = c. \aleph_1 is
something else entirely.

> 
> >
> >Now an observer will expect to find a SAS in one of the descriptions
> >as a corrolory of the anthropic principle, which is explicitly stated
> >as one of the assumptions in this work. I make no bones about this - I
> >consider the anthropic principle a mystery, not self-evident like
> >many people.
> 
> Very few supporters of the AP would "expect to find a SAS" in a bitstring.
> Until you *specify* a way of interpreting the string, it contains nothing 
> but bits.

The observer specifies the interpretation.

> 
> >Why should an observer expect to see a token of erself
> >embedded in reality? That is the mystery of the AP.
> 
> What ARE you talking about?  Observer's don't see tokens of
> themselves... 

I can see that I have a body - if I look in the mirror I can see a
face, eye etc, all of which appear to be under my control. This is a
token embedded in my reality that represents me.

> if anyone (God?) has a 3rd-person/bird's eye view, it is certainly not 
> someone who is included in any particular reality. No way is anything like 
> this implied by the AP. All the AP requires is that there *be* 
> observers/SAS in (real) universes, which is true in our case at least.
> 

Sorry - you lost me here ... oh well.

> >
> >>And now we find not only that the bit string is
> >>a description, but it is a complex enough description to describe SAS's?
> >>How does that work?
> >>
> >
> >The bitstrings are infinite in length. By reading enough bits, they can
> >have arbitrarily complex meanings attached to them.
> >
> 
> In particular, any bitstring can be "interpreted" as any other bitstring 
> by an appropriate map. Hence until you specify an interpreter you are 
> simply not proposing a theory at all.
> 

The observer _is_ the interpreter. There may well be more than one
observer in the picture, but they'd better agree!

> 
> >
> >All that is discussed in this paper is appearances - we only try to
> >explain the phenomenon (things as they appear). No attempt is made to
> >explain the noumenon (things as they are), nor do we need to assume
> >that there is a noumenon.
> 
> Most readers of your paper would take it that you are making a strong 
> ontological proposition, i.e. that the basis of reality is your set of 
> bitstrings. 

This is the case.

If this is *not* the case, and you think the bitstrings may be 
> represented in some deeper reality (or maybe are just metaphors), then 
> what is the motivation for your proposal? Why do we need to think about 
> this intermediate layer of bitstrings? The original simplicity goes out 
> the window.

This latter extrapolation is not the case.

> 
> BTW I'm with Kant: you can't have an appearance without an underlying 
> reality, even if that is unknowable.
> 

I'm not sure Kant says this, but in any case that's not important. I'm
with Marchal, who says if there is an underlying reality which is not
only unknowable, but also unnecessary to explain phenomena, then why
assume that particular hypothesis? It makes no sense.

> >Bruno Marchal has a detailed discussion on this in his thesis, and 
> >concludes that he "has no need for this hypothesis" (what he calls the 
> >extravagant hypothesis).
> >
> >So the former statement is true :[the description strings are] things 
> >that "observer" TM's observe and map to integers. It is also true that 
> >descriptions of self aware observers will appear within the description 
> >by the Anthropic Principle. The phenomenon of observerhood is included. 
> >However where the observers actually live is not a meaningful question 
> >in this framework.
> 
> I think either your terminology or you model has now got very confused. 
> Are your "observer" TMs the observers (SAS) whose experiences your theory 
> is trying to explain? 

Yes.

> In this case "where they live" is crucial because it 
> defines the environment the SAS find themselves in. 

Why?

> If you are not 
> careful your theory becomes effectively that we are all "brains in 
> bottles" or Leibnizian monads, which is solipsism by another name. 

It is not solipsism, if only for the reason that multiple observers
exist in our observed reality. They are all as real as our own consciousness.

Bruno Marchal cal

Re: Questions on Russell's "Why Occam" paper

2005-06-06 Thread "Hal Finney"
I share most of Paddy Leahy's concerns and areas of confusion with
regard to the "Why Occam" discussion so far.  I really don't understand
what it means to explain appearances rather than reality.  It's hard to
get my mind around this kind of explanation and what to expect from it.
Also the way the Anthropic Principle applies to infinite strings seems
extremely vague until we have a clearer picture of how those strings
relate to reality.

One area I differ:

Paddy Leahy writes, quoting Russell:
> > However, as the cardinality of "my" ensemble is actually "c" 
> > (cardinality of the real numbers), it is quite probably a completely 
> > different beast.
>
> There you go again with your radical compression. Without the reading I've 
> been doing in the last two weeks, I wouldn't have been able to decode this 
> statement as meaning:
>
> 2^\aleph_0 = \aleph_1 (by definition)
>
> To assume c = \aleph_1 is the Continuum Hypothesis, which is unprovable 
> (within standard arithmetic).

Actually Russell did not bring aleph_1 into the picture at all.  All that
he referred to was aleph_0 and c which by definition is 2^aleph_0.
c is the cardinality of the reals and of infinite bit strings.  This is
all just definitional.  Whether c is the "next" infinite cardinal after
aleph_0 is the Continuum Hypothesis, but that is not relevant here.

Another area I had trouble with in Russell's answer was the concept of
a prefix map.  I understand that a prefix map is defined as a mapping
whose domain is finite bit strings such that none of them are a prefix
of any other.  But I'm not sure how to relate this to the infinite bit
strings that are "descriptions".

In particular, if "an observer attaches sequences of meanings to sequences
of prefixes of one of these strings", then it seems that he must have a
domain which does allow some inputs to be prefixes of others.  Isn't that
what "sequences of prefixes" would mean?  That is, if the infinite string
is 01011011100101110111..., then a sequence of prefixes might be 0, 01,
010, 0101, 01011,   Does O() apply to this sequence of prefixes?  If
so then I don't think it is a prefix map.

I want to make it clear by the way that my somewhat pedantic and labored
examination of this page is not an attempt to be difficult or stubborn.
Rather, I find that by the third page, I don't understand what is going
on at all!  Even the very first sentence, "In the previous sections, I
demonstrate that formal mathematical systems are the most compressible,
and have highest measure amongst all members of the Schmidhuber ensemble,"
has me looking to see if I skipped a page!  I don't see where this is
discussed in any way.  So I hope that by pinning down and crystalizing
exactly what the first page is claiming, it will help me to see what
the more interesting third page is actually able to establish.  I think
Paddy is in much the same situation.

Hal Finney



Re: Questions on Russell's "Why Occam" paper

2005-06-06 Thread Patrick Leahy



On Mon, 6 Jun 2005, Russell Standish wrote:


I am beginning to regret calling the all descriptions ensemble with
uniform measure a Schmidhuber ensemble. I think what I meant was that
it could be generated by a standard dovetailer algorithm, running for
2^\aleph_0 timesteps.


It can't! Timesteps are denumerable, hence this statement is just a 
contradiction in terms. You better postulate your ensemble without 
reference to any algorithm to generate it.


However, as the cardinality of "my" ensemble is actually "c" 
(cardinality of the real numbers), it is quite probably a completely 
different beast.


There you go again with your radical compression. Without the reading I've 
been doing in the last two weeks, I wouldn't have been able to decode this 
statement as meaning:


2^\aleph_0 = \aleph_1 (by definition)

To assume c = \aleph_1 is the Continuum Hypothesis, which is unprovable 
(within standard arithmetic).





Now an observer will expect to find a SAS in one of the descriptions
as a corrolory of the anthropic principle, which is explicitly stated
as one of the assumptions in this work. I make no bones about this - I
consider the anthropic principle a mystery, not self-evident like
many people.


Very few supporters of the AP would "expect to find a SAS" in a bitstring.
Until you *specify* a way of interpreting the string, it contains nothing 
but bits.



Why should an observer expect to see a token of erself
embedded in reality? That is the mystery of the AP.


What ARE you talking about?  Observer's don't see tokens of themselves... 
if anyone (God?) has a 3rd-person/bird's eye view, it is certainly not 
someone who is included in any particular reality. No way is anything like 
this implied by the AP. All the AP requires is that there *be* 
observers/SAS in (real) universes, which is true in our case at least.





And now we find not only that the bit string is
a description, but it is a complex enough description to describe SAS's?
How does that work?



The bitstrings are infinite in length. By reading enough bits, they can
have arbitrarily complex meanings attached to them.



In particular, any bitstring can be "interpreted" as any other bitstring 
by an appropriate map. Hence until you specify an interpreter you are 
simply not proposing a theory at all.





All that is discussed in this paper is appearances - we only try to
explain the phenomenon (things as they appear). No attempt is made to
explain the noumenon (things as they are), nor do we need to assume
that there is a noumenon.


Most readers of your paper would take it that you are making a strong 
ontological proposition, i.e. that the basis of reality is your set of 
bitstrings. If this is *not* the case, and you think the bitstrings may be 
represented in some deeper reality (or maybe are just metaphors), then 
what is the motivation for your proposal? Why do we need to think about 
this intermediate layer of bitstrings? The original simplicity goes out 
the window.


BTW I'm with Kant: you can't have an appearance without an underlying 
reality, even if that is unknowable.


Bruno Marchal has a detailed discussion on this in his thesis, and 
concludes that he "has no need for this hypothesis" (what he calls the 
extravagant hypothesis).


So the former statement is true :[the description strings are] things 
that "observer" TM's observe and map to integers. It is also true that 
descriptions of self aware observers will appear within the description 
by the Anthropic Principle. The phenomenon of observerhood is included. 
However where the observers actually live is not a meaningful question 
in this framework.


I think either your terminology or you model has now got very confused. 
Are your "observer" TMs the observers (SAS) whose experiences your theory 
is trying to explain? In this case "where they live" is crucial because it 
defines the environment the SAS find themselves in.  If you are not 
careful your theory becomes effectively that we are all "brains in 
bottles" or Leibnizian monads, which is solipsism by another name. Or are 
your "observers" the missing "interpreters" in your theory which give it 
meaning, and allow us to find (in principle) the SAS within the bitstrings 
that represent actual observers like us? In this case it's unhelpful to 
call these meta-entities "observers"; rather, in effect, they constitute 
the (meta-)laws of physics. Incidentally, a TM by itself can't generate 
meaning, as it is only a map from integers to integers. You still have to 
specify externally how to interpret the code as something more than a mere 
number. (E.g. in the Turing test the output bits have to be processed into 
English language text).





The page then goes on to make some comments about measure applied to
universes.  Here again I am confused about how to relate it to all that
has been descibed.  What are the analogs of universes, in this model?
Is it "descriptions", the infinite bit strings?  From what h

Re: Questions on Russell's "Why Occam" paper

2005-06-05 Thread Russell Standish
On Fri, Jun 03, 2005 at 04:22:07PM -0700, "Hal Finney" wrote:
> Russell Standish recently mentioned his paper "Why Occam's Razor" which
> can be found at http://parallel.hpc.unsw.edu.au/rks/docs/occam/ .  Among
> other things he aims to derive quantum mechanics from a Schmidhuber type
> ensemble.  I have tried to read this paper but never really understood it.
> Here I will try to ask some questions, taking it slowly.
> 
> On this page, http://parallel.hpc.unsw.edu.au/rks/docs/occam/node2.html ,
> things get started.  Russell describes a set of infinite bit strings he
> calls "descriptions".  He writes:
> 
> "By contrast to Schmidhuber, I assume a uniform measure over these
> descriptions -- no particular string is more likely than any other."
> 
> This surprises me.  I thought that Schmidhuber assumed a uniform measure
> over bit strings considered as programs for his universal computer.  So
> what is the "contrast" to his work?

Nowhere in Schmidhuber (1997) does he propose a measure over the "input
programs". What he does is is justify the appearance of a universal
prior in the set of descriptions by passing the raw data through a
reference UTM. Presumably the set of descriptions is sampled uniformly
by the observer.

In Schmidhuber (2000), the set of descriptions is generated by a
machine which has resource bounds. This leads to the notion of "speed
prior" which differs from the universal prior in several important
respects. I sometimes refer to the two different ensembles as
Schmidhuber I and Schmidhuber II.

I am beginning to regret calling the all descriptions ensemble with
uniform measure a Schmidhuber ensemble. I think what I meant was that
it could be generated by a standard dovetailer algorithm, running for
2^\aleph_0 timesteps. However, as the cardinality of "my" ensemble is
actually "c" (cardinality of the real numbers), it is quite probably a
completely different beast. It is also not generated by a program,
Schmidhuber style, it simply is (in the sense of being the simplest
set - equivalent to nothing).

> 
> It seems that the greater contrast is that while Schmidhuber assumed that
> the bit strings would be fed into a computer that would produce outputs,
> Russell is taking the bit strings directly as raw data.
> 

Quite true.


> But I am confused about their role.
> 
> "Since some of these descriptions describe self aware substructures..."
> 
> Whoa!  This is a big leap for me.  First, I am not too happy that mere bit
> strings have been elevated with the title "descriptions".  A bit string on
> its own doesn't seem to have the inherent meaning necessary for it to be
> considered a description.  

Many apologies for deploying terminology in a different way to you
expect. A description (in my terminology) does not necessarily have
meaning. It is simply data. This is in accord with how I use the term
in casual English usage too - a description is simply a string of
letters, and may or may not be meaningful. Meaning is attached by an observer.

Now an observer will expect to find a SAS in one of the descriptions
as a corrolory of the anthropic principle, which is explicitly stated
as one of the assumptions in this work. I make no bones about this - I
consider the anthropic principle a mystery, not self-evident like
many people. Why should an observer expect to see a token of erself
embedded in reality? That is the mystery of the AP.

> And now we find not only that the bit string is
> a description, but it is a complex enough description to describe SAS's?
> How does that work?
> 

The bitstrings are infinite in length. By reading enough bits, they can
have arbitrarily complex meanings attached to them.

> It's especially confusing to read the introductory word "since" as though
> this is all quite obvious and need not be explained.  To me it is very
> confusing.
> 

Sorry for not going slow enough. The habits of concise expression are
hard to shake.

> The page goes on to identify these SAS's as observers.  Now they are mappings,
> or equivalently Turing Machines, which map finite bit strings to integers.
> These integers are the "meanings" of the bit strings.

Not equivalently. Not all maps can be represented by a Turing machine,
only computable ones.

> 
> I believe the idea here is that the bit strings are taken as prefixes
> of the "description" bit strings in the ensemble.  It is as though the
> "observers" are observing the "descriptions" a bit at a time, and mapping
> them to a sequence of integer "meanings".  Is that correct?
> 

Indeed that is one interpretation. The most important point is that
the observer map is a "prefix" map, in the sense of prefix machines of
Algorithmic Information Theory. In reading a bit string one bit at a
time, once a meaning is attached to the string, that is the meaning
for evermore - the observer cannot change er mind after reading a few
more bits.

Schmidhuber (2000) deals with machines that do "change their mind", so
perhaps there is some extension 

Questions on Russell's "Why Occam" paper

2005-06-03 Thread "Hal Finney"
Russell Standish recently mentioned his paper "Why Occam's Razor" which
can be found at http://parallel.hpc.unsw.edu.au/rks/docs/occam/ .  Among
other things he aims to derive quantum mechanics from a Schmidhuber type
ensemble.  I have tried to read this paper but never really understood it.
Here I will try to ask some questions, taking it slowly.

On this page, http://parallel.hpc.unsw.edu.au/rks/docs/occam/node2.html ,
things get started.  Russell describes a set of infinite bit strings he
calls "descriptions".  He writes:

"By contrast to Schmidhuber, I assume a uniform measure over these
descriptions -- no particular string is more likely than any other."

This surprises me.  I thought that Schmidhuber assumed a uniform measure
over bit strings considered as programs for his universal computer.  So
what is the "contrast" to his work?

It seems that the greater contrast is that while Schmidhuber assumed that
the bit strings would be fed into a computer that would produce outputs,
Russell is taking the bit strings directly as raw data.

But I am confused about their role.

"Since some of these descriptions describe self aware substructures..."

Whoa!  This is a big leap for me.  First, I am not too happy that mere bit
strings have been elevated with the title "descriptions".  A bit string on
its own doesn't seem to have the inherent meaning necessary for it to be
considered a description.  And now we find not only that the bit string is
a description, but it is a complex enough description to describe SAS's?
How does that work?

It's especially confusing to read the introductory word "since" as though
this is all quite obvious and need not be explained.  To me it is very
confusing.

The page goes on to identify these SAS's as observers.  Now they are mappings,
or equivalently Turing Machines, which map finite bit strings to integers.
These integers are the "meanings" of the bit strings.

I believe the idea here is that the bit strings are taken as prefixes
of the "description" bit strings in the ensemble.  It is as though the
"observers" are observing the "descriptions" a bit at a time, and mapping
them to a sequence of integer "meanings".  Is that correct?

So here is another confusion about the role of the "description" bit
strings in the model.  Are they things that "observer" TM's observe and
map to integers?  Or are they places where observers live, as suggested
by the "Since" line quoted above?  Or both?

Now it gets a little more complicated: "Under the mapping O(x), some
descriptions encode for identical meanings as other descriptions, so
one should equivalence class the descriptions."

The problem I have is, O takes only finite bit strings.  So technically
a "description", which is an infinite bit string, does not encode a
meaning.  What I think is meant here, though, is that two "descriptions"
(i.e. infinite bit strings) will be considered equivalent if for every
finite prefix of the strings, the O() mapping is the same.  So if we
think of O as "observing" the "description" bit strings one by one,
it will go through precisely the same sequence of integer "meanings"
in each case.  Is that right?

"In particular, strings where the bits after some bit number n are
``don't care'' bits, are in fact equivalence classes of all strings that
share the first n bits in common."

I think what this considers is a special O() and a special string prefix
such that if O sees that particular n-bit prefix, all extensions of
that prefix get mapped to the same "meaning" integer.  In that case the
condition described in my previous paragraph would be met, and all
strings with this n-bit prefix would be equivalent.

"One can see that the size of the equivalence class drops off
exponentially with the amount of information encoded by the string."

That seems a little questionable because the size of the equivalence class
is infinite in all cases.  However I think Russell means to use a uniform
measure where the collection of all strings with a particular n-bit
prefix have a measure of 1/2^n.  It's not clear how well this measure
really works or whether it applies to all sets of infinite strings.

"Under O(x), the amount of information is not necessarily equal to the
length of the string, as some of the bits may be redundant."

Now we have this new concept of "the amount of information" which has
not previously been defined.  This sentence is really hard for me.
What does it mean for bits to be redundant?  We just discussed strings
where all those after bit n are "don't care", but this sentence seems
to be envisioning other kinds of redundancies.

"The sum P_O(s) = [sum over p such that O(p)=s of] 2^(-|p|)
where |p| means the number of bits of p consumed by O in returning s,
gives the size of the equivalence class of all descriptions having
meaning s."

Boy, that's a tough one now.  We consider all bit strings p such that O(p)
= s.  Now, is this supposed to just be those cases described earlier where
the bits after |p|