On Apr 30, 2008, at 6:56 PM, Matt Mahoney wrote:
Which protocol are you referring to, the one described on my web page
or the abstract one described in my thesis?


Thesis.


In the abstract one I
described a network of n identical (but unreliable) peers, each
connected to c = O(log n) peers,...


Yeah, that is kind of what I meant. You have to design the protocol to guarantee that the aggregate network does not develop pathologies that greatly reduce its efficiency and utility; otherwise you *will* end up at a pathological minimum. If you look at, for example, the evolution of routing protocols, you will see that this lesson was learned the hard way -- multiple times. If you study Nash equilibria and similar, you will find that there are constrained cases for which you can design a protocol that generates efficient equilibria, but real network characteristics increasingly violate the constrained cases that we know how to design a decentralized protocol for. If you study BGP4 (the core routing protocol), you will see that it essentially uses the venerable Marginal Cost Pricing (MCP) optimization strategy, but with an increasing number of patches to deal with the fact that the assumptions that make it work are increasingly violated.

Your model above tacitly predicates its optimality on a naive MCP strategy, but it is not particularly well suited for it. In short, this means you are assuming that the aggregate latency function for a transaction over the network is a close proxy for the transaction cost. At one time this might have been a reasonable assumption, but it becomes less true every year. There are some interesting new strategies from other fields of mathematics that look like a more adaptive MCP strategy but which do not really solve the problem -- a Google search on MCP will put you in the right ballpark to find the rest of the literature. It is actually possible to make applications like yours work in an MCP model, but it requires fine-grained, authoritative accounting in an architecture that is supposed to be decentralized, which kind of defeats the purpose -- you have to be able to trust every node in a strong sense.
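To make that assumption concrete, here is a minimal sketch (mine, purely illustrative, not the protocol from your thesis) of what naive MCP-style selection looks like in practice: each peer routes over the path whose accumulated latency is lowest, treating measured latency as a stand-in for the true transaction cost. The graph representation and function names are hypothetical.

    import heapq

    def cheapest_route(graph, src, dst):
        """Dijkstra over per-link latency -- the 'latency is cost' proxy."""
        # graph: {node: [(neighbor, latency_ms), ...]}
        frontier = [(0.0, src, [src])]
        seen = set()
        while frontier:
            cost, node, path = heapq.heappop(frontier)
            if node == dst:
                return cost, path
            if node in seen:
                continue
            seen.add(node)
            for neighbor, latency in graph.get(node, []):
                if neighbor not in seen:
                    heapq.heappush(frontier,
                                   (cost + latency, neighbor, path + [neighbor]))
        return float("inf"), []

The failure mode is exactly the one described above: when the real cost of a hop (congestion, unreliability, adversarial peers) diverges from its measured latency, every node independently optimizing this proxy can drive the aggregate network toward a pathological equilibrium.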


There are an increasing number of problem and application spaces that are very pathological on decentralized topologies optimized with conventional MCP models. This is a big theoretical concern right now, both because conventional networks are diverging from the robust assumptions that make the MCP model valid and because there are organizations that want to build decentralized systems that, as we know up front, cannot be built in a way that keeps them from collapsing into an expensively pathological morass. Most of the basic computer science literature on decentralized systems limits itself to models that assume either a naive MCP strategy or partially centralized control to maintain optimality.

I cannot tell you how to solve the problem, as I am not aware of a good solution other than strictly centralizing key pieces of metadata control, which imposes its own serious issues. One of the benefits of tightly controlled cloud/cluster models (e.g. Google) is that you can impose optimizing constraints that are utterly impractical or impossible to impose on the wild and woolly internet at large.

This is a problem I have spent a lot of time on, but other than developing a really deep appreciation of the theory surrounding pervasively decentralized topology protocols that do not slowly destroy themselves in the wild, I cannot say that I have made any progress on it. It is a broad theoretical problem that is kind of sneaking up on us, and I suspect that over the long run we'll use one of the less-than-ideal modifications of MCP (like using a central authority) to deal with it.

Like I said, I don't have a good solution for you that will allow things to scale the way you want; I mostly wanted to point out that these issues have not been addressed appropriately for the scale being discussed. Interestingly, it seems that hardly anyone who works on decentralized systems design is aware of these issues -- it is the domain of theoretical mathematicians and routing geeks. For small systems it really doesn't matter, which may be part of the reason why.


For your purposes in the broadest sense, things like kD-trees
will drop dead for pretty trivial systems, never mind for something
ambitious.  On the other hand, generalized distribution with O(n)
storage complexity was solved last year which may or may not address
your issues.

Storage is O(n) using an organizational tree, but that would not be
robust. I think the O(log n) factor is not a big penalty. You want to
have more pointers per peer as the network grows, and you want to have
more cached and backup copies of messages.


No existing spatial index that is general (i.e. not restricted to a set of data with carefully tailored characteristics) will distribute beyond a few dozen nodes, and it gets worse fast when you add dimensions. The new solution I mention above is fully general and scales extremely well in a distributed, decentralized environment, which seemed to be the issue that needed solving.

The exception to this is if your spatial index is built once and never modified (i.e. read-only), in which case it is possible to have one that is general and scales to a modest size. It's a real mess, and scalability is something like a five- or six-axis space when talking about these kinds of algorithms.
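For the read-only case, here is a minimal sketch (illustrative only, and not the generalized solution I mentioned) of why build-once helps: the structure is bulk-built balanced and then only queried, so replicas can be distributed without any of the rebalancing traffic that kills the dynamic case. The function names and dictionary layout are hypothetical.

    import math

    def build_kdtree(points, depth=0):
        """Bulk-build a balanced kD-tree by median split; never modified afterward."""
        if not points:
            return None
        axis = depth % len(points[0])
        points = sorted(points, key=lambda p: p[axis])
        mid = len(points) // 2
        return {"point": points[mid], "axis": axis,
                "left": build_kdtree(points[:mid], depth + 1),
                "right": build_kdtree(points[mid + 1:], depth + 1)}

    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    def nearest(node, target, best=None):
        """Nearest-neighbor descent; read-only, so safe against any replica."""
        if node is None:
            return best
        point, axis = node["point"], node["axis"]
        if best is None or dist(point, target) < dist(best, target):
            best = point
        near, far = ((node["left"], node["right"])
                     if target[axis] < point[axis]
                     else (node["right"], node["left"]))
        best = nearest(near, target, best)
        # Cross the splitting plane only if a closer point could be on the far side.
        if abs(target[axis] - point[axis]) < dist(best, target):
            best = nearest(far, target, best)
        return best

    tree = build_kdtree([(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)])
    print(nearest(tree, (9, 2)))   # -> (8, 1)

Nothing in the query path mutates the tree, which is what makes distributing it tractable; the moment you allow inserts and deletes, the rebalancing and cache-invalidation traffic is where the mess starts, and it gets worse quickly as you add dimensions.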

J. Andrew Rogers
