On Dec 26, 2008, at 7:24 PM, Philip Hunt wrote:
> 2008/12/27 J. Andrew Rogers <and...@ceruleansystems.com>:
>> I think many people greatly underestimate how many gaping algorithm
>> holes there are in computer science for even the most important and
>> mundane tasks. The algorithm coverage of computer science is
>> woefully incomplete,
> Is it? In all my time as a programmer, it's never occurred to me to
> think "I wish there was an algorithm to do X". Maybe that's just me.
> And there are vast numbers of useful algorithms that people use every
> day.
Computers are general, so there always exists an obvious algorithm for
doing any particular task. Whether or not that obvious algorithm is
efficient is quite another thing, since the real costs of various
algorithms are far from equivalent even if their functionality is.
The Sieve of Eratosthenes will let you factor any integer in theory,
but for non-trivial integers you will want to use a number field
sieve. The limitations of many types of software are fundamentally
rooted in the complexity class of the algorithms they use. We
frequently conflate "theoretically impossible" with "no tractable
algorithm currently exists".
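To make the tractability point concrete, here is a minimal sketch of my own (not from the thread): the sieve enumerates primes, and trial division against them factors any integer correctly, yet the cost grows exponentially in the bit length of the input.

```python
def sieve(n):
    """Sieve of Eratosthenes: all primes <= n."""
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    for p in range(2, int(n ** 0.5) + 1):
        if is_prime[p]:
            for m in range(p * p, n + 1, p):
                is_prime[m] = False
    return [p for p in range(2, n + 1) if is_prime[p]]

def factor(n):
    """Factor n by trial division against sieved primes.
    Correct for any n, but the work scales with sqrt(n), i.e.
    exponentially in the bit length -- 'obvious' is not 'tractable'."""
    factors = []
    for p in sieve(int(n ** 0.5) + 1):
        while n % p == 0:
            factors.append(p)
            n //= p
    if n > 1:
        factors.append(n)  # whatever remains is prime
    return factors
```

A number field sieve replaces this brute force with sub-exponential running time, which is exactly the kind of gap between the obvious algorithm and the tractable one the paragraph above describes.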
> I wonder (thinking out loud here) are there any statistics for this?
> For example if you plot the number of such algorithms that've been
> found over time, what sort of curve would you get? (Of course, you'd
> have to define "general, elegant algorithm for basic problem", which
> might be tricky)
I am still surprised often enough that it is obvious a considerable
amount of innovation is still being done. It both amuses and annoys
me no end that some common algorithms have design characteristics
reflecting long-forgotten assumptions that no longer make sense in
the contexts where they are used, e.g. the compulsive tree-balancing
behavior of intrinsically unbalanced data structures.
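The balancing habit does come from a real pathology, which makes the point about workload assumptions easy to illustrate. A toy example of mine (not from the thread): a plain unbalanced binary search tree degenerates into a linked list under sorted input, yet stays shallow under random input with no rebalancing at all -- whether balancing machinery earns its keep depends entirely on the insertion pattern.

```python
import random

class Node:
    __slots__ = ("key", "left", "right")
    def __init__(self, key):
        self.key, self.left, self.right = key, None, None

def insert(root, key):
    # Plain unbalanced BST insert, iterative to avoid recursion limits.
    if root is None:
        return Node(key)
    node = root
    while True:
        if key < node.key:
            if node.left is None:
                node.left = Node(key)
                break
            node = node.left
        else:
            if node.right is None:
                node.right = Node(key)
                break
            node = node.right
    return root

def depth(root):
    # Iterative max-depth computation.
    best, stack = 0, ([(root, 1)] if root else [])
    while stack:
        node, d = stack.pop()
        best = max(best, d)
        if node.left:
            stack.append((node.left, d + 1))
        if node.right:
            stack.append((node.right, d + 1))
    return best

# Sorted (e.g. append-only) input turns the tree into a list of depth n;
# random input keeps it around O(log n) deep with no balancing at all.
sorted_tree = None
for k in range(500):
    sorted_tree = insert(sorted_tree, k)

random.seed(1)
keys = list(range(500))
random.shuffle(keys)
random_tree = None
for k in keys:
    random_tree = insert(random_tree, k)
```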
>> In short, we have no idea what important and fundamental algorithms
>> will be discovered from one year to the next that change the
>> boundaries of what is practically possible with computer science.
> Is this true? It doesn't seem right to me. AIUI the current state of
> the art in operating systems, compilers, garbage collectors, etc is
> only slightly more efficient than it was 10 or 20 years ago. (In fact,
> most practical programs are a good deal less efficient, because faster
> processors mean they don't have to be).
It is easy to forget how many basic algorithms we use ubiquitously are
relatively recent. The concurrent B-tree algorithm that is
pervasively used in databases, file systems, and just about everything
else was published in the 1980s. In fact, most of the algorithms that
make up a modern SQL database as we understand them were developed in
the 1980s, even though the relational model goes back to the 1960s.
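The concurrent B-tree algorithm mentioned here is, as far as I can tell, the Lehman–Yao B-link tree (1981) -- that attribution is mine, not the author's. Its central trick is small: every node carries a high key and a right-sibling link, so a reader that lands on a node mid-split simply chases the link instead of locking the parent or restarting from the root. A leaf-only sketch of the search rule (mine, hypothetical names):

```python
class BLinkNode:
    """Leaf-only sketch: sorted keys, a high key, and a right-sibling link."""
    def __init__(self, keys, high_key, right=None):
        self.keys = keys          # sorted keys stored in this node
        self.high_key = high_key  # upper bound of this node's key range
        self.right = right        # right sibling (the 'link')

def blink_search(node, key):
    # The B-link rule: if the key exceeds the node's high key, a
    # concurrent split must have moved it to the right -- chase the
    # sibling link instead of holding locks up the tree.
    while node.high_key is not None and key > node.high_key:
        node = node.right
    return key in node.keys
```

The reason this matters for databases is that it lets readers proceed with essentially no lock coupling, which is why variants of it are still the default B-tree design in production systems.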
> I don't think I understand you. To me "indexing" means what the Google
> search engine or an SQL database does -- but you're using the word
> with a different meaning aren't you?
I mean it exactly like you understand it. Indexed access methods and
representations.
> Sorry, you've lost me again -- I've never heard of the term
> "hyper-rectangles" in relation to relational databases.
Most people haven't, because there are no hyper-rectangles in
relational database *implementations* seeing as how there are no
useful algorithms for representing them. Nonetheless, the underlying
model describes operations using hyper-rectangles in high-dimensional
spaces.
In an ideal relational implementation there are never external
indexes, only data organized in its native high-dimensionality logical
space, since external indexes are a de-normalization.
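The hyper-rectangle view can be made concrete with a small illustration of my own (not from the thread): a conjunction of range predicates over k columns is an axis-aligned box in k dimensions, evaluating the predicate against a row is point-in-box, and two such predicates can both match some row iff their boxes intersect.

```python
# A box is a list of per-dimension (lo, hi) intervals.

def contains(box, point):
    """A conjunctive range predicate, e.g.
    WHERE 10 <= age AND age <= 30 AND 50 <= salary AND salary <= 90,
    is membership in an axis-aligned hyper-rectangle."""
    return all(lo <= x <= hi for (lo, hi), x in zip(box, point))

def intersects(a, b):
    """Two range predicates overlap iff their boxes overlap
    in every dimension."""
    return all(alo <= bhi and blo <= ahi
               for (alo, ahi), (blo, bhi) in zip(a, b))
```

An index that answered `intersects` queries directly over such boxes, in any number of dimensions, is the "general algorithm for indexing hyper-rectangles" the thread is discussing.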
>> It is not because it is theoretically impossible, but because it is
>> only possible if someone discovers a general algorithm for indexing
>> hyper-rectangles -- faking it is not distributable.
> How do we know that there is such an algorithm?
We don't, unless someone publishes one, but there is a lot of evidence
suggesting that one exists, and that evidence also shows that much of
the research done so far was misdirected. Aesthetically, the current
algorithms for doing this are nasty ugly hacks, and that lack of
elegance is often an indicator that a better way exists.
In the specific case of indexing hyper-rectangles, the first basic
algorithm was published in 1971 (IIRC), but it was supplanted by a
completely different family of algorithms in 1981. Virtually all
research since has been based on derivatives of the 1981 algorithm,
since it appeared to have better properties. Unfortunately, we can now
prove that this algorithm class can never yield a general solution,
and that a solution must look like a variant of the original 1971
algorithm family that has been ignored for a quarter century.
Interestingly, the proof of this comes by way of the recent explosion
of research on massively concurrent data structures driven by the
proliferation of multi-core CPUs. Applied to this particular problem,
those results show both that we were doing it wrong and give a lot of
hints as to how to do it right, suggesting a completely novel type of
algorithm that has never been tried.
You have to understand that until a couple years ago, no one was doing
useful research on this particular problem despite the fact that many,
many people had done endless quantities of research trying to solve it
down an apparent dead-end path. Computer science is full of this kind
of thing, and the elegant solutions only look obvious in hindsight.
Cheers,
J. Andrew Rogers
-------------------------------------------
agi
Archives: https://www.listbox.com/member/archive/303/=now
RSS Feed: https://www.listbox.com/member/archive/rss/303/