--- On Fri, 12/26/08, Philip Hunt <cabala...@googlemail.com> wrote: > > Humans are very good at predicting sequences of > > symbols, e.g. the next word in a text stream. > > Why not have that as your problem domain, instead of text > compression?
That's the same thing, isn't it? > While you're at it you may want to change the size of the "chunks" in > each item of prediction, from characters to either strings or > s-expressions. Though doing so doesn't fundamentally alter the > problem. In the generic test, the fundamental units are bits. It's not entirely suitable for most existing compressors, which tend to be byte oriented. But they are only byte oriented because a lot of data is structured that way. In general, it doesn't need to be. -- Matt Mahoney, matmaho...@yahoo.com ------------------------------------------- agi Archives: https://www.listbox.com/member/archive/303/=now RSS Feed: https://www.listbox.com/member/archive/rss/303/ Modify Your Subscription: https://www.listbox.com/member/?member_id=8660244&id_secret=123753653-47f84b Powered by Listbox: http://www.listbox.com