--- On Fri, 12/26/08, Philip Hunt <cabala...@googlemail.com> wrote:

> > Humans are very good at predicting sequences of
> > symbols, e.g. the next word in a text stream.
> 
> Why not have that as your problem domain, instead of text
> compression?

That's the same thing, isn't it?

> While you're at it you may want to change the size of the "chunks" in
> each item of prediction, from characters to either strings or
> s-expressions. Though doing so doesn't fundamentally alter the
> problem.

In the generic test, the fundamental units are bits. It's not entirely suitable 
for most existing compressors, which tend to be byte oriented. But they are 
only byte oriented because a lot of data is structured that way. In general, it 
doesn't need to be.

-- Matt Mahoney, matmaho...@yahoo.com



-------------------------------------------
agi
Archives: https://www.listbox.com/member/archive/303/=now
RSS Feed: https://www.listbox.com/member/archive/rss/303/
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=8660244&id_secret=123753653-47f84b
Powered by Listbox: http://www.listbox.com

Reply via email to