A new corpus suggestion: Google's One Billion Word Benchmark. The idea would be to get people to stop using the misleading model selection criterion of perplexity and start to realize the principled generality of lossless compression.
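To make the contrast concrete, here is a minimal sketch of how perplexity and compressed size relate. It assumes a toy adaptive order-0 byte model with add-one smoothing, chosen only for illustration; it is not the benchmark's method, and the function names are hypothetical. Perplexity is just 2 raised to the bits per symbol, so it carries the same information as the ideal code length, but it charges nothing for the model itself. A compression score can add that cost.

```python
import math
from collections import Counter


def ideal_code_length_bits(data: bytes, alphabet_size: int = 256) -> float:
    """Ideal arithmetic-coding cost, in bits, of `data` under an adaptive
    order-0 byte model with add-one smoothing (illustrative model only)."""
    counts = Counter()
    seen = 0
    bits = 0.0
    for b in data:
        # Probability assigned to this byte before the model has seen it.
        p = (counts[b] + 1) / (seen + alphabet_size)
        bits += -math.log2(p)
        # Update the model only after coding, so a decoder can mirror it.
        counts[b] += 1
        seen += 1
    return bits


def perplexity(bits: float, n_symbols: int) -> float:
    """Perplexity per symbol: 2 ** (average bits per symbol)."""
    return 2 ** (bits / n_symbols)


def total_description_bits(code_bits: float, model_bytes: int) -> float:
    """Compression-style score: coded data plus the size of the model
    (or decompressor) needed to decode it. `model_bytes` is an assumed input."""
    return code_bits + 8 * model_bytes


data = b"the quick brown fox jumps over the lazy dog " * 50
bits = ideal_code_length_bits(data)
ppl = perplexity(bits, len(data))
print(f"{bits:.0f} bits, {bits / len(data):.3f} bits/byte, perplexity {ppl:.2f}")
```

Two models with the same perplexity get different `total_description_bits` once their own sizes are counted, which is the part of the picture a perplexity figure leaves out.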
I'm really surprised and even dismayed at how much of an uphill battle this has been. It's like people KNOW that they don't want to know the unbiased truth when it is being handed to them on a silver platter. Yes, I know people don't want to know the truth but what surprised me is the degree to which they exhibit intent at a high enough level that they must invest cognitive resources to suppress self-knowledge of their meta-mendacity.

On Wed, Jan 22, 2020 at 12:11 PM Matt Mahoney <[email protected]> wrote:

> On Tue, Jan 21, 2020, 12:45 PM <[email protected]> wrote:
>
>> On Tuesday, January 21, 2020, at 2:38 PM, Matt Mahoney wrote:
>>
>> create all possible archives starting with the smallest
>>
>> Brute Force? Makes no sense but you get 1st place for trying!
>
> I get the prize for simplest description, not for size or speed.

------------------------------------------
Artificial General Intelligence List: AGI
Permalink: https://agi.topicbox.com/groups/agi/T65747f0622d5047f-Mbd5b91fe71ec2647e7624a31
Delivery options: https://agi.topicbox.com/groups/agi/subscription
