My Lucene OpenNLP patch (LUCENE-2899) contains small test data sets that exist only to support unit tests. The patch trains MaxEnt models on this test data and then uses the resulting .bin files to run simple unit tests. These data sets are completely bogus; they exist only to demonstrate a complete round trip.

The chunker output changed slightly from 1.5.2-incubating to 1.5.3. Was this expected? Did some change in MaxEnt cause the generated models to change? If so, that's fine, as long as someone expected it. But it does mean that the old models on SourceForge may be slightly wrong.

Lance

