2011/7/19 Olivier Grisel <[email protected]>: > Maybe we could work on extending the DataFileProvider to make the > defaultdata provider only provide download URLs from the existing gray > licensed opennlp 1.5 models from > http://opennlp.sourceforge.net/models-1.5/ and let the > DataFileProvider download them from there automatically the first time > they are required. The issue then is that every integration tests job > will re-download the same data from sourceforge over and over again... > That will slowdown the builds / tests and waste bandwith for nothing + > add a new way for the builds and test to fail (dependency on the > network / sourceforge availability).
I think having the OpenNLP models in our trunk and use them during development in incubation is no problem. So we don't need to change anything for build and integration tests right now. I would propose to exclude the models when a release is made. In this case the OpenNLP engine has to be prepared to recognize that the required model is missing and download it from Sourceforge. If the model is not missing as during development in our trunk everything is fine. -- Fabian
