That all makes sense to me. Voting for the models does seem prudent for the 
reasons you mention — these are sort of new problems to have and I don’t think 
it’s very well established what the right answer is so validating consensus 
even if not obligated makes sense.

-joey

> On Mar 16, 2021, at 4:08 AM, Jeff Zemerick <jzemer...@apache.org> wrote:
> 
> Joey, thanks for the comments/questions.
> 
> - The models can be named along with the 1.9.3 version to show that's what
> trained it but we should be careful to not give the impression that the
> models *only* work with that version. I think that can be made sufficiently
> clear in the documentation.
> - My opinion is that it would be best if the models were not tied to the
> OpenNLP lifecycle. I would like for the project to be able to release new
> models independently of OpenNLP releases. So I hope we can do the latter
> and train the models from an official OpenNLP release, vote, and publish.
> - I feel that the models fall somewhere in between being a direct binary
> artifact and something more derivative like a Docker container because a
> model needs to be evaluated prior to being made available. Contrast that
> with a docker container which either works or doesn't. More things (how it
> was trained, performance, etc.) should be considered when voting on a model
> release than just if it works or not.
> 
> From that page https://incubator.apache.org/guides/distribution.html:
> 
> - Convenience binaries must be made from IPMC approved ASF releases.
> - Convenience binaries need to follow licensing policy and not include any
> category X licensed software.
> - Convenience binaries should be signed and have hashes to verify their
> contents.
> 
> I think we are ok with those 3 things. I will update the naming of the
> models as Joey suggested (to include the OpenNLP version that created them)
> and update the README to explain 1.9.3 is the version that created them but
> should work with all OpenNLP versions (but only tested with 1.9.3).
> 
> Are there any concerns about the model release process given my responses
> to your questions?
> 
> Thanks,
> Jeff
> 
> 
> 
> 
>> On Mon, Mar 15, 2021 at 7:49 PM Joey Frazee <joey.fra...@icloud.com.invalid>
>> wrote:
>> 
>> Jeff, in the other thread you mentioned “I personally have been thinking
>> of the models as convenience binaries”.
>> 
>> I think that’s the most obvious answer and is what I’d think too.
>> 
>> If that’s the case, then the policies suggest that the version needs to
>> match the version they’re created from. So should this be something like
>> opennlp-ud-models-1.0-1.9.3 or similar?
>> 
>> The other thing, which is murky in practice, is that do the models need to
>> be voted on concurrently with a release or just created by the PMC from an
>> official release and published on Apache supported infrastructure?
>> 
>> Direct binary artifacts are almost always evaluated at the time of a
>> release vote but more derivative ones often aren’t. E.g., a lot of projects
>> publish Docker images from approved releases but not with an independent
>> vote. Which are these?
>> 
>> Incubator recently published some helpful guidelines which clarify related
>> stuff for the podlings:
>> 
>> https://incubator.apache.org/guides/distribution.html
>> 
>> -joey
>> 
>>>> On Mar 15, 2021, at 3:26 PM, Jeff Zemerick <jzemer...@apache.org> wrote:
>>> 
>>> Before starting a second release vote thread for the OpenNLP models and
>>> since this is the first release of pretrained OpenNLP models, I would
>> like
>>> to pause to solicit feedback from the community in regards to the release
>>> configuration.
>>> 
>>> - The files are staged on the ASF dev SVN at
>>> https://dist.apache.org/repos/dist/dev/opennlp/ud-models-1.0/.
>>> - The model files are signed and hashed.
>>> - Includes README, CHANGES, NOTICE, and LICENSE files.
>>> - The training and evaluation outputs are in the training-eval-logs.zip
>>> file (also signed and hashed).
>>> 
>>> Please let me know if anything is missing or should be changed. Once
>> things
>>> are in a good state I will make a PR to document the steps on the website
>>> (OPENNLP-1328) and start a vote thread.
>>> 
>>> ASF Release Creation Process:
>>> https://infra.apache.org/release-publishing.html
>>> 
>>> Thanks,
>>> Jeff
>> 

Reply via email to